Google Sitemaps How to Part 1: What is It?
Posted by Hendry Lee on 07/21/05 in Google, SEO Tools
Google invented Google Sitemap Protocol (GSP) to help webmasters and publishers get their sites indexed more accurately by the biggest search engine on the planet. This article is part of a series of article intended to show you how to create Google XML Sitemaps — some would call it RSS Sitemap — for free.
What Google Sitemaps exactly is? According to Google:
Google Sitemaps is an experiment in web crawling. Using Sitemaps to inform and direct our crawlers, we hope to expand our coverage of the web and and speed up the discovery and addition of pages to our index. By placing a Sitemap-formatted file on your web server, you enable our crawlers to find out what pages are present and which have recently changed, and to crawl your site accordingly.
If you have Google spider come to your web site on a regular basis, ignore this article. But if your site is content rich, consists of quality content or dynamic web pages that change often and Google fail to crawl them fully, you may try Google Sitemaps.
Google Sitemaps use the Sitemap Protocol, which is based on XML for summarizing information for Google web crawler. In this XML file, you can add information such as last modified date of the page and approximate change frequency.
Many marketers hype up the technology by saying this method ensures Google spider crawls your web pages fully and quickly. They are simply wrong. It’s true, Google created this for a purpose but as stated in the Frequently Asked Questions (FAQ), Google doesn’t by any mean guarantee that all of your pages will be crawled once you put them into your Sitemap.
Of course, it is absolutely worth the effort to try this. It’s been working for some publishers and it might work for you too. Moreover, there’s nothing to lose since both personal and commercial use is free anyway.
Basically there are three steps required to create and maintain Google Sitemaps:
- Creating a Sitemap in a supported format
- Submitting the Sitemap file to Google
- Updating the Sitemap file when your site changes
Besides the Sitemap Protocol XML file format, Google supports three other formats:
- OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) - This is an application-independent interoprability framework based on metadata harvesting. If your site doesn’t use this protocol then forget about what it is.
- Syndication feed - If your site is operated using weblog software like WordPress or MovableType for content management, Google accepts RSS (Really Simple Syndication) 2.0 and Atom 0.3 feeds. The drawback of using this format is that you only can list a few URLs which are the most recent web pages (posts).
- Text file - List one URL per line and save as plain text file. This format is acceptable, although not recommended. Google seems to give lower priority to text based Sitemaps compared to XML formatted Sitemaps. You can generate XML Sitemap from a text sitemap file properly by using Sitemap Generator.
That’s it for now. I hope you get a basic understanding about this new technology. In the next part, I’ll walk you through the process of manually creating a valid XML Sitemap file using text editor. If you have small site with less than a hundred web pages, this come handy without the need to install any program or script on your web server.
| permalink | trackback url | comment(1)
