Free Web Hosting by Netfirms
Web Hosting by Netfirms | Free Domain Names by Netfirms

Don Pedro's®          Capt. Peter - web design

XML Sitemaps

In the beginning of 2006 Google introduced their XML Sitemaps.
In November the same year Google, Yahoo!, and Microsoft agreed to
use these sitemaps together.

Site Goldaward - Pakistani Maritime  International Association of Webmasters and Designers

Site Gold Awards for Excellence on the Web in 2004
Classification: Maritime, Marine, and Boating

VERSION 08.0
Last up-dated: Dec. 23, 2008

At the bottom of the page, there is
a link to a print ready version.
What are the XML Sitemaps ?
Video Sitemaps (Added in this version)
Google
Yahoo, Microsoft, and Ask
What's Autodiscovery ?
How can a Webmaster or Designer Use These?
XML Sitemaps on Free Servers (Added in this version)
This page is best in any browser
Check Menu
Please note: Every link on this page opens in a new window. If your "Pop-up killer" is too efficient it can also stop new windows. When this happens, please press "Ctrl" and click on the link you want.

What are the XML Sitemaps ?

The intention with the sitemap is to make three parties happy:
  • The Search Engines; by saving time for their spiders when searching the Net and downloading pages.

  • Server owners; when the spiders download only updated pages less bandwidth is used on the server, which means money saved.

  • Web site owner and/or Web Master; who gets updated pages indexed faster.
The Web Master can influence the spiders by giving "priority" for each page as well as date and time when each page has last been updated.

"Priority" number, between 0.0 - 1.0, is website specific and have no meaning if compared with webpages on another site. It's intended to give an indication of the Web Master's feeling of the relative importance of each page on his/her site only, default value is 0.5.

Additionally you can also give the time when pages last up-dated. The time is given in GMT (UTC) plus / minus your time difference. Plus (+) is East and minus (-) is West. See Don Pedro's Time Zone Explanation.

There are two kinds of site maps, compressed or non-compressed (maximum size 10 MB). If your site have only about 50-100 pages and you update may be only 10-15 pages per month or less, the non-compressed version is enough. This you can easily handle manually in just a few minutes per month only. The compressed version is very difficult to read and change manually.

You get either a compressed or a non-compressed sitemap at:
Copy the code and paste it into Notepad, save in your root directory, i.e. "www.example.com/sitemap.xml" :
  • Save as: sitemap.xml
  • File type: All files
  • Encoding: UTF-8
Get full description from Sitemaps Org.. The beginning of the sitemap looks like this:
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      <loc>http://www.example.com/</loc>
      <lastmod>2005-01-01</lastmod>
      <changefreq>monthly</changefreq>
      <priority>0.8</priority>
   </url>
</urlset>
If you want to validate the sitemap, you add temporarily a couple of lines of code at the beginning between the first line and "urlset", see the "protocol" for advice. When you want to validate the xml sitemap, go to list of validators at xml.com.

If you don't find a validator you like and you cannot find the error Google Webmasters tells you they have found - generate a new one and upload that one. Because these are automated programs there can be for humans "invisible" errors. Before you upload a new xml sitemap, check your current "faulty" one is in UTF-8 format.

Video Sitemaps

In December 2007 Google added a video specific sitemap. It follows the standard xml sitemap protocol but the format is slightly different.

You add it to Google from webmaster tools exactly same way as the standard xml sitemap and you keep it in your server's site folder together with your standard xml sitemap.

Google

Once you have uploaded the sitemap on your server, initially you add your xml sitemap to Google's Webmaster's Tools, i.e. Webmaster's "console". Follow instructions on Google's pages.

Once Google Bot has picked up the sitemap their spider will continue visiting and checking on up-dates. The more incoming links you have, the more often the spider will visit.

Full Description of Google's XML sitemap. See also Google's Sitemaps FAQ.

End of December 2008 Google informed the sitemaps can be submitted without specifying file type. See list of kinds of sitemap files Google supports.

Yahoo, Microsoft, and Ask

You can submit your xml sitemap to Yahoo! Search through Site Explorer. Use "Submit Site Feed" and treat your xml sitemap as a RSS feed, i.e. you type in the full URL of your xml sitemap.

As of end of July 2007 MSN - Live Search started to read xml sitemaps. In the same blog it's said they will also honour autodiscovery, see below. End of November 2007 MSN-Live Search opened Live Search Webmaster Tools ( Beta ), where you get reports on crawling and indexing problems, submit sitemaps, and view statistics. And of course, Sign Up for the service.

With Ask.com you can use the "ping" at this URL : http://submissions.ask.com/ping?sitemap=http://www.the URL of your sitemap here.xml Starting April 2007 Ask.com supports sitemaps in the robots.txt file.

What's Auto discovery ?

In April 2007 the four major search engines (including Ask.com) agreed on something they call Sitemaps Autodiscovery. Because all search engines are supposed to read the robots.txt file they agreed it would be reasonable to add the location of a xml sitemap in the robots.txt file.

According to the new protocol you can add this anywhere in the robots file. I have added it as a first line before the first "Disallow". You write it like this:
Sitemap: http://www.example.com/sitemap.xml

Immediately followed on the next line by the rest of the file. Leave one empty space after the ":", same as after Disallow.

How can a Webmaster or Designer Use These?

This is intended for those who are site owners, web masters and/or designers for their own websites. Big companies have their teams organized and should be very well prepared to do all this.

If you try to get out of the Sandbox by pinging Google with the xml sitemap - be prepared it has no effect at all. I would suggest the best time to ping Google is just after your site has "come out" and then combine it with publishing a new page. May be in memory of a successful graduation. Of course you have to follow your web site traffic data continuously to know when Google starts sending traffic to you again.

Once you have "pinged" Google with the sitemap and you think the indexing of especially new pages is slow or non-existent, you can check if the spider have any problems with your code, your server, or something else. I would suggest you include your robots.txt file in the xml sitemap.

XML Sitemaps on Free Servers

Your XML sitemap must be according to the required format and error free to be of any use. Even one small printing error at the very end renders the whole document unreadable. Because it's XML.

Free servers add an advertisement banner on top - and sometimes also at the bottom - of every page. They add this also to the sitemap.xml.

Because of this the formatting isn't recognised by the search engine spiders and they cannot read it. In other words, it's useless. It means you cannot use a xml sitemap if your website is on a free server.

And if it would be possible to treat the xml sitemap differently from other pages and exclude the advertisement banners, you could be suspected of cloaking. Cloaking is defined as what a website author does to return different content to search engines compared with what human visitors get.
Home   -   Site Map

Website Design Services

Free Backgrounds

Free Internet Pictures

Website Design Handbook

Computer Viruses and Worms

How to Choose Website Colours

How to Change my Pictures and Photos

Webpage Optimization

Website Promotion

Search Engine Marketing

List of Search Engines



Search this site

powered by FreeFind
Return to TOP
Related pages:
| Search Engine Marketing | Website Promotion |
| What is Google's Sandbox ? |

VERSION 07.1

Get version (3 pages)

© by Capt. Peter Forsberg.
You are allowed to print out the text for your personal needs.
You are also allowed to copy and distribute the printout for educational purposes when free of charge,
as long as you give the source: www.donpedrowebdesign.netfirms.com/xml-sitemaps.html.

Last updated:
Dec. 23, 2008

Visitor counter
since Dec. 12, 2006
according to: www.digits.com/

eXTReMe Tracker