News and Tutorials from Votre Codeur | SEO | Website Creation | Software Development

Seo Master presents:

History of Sitemap, Importance of Sitemap, Sitemap Format, Sitemap Submission Search Engines
The Sitemaps protocol tells search engines which URLs on a site are available for crawling. A Sitemap is an XML file, saved with the .xml extension, that lists the URLs of a website. It lets webmasters include information about each URL: when it was last updated, how often it changes, and how important it is relative to other URLs on the site.
History of Sitemap:
  • Google first introduced Sitemaps 0.84 in June 2005 so that anyone could publish lists of URLs.
  • Google, MSN and Yahoo announced joint support for the Sitemaps protocol in November 2006. The schema version was changed to "Sitemap 0.90", but no other changes were made.
  • In April 2007, Ask.com and IBM announced support for Sitemaps. Google, Yahoo and Microsoft also announced auto-discovery of Sitemaps through robots.txt.
  • In May 2007, the state governments of Arizona, California, Utah and Virginia announced they would use Sitemaps on their web sites.
Sitemap Format:
The Sitemap Protocol format consists of XML tags. The file itself must be UTF-8 encoded.
Sample
<?xml version='1.0' encoding='UTF-8'?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9
                            http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd">
  <url>
    <loc>http://online-seo-information.blogspot.com/</loc>
    <lastmod>2006-11-18</lastmod>
    <changefreq>daily</changefreq>
    <priority>0.8</priority>
  </url>
</urlset>
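A file like the sample above can also be generated programmatically. The following is a minimal sketch using Python's standard xml.etree.ElementTree module; the function name build_sitemap and the dictionary keys are illustrative choices, not part of the Sitemaps protocol, and the optional xsi:schemaLocation attribute is left out for brevity.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Return a Sitemap XML string for a list of URL dictionaries.

    Each dictionary needs a "loc" key; "lastmod", "changefreq" and
    "priority" are optional, as in the sample above.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for entry in entries:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = entry["loc"]
        for tag in ("lastmod", "changefreq", "priority"):
            if tag in entry:
                ET.SubElement(url, tag).text = str(entry[tag])
    body = ET.tostring(urlset, encoding="unicode")
    return "<?xml version='1.0' encoding='UTF-8'?>\n" + body


sitemap_xml = build_sitemap([
    {
        "loc": "http://online-seo-information.blogspot.com/",
        "lastmod": "2006-11-18",
        "changefreq": "daily",
        "priority": 0.8,
    }
])
print(sitemap_xml)
```

When writing the result to disk, encode it as UTF-8, since the protocol requires it.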
The following table lists the Sitemap submission URLs for several major search engines:
Search engine | Submission URL | Help page
Google | http://www.google.com/webmasters/tools/ping?sitemap= | How do I resubmit my Sitemap once it has changed?
Yahoo! | http://search.yahooapis.com/SiteExplorerService/V1/updateNotification?appid=SitemapWriter&url= | Does Yahoo! support Sitemaps?
Yahoo! | http://search.yahooapis.com/SiteExplorerService/V1/ping?sitemap= | Does Yahoo! support Sitemaps?
Ask.com | http://submissions.ask.com/ping?sitemap= | Q: Does Ask.com support sitemaps?
Live Search | http://webmaster.live.com/ping.aspx?siteMap= | Webmaster Tools (beta)
Yandex | (none listed) | Sitemaps files
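Each submission URL above ends with an empty query parameter that takes the address of the Sitemap. The sketch below shows one way to build the full ping URL in Python. The endpoint strings are copied from the table and may no longer be live, the dictionary keys and the example.com address are placeholders, and the code only builds the URL without sending any request.

```python
from urllib.parse import quote

# Endpoints as listed in the table above; the short keys are illustrative.
PING_ENDPOINTS = {
    "google": "http://www.google.com/webmasters/tools/ping?sitemap=",
    "ask": "http://submissions.ask.com/ping?sitemap=",
    "live": "http://webmaster.live.com/ping.aspx?siteMap=",
}


def build_ping_url(engine, sitemap_url):
    """Append the percent-encoded Sitemap address to the engine's ping URL."""
    return PING_ENDPOINTS[engine] + quote(sitemap_url, safe="")


ping_url = build_ping_url("google", "http://www.example.com/sitemap.xml")
print(ping_url)
```

Percent-encoding the Sitemap address keeps its own ":" and "/" characters from being confused with those of the ping URL.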
Sitemap limits:
A Sitemap file is limited to 50,000 URLs and 10 megabytes. Sitemaps can be compressed with gzip. Multiple Sitemap files are supported, with a Sitemap index file serving as the entry point for up to 1,000 Sitemaps. The current sitemaps.org protocol raises the size limit to 50 MB uncompressed and allows up to 50,000 Sitemaps per index file.
2013, By: Seo Master
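A site with more than 50,000 URLs therefore has to split its list across several Sitemap files and point to them from a Sitemap index. The following Python sketch illustrates the idea; the helper names, the example.com addresses and the sitemapN.xml.gz naming scheme are assumptions made for the example.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_SITEMAP = 50000  # per-file URL limit quoted above


def chunk_urls(urls, size=MAX_URLS_PER_SITEMAP):
    """Split a list of URLs into lists of at most `size` items."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_sitemap_index(sitemap_urls):
    """Return a Sitemap index XML string pointing at each Sitemap file."""
    index = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for loc in sitemap_urls:
        sitemap = ET.SubElement(index, "sitemap")
        ET.SubElement(sitemap, "loc").text = loc
    return ET.tostring(index, encoding="unicode")


# Hypothetical site with 120,000 pages.
urls = ["http://www.example.com/page%d" % n for n in range(120000)]
chunks = chunk_urls(urls)
index_xml = build_sitemap_index(
    ["http://www.example.com/sitemap%d.xml.gz" % (i + 1) for i in range(len(chunks))]
)
print(len(chunks), [len(c) for c in chunks])
```

Each chunk would then be written out as its own Sitemap file, optionally gzip-compressed, and only the index would be submitted.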

Meta Robots Tags, Meta Robots All, Robots.txt, Robots META tag
robots.txt controls search engine spiders across a whole site, but sometimes a particular page should not be indexed, or its links should not be followed, by search engine robots. The Robots META tag, placed in the HTML <HEAD> section of a page, can specify either or both of these actions.
When a directive is not given, robots assume the default values INDEX, FOLLOW, ARCHIVE, ODP, SNIPPET and YDIR.
Examples:
<HEAD>
  <title>SEO Robots Meta Tag</title>
  <META name="robots" content="NOINDEX,NOFOLLOW" />
</HEAD>

<HEAD>
  <title>SEO Robots Meta Tag</title>
  <META name="googlebot" content="NOARCHIVE, NOODP, NOSNIPPET" />
  <META name="slurp" content="NOARCHIVE, NOYDIR, NOSNIPPET" />
</HEAD>
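To see how a crawler might read these tags, here is a minimal sketch built on Python's standard html.parser module. The class name RobotsMetaParser and the set of recognized robot names are assumptions made for illustration; real crawlers apply their own rules.

```python
from html.parser import HTMLParser

# Robot names used in the examples above; an illustrative list, not a complete one.
ROBOT_NAMES = {"robots", "googlebot", "slurp"}


class RobotsMetaParser(HTMLParser):
    """Collect the directives of Robots META tags, keyed by robot name."""

    def __init__(self):
        super().__init__()
        self.directives = {}

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "meta":
            return
        attr = dict(attrs)
        name = (attr.get("name") or "").lower()
        if name not in ROBOT_NAMES:
            return
        content = attr.get("content") or ""
        values = {v.strip().upper() for v in content.split(",") if v.strip()}
        self.directives.setdefault(name, set()).update(values)


parser = RobotsMetaParser()
parser.feed("""
<HEAD>
<title>SEO Robots Meta Tag</title>
<META name="googlebot" content="NOARCHIVE, NOODP, NOSNIPPET" />
<META name="slurp" content="NOARCHIVE, NOYDIR, NOSNIPPET" />
</HEAD>
""")
print(parser.directives)
```

Directive names are normalized to upper case so that "noindex" and "NOINDEX" are treated alike.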

Task | Entry
Indexer: ignore content; Robot: follow links | <META name="ROBOTS" content="NOINDEX">
Indexer: include content; Robot: do not follow links | <META name="ROBOTS" content="NOFOLLOW, INDEX">
Indexer: ignore content; Robot: do not follow links | <META name="ROBOTS" content="NOINDEX,NOFOLLOW">
Indexer: include content; Robot: follow links | <META name="ROBOTS" content="INDEX,FOLLOW">
Search results pages should not show the "cache" link. | <META name="ROBOTS" content="NOARCHIVE">
Search results pages should not display the Open Directory Project (ODP) title and description for the page. Danny Sullivan provides good examples of how outdated descriptions and even titles show up when the ODP content is used for search results. | <META name="ROBOTS" content="NOODP">
Search results pages should not display the Yahoo Directory title and description for the page. | <META name="ROBOTS" content="NOYDIR"> (Yahoo Slurp robot only)
Search results pages should not display any description or text context for this page. Title only, I guess. | <M
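The table can be read as a set of switches applied on top of the defaults. As a rough sketch, assuming the default values INDEX, FOLLOW, ARCHIVE, ODP, SNIPPET and YDIR and assuming every "NO" directive simply switches off its counterpart, the effective settings for a content attribute could be computed like this in Python (the function name is hypothetical):

```python
DEFAULTS = {"INDEX", "FOLLOW", "ARCHIVE", "ODP", "SNIPPET", "YDIR"}


def effective_directives(content):
    """Apply the values of a Robots META content attribute to the defaults."""
    effective = set(DEFAULTS)
    for raw in content.split(","):
        value = raw.strip().upper()
        if not value:
            continue
        if value.startswith("NO") and value[2:] in DEFAULTS:
            # NOINDEX switches off INDEX, NOODP switches off ODP, and so on.
            effective.discard(value[2:])
        elif value in DEFAULTS:
            effective.add(value)
        # Unknown values are ignored in this sketch.
    return effective


print(sorted(effective_directives("NOINDEX")))
print(sorted(effective_directives("NOFOLLOW, INDEX ")))
```

This shows why content="NOINDEX" alone still lets the robot follow links: FOLLOW stays at its default.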

Meta Robots Tags, About Robots.txt and Search Indexing Robots
Entry and meaning:

Entry:
    User-agent: *
    Disallow:
Meaning: Because nothing is disallowed, everything is allowed for every robot.

Entry:
    User-agent: mybot
    Disallow: /
Meaning: The mybot robot may not index anything, because the root path (/) is disallowed.

Entry:
    User-agent: *
    Allow: /
Meaning: For all user agents, allow.

Entry:
    User-agent: BadBot
    Allow: /About/robot-policy.html
    Disallow: /
Meaning: The BadBot robot can see the robot policy document, but nothing else. All other user-agents are by default allowed to see everything. This only protects a site if "BadBot" follows the directives in robots.txt.

Entry:
    User-agent: *
    Disallow: /cgi-bin/
    Disallow: /tmp/
    Disallow: /private
Meaning: In this example, all robots can visit the whole site, with the exception of the two directories mentioned and any path that starts with private at the host root directory, including items in privatedir/mystuff and the file privateer.html.

Entry:
    User-agent: BadBot
    Disallow: /

    User-agent: *
    Disallow: /*/private/*
Meaning: The blank line indicates a new "record" - a new user agent command. All other robots can see everything except any subdirectory named "private" (using the wildcard character).
Entry:
    User-agent: WeirdBot
    Disallow: /links/listing.html
    Disallow: /tmp/
    Disallow: /private/

    User-agent: *
    Allow: /
    Disallow: /temp*
    Allow: *temperature*
    Disallow: /private/
Meaning: This keeps the WeirdBot from visiting the listing page in the links directory, the tmp directory and the private directory. All other robots can see everything except the temp directories or files, but should crawl files and directories named "temperature", and should not crawl private directories. Note that the robots will use the longest matching string, so temps and temporary will match the Disallow, while temperatures will match the Allow.
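The longest-match behavior described above can be illustrated with a small Python sketch. It is a simplified model written for this example, not the matcher of any particular robot: "*" is treated as "any run of characters", patterns are matched from the start of the path, and the matching rule with the longest pattern wins. Python's standard urllib.robotparser does not handle wildcards this way, which is why the matching is spelled out by hand here.

```python
import re


def pattern_to_regex(pattern):
    """Turn a robots.txt path pattern with '*' wildcards into a regex."""
    parts = [re.escape(part) for part in pattern.split("*")]
    return re.compile(".*".join(parts))


def is_allowed(rules, path):
    """Return True if `path` may be crawled under `rules`.

    `rules` is a list of (directive, pattern) pairs, where directive is
    "allow" or "disallow". The longest matching pattern decides.
    """
    best = None
    for directive, pattern in rules:
        if not pattern:
            continue  # an empty Disallow restricts nothing
        if pattern_to_regex(pattern).match(path):
            if best is None or len(pattern) > len(best[1]):
                best = (directive, pattern)
    return best is None or best[0] == "allow"


# The "User-agent: *" record from the last entry above.
rules = [
    ("allow", "/"),
    ("disallow", "/temp*"),
    ("allow", "*temperature*"),
    ("disallow", "/private/"),
]
for path in ("/temps", "/temporary", "/temperatures", "/private/notes.html"):
    print(path, is_allowed(rules, path))
```

Because "*temperature*" is longer than "/temp*", the path /temperatures falls under the Allow even though it also matches the Disallow.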
Bad Examples - Common Wrong Entries
Use one of the robots.txt checkers to see if your file is malformed.

Entry:
    User-agent: googlebot
    Disallow /
Meaning: NO! This entry is missing the colon after Disallow.

Entry:
    User-agent: sidewiner
    Disallow: /tmp/
Meaning: NO! Robots will ignore misspelled User Agent names (it should be "sidewinder"). Check your server logs for the User Agent name, and consult the listings of User Agent names.

Entry:
    User-agent: MSNbot
    Disallow: /PRIVATE
Meaning: WARNING! Many robots and web servers are case-sensitive, so this path will not match any root-level folders named private or Private.

Entry:
    User-agent: *
    Disallow: /tmp/
    User-agent: Weirdbot
    Disallow: /links/listing.html
    Disallow: /tmp/
Meaning: Robots generally read from top to bottom and stop when they reach something that applies to them, so Weirdbot would probably stop at the first record, *. If there is a specific User Agent block, robots don't check the * (all user agents) block, so any general directives should be repeated in the specific blocks.
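Some of these mistakes can be caught automatically. The sketch below is a very small checker written in Python for illustration; the function name and the list of known fields are assumptions, and it only flags missing colons and unrecognized field names. It cannot tell that a User Agent name is misspelled or that a path has the wrong case.

```python
# Field names this sketch accepts; "sitemap" is included for Sitemap auto-discovery.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def lint_robots_txt(text):
    """Return a list of (line_number, problem) pairs for a robots.txt body."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        stripped = line.split("#", 1)[0].strip()  # drop comments
        if not stripped:
            continue  # blank lines separate records
        if ":" not in stripped:
            problems.append((number, "missing colon"))
            continue
        field = stripped.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            problems.append((number, "unknown field: " + field))
    return problems


print(lint_robots_txt("User-agent: googlebot\nDisallow /"))
print(lint_robots_txt("User-agent: *\nAlllow: /temperature"))
```

An empty result means only that no line tripped these two checks, not that the file does what was intended.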