salam every one, this is a topic from google web master centrale blog:
Recently, Danny Sullivan brought up good questions about how search engines handle meta tags. Here are some answers about how we handle these tags at Google. this is a topic published in 2013... to get contents for your blog or your forum, just contact me at: devnasser@gmail.com
Multiple content values We recommend that you place all content values in one meta tag. This keeps the meta tags easy to read and reduces the chance for conflicts. For instance: <META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW"> If the page contains multiple meta tags of the same type, we will aggregate the content values. For instance, we will interpret <META NAME="ROBOTS" CONTENT="NOINDEX"> <META NAME="ROBOTS" CONTENT="NOFOLLOW"> The same way as: <META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW"> If content values conflict, we will use the most restrictive. So, if the page has these meta tags: <META NAME="ROBOTS" CONTENT="NOINDEX"> <META NAME="ROBOTS" CONTENT="INDEX"> We will obey the NOINDEX value. Unnecessary content values By default, Googlebot will index a page and follow links to it. So there's no need to tag pages with content values of INDEX or FOLLOW. Directing a robots meta tag specifically at Googlebot To provide instruction for all search engines, set the meta name to "ROBOTS". To provide instruction for only Googlebot, set the meta name to "GOOGLEBOT". If you want to provide different instructions for different search engines (for instance, if you want one search engine to index a page, but not another), it's best to use a specific meta tag for each search engine rather than use a generic robots meta tag combined with a specific one. You can find a list of bots at robotstxt.org. Casing and spacing Googlebot understands any combination of lowercase and uppercase. So each of these meta tags is interpreted in exactly the same way: <meta name="ROBOTS" content="NOODP"> <meta name="robots" content="noodp"> <meta name="Robots" content="NoOdp"> If you have multiple content values, you must place a comma between them, but it doesn't matter if you also include spaces. So the following meta tags are interpreted the same way: <META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW"> <META NAME="ROBOTS" CONTENT="NOINDEX,NOFOLLOW"> If you use both a robots.txt file and robots meta tags If the robots.txt and meta tag instructions for a page conflict, Googlebot follows the most restrictive. More specifically:
Googlebot interprets the following robots meta tag values:
As defined by robotstxt.org, the following direction means NOINDEX, NOFOLLOW. <META NAME="ROBOTS" CONTENT="NONE"> However, some webmasters use this tag to indicate no robots restrictions and inadvertently block all search engines from their content. Update: For more information, please see our robots meta tag documentation. |
Labels: crawling and indexing