Tuesday 27 November 2012

Updated Bing Webmaster Guidelines


Bing Webmaster Guidelines

These guidelines cover a broad range of topics and are intended to help your content be found and indexed within Bing.  These guidelines will not cover every instance, nor provide prescriptive actions specific to every website.  For more information, you should read our self-help documents and follow the Bing Webmaster Blog.  Inside Bing Webmaster Tools account, you will find SEO Reports and our SEO Analyzer tool for on-demand scanning of individual pages.  Both resources will offer basic guidance around what SEO work can be applied to a given website.  Beyond these features and sources, you may wish to consider seeking advice from a third party expert.

Content

Content is what Bing seeks.  By providing clear, deep, easy to find content on your website, we are more likely to index and show your content in search results.  Websites that are thin on content, showing mostly ads, or otherwise redirect visitors away from themselves to other sites quickly tend to not rank well.  Your content should be easy to navigate to, rich enough to engage the visitor and provide them the information they seek and as fresh as possible.  In many cases, content produced today will still be relevant years from now.  In some cases, however, content produced today will go out of date quickly.

Links

Links help Bing find new content and establish a vote of confidence between websites.  The site linking to your content is essentially telling us they trust your content.  Bing wants to see links grow organically, and abuses to this such as buying links or participating in link schemes (link farms, etc.) lead to the value of such links being deprecated.

Social

Social media plays a role in today’s effort to rank well in search results.  The most obvious part it plays is via influence.  If you are influential socially, this leads to your followers sharing your information widely, which in turn results in Bing seeing these positive signals.  These positive signals can have an impact on how you rank organically in the long run.

Indexation

Being indexed is the first step to developing traffic from Bing.  The main pathways to being indexed are:
  • Links to your content help Bing find it, which can lead us to index your content
  • Use of features within Bing Webmaster Tools such as Submit URL and Sitemap Upload are also ways to ensure we are aware of your content
Managing how Bingbot crawls your content can be done using the Crawl Control feature inside Bing Webmaster Tools.  This feature allows you to control when, and at what pace, Bingbot crawls your website.  Webmasters are encouraged to allow Bingbot to crawl quickly and deeply to ensure we find and index as much content as possible.

Technical

Page Load Time (PLT)

This element has a direct impact on the satisfaction a user has when they visit your website.  Slow load times can lead to a visitor simply leaving your website, seeking their information elsewhere.  If they came from our search results that may appear to us to be an unsatisfactory result that we showed.  Faster is better, but take care to balance absolute page load speed with a positive, useful user experience.

Robots.txt

This file is a touch point for Bingbot to understand how to interact with your website and its content.  You can tell Bingbot where to go, where not to go and by doing so guide its efforts to crawl your content.  The best practice is to have this file placed at the root of your domain (www.yourwebsite.com/robots.txt) and maintain it to ensure it remains accurate. This file is very powerful and has the capacity to block Bingbot from crawling your content.  Should you block Bingbot, we will not crawl your content and your site or content from your site may not appear in our search results.

Sitemap

This file often resides at the root of your host, say, www.yourdomain.com/sitemap.xml, and contains a list of all of the URLs from your website.  Large sites may wish to create an index file containing links to multiple sitemap.xml documents, each containing URLs from the website.  Care should be taken to keep these files as clean as possible, so remove old URLs if you take that content off your website. Most websites have their sitemap files crawled daily to locate any fresh content.  It’s important to keep your sitemap files clean and current to help us find your latest content.

Site technology

The technology used on your website can sometimes prevent Bingbot from being able to find your content.  Rich media (Flash, JavaScript, etc.) can lead to Bing not being able to crawl through navigation, or not see content embedded in a webpage.  To avoid any issue, you should consider implementing a down-level experience which includes the same content elements and links as your rich version does.  This will allow anyone (Bingbot) without rich media enabled to see and interact with your website.

Redirects

If you move content on your website from one location to another, using a redirect makes sense.  It can help preserve value the search engine has assigned to the older URL, helps ensure any bookmarks people have remain useful and keeps visitors to your website engaged with your content.  Bing prefers you use a 301 permanent redirect when moving content, should the move be permanent.  If the move is temporary, then a 302 temporary redirect will work fine.  Do not use the rel=canonical tag in place of a proper redirect.

Canonical Tags

The rel=canonical element helps us determine which version of a URL is the original, when multiple version of a URL return the same content.  This can happen when, for example, you append a tracking notation to a URL.  Two discrete URLs then exist, yet both have identical content.  By implementing a rel=canonical, you can tell us the original one, giving us a hint as to where we should place our trust.  Do not use this element in place of a proper redirect when moving content.
 

SEO

Search engine optimization is a valid practice which seeks to improve a website, making content easier to find and more relevant.  Taken to extremes, some practices can be abused.  The vast majority of instances render a website more appealing to Bing, though performing SEO-related work is no guarantee of rankings or the possibility to receive traffic from Bing.  The main area of focus when optimizing a website should include:
  • <title> tags – keep these clear and relevant
  • <meta description> tags – keep these clear and relevant, though use the added space to expand on the <title> tag in a meaningful way
  • <alt> tags – use the field to describe an image so we can understand the content of the image
  • <h1> tag – helps users understand the content of a page more clearly when properly used
  • Internal links – helps create a view of how content inside your website is related.  Also helps users navigate easily to related content.
  • Links to external sources – be careful who you link to as it’s a signal you trust them.  The number of links pointing from your page to external locations should be reasonable.
  • Social sharing – enabling social sharing encourages visitors to share your content with their networks
  • Crawlability
    • XML Sitemaps – make sure you have these set up and that you keep them fresh and current
    • Navigational structure – keep it clean, simple and easy to crawl
    • Rich media cautions – don’t bury links to content inside JavaScript
    • Graceful degradation – enable a clean down-level experience so crawlers can see your content
    • URL structure – avoid using session IDs, &, # and other characters when possible
    • Robots.txt – often placed at root of domain, be careful as its powerful; reference sitemap.xml (or your sitemap-index file) in this document
      • Verify that Bingbot is not disallowed or throttled in robots.txt: reference
    • Define high crawl rate hours in the Bing Webmaster Tools via the Crawl Control feature.
    • Verify that Bingbot is not blocked accidentally at the server level by doing a “Fetch as Bingbot”: reference
    • Webmasters are encouraged to use the Ignore URL Parameters (found under Configure My Site) tool inside Bing Webmaster Tools to help Bingbot understand which URLs are to be indexed and which URLs from a site may be ignored
  • Site Structure
    • Links – cross link liberally inside your site between relevant, related content; link to external sites as well
    • URL structure and keyword usage - keep it clean and keyword rich when possible
    • Clean URLs – no extraneous parameters (sessions, tracking, etc.)
    • HTML & XML sitemaps – enable both so users and crawlers can both find what they need – one does not replace the other
    • Content hierarchy – structure your content to keep valuable content close to the home page
    • Global navigation – springs from hierarchy planning + style of nav (breadcrumb, link lists, etc.) – helps ensure users can find all your content
  • Rich media warnings – don’t bury links in Javascript/flash/Silverlight;keep content out of these as well
  • On-Page
    • Head copy
      • Titles – unique, relevant, 65 characters or so long
      • Descriptions – unique, relevant, grammatically correct, roughly 160 or fewer characters
    • Body Copy
      • H1, H2 and other H* tag usage to show content structure on page
      • Only one <H1> tag per page
      • ALT tag usage – helps crawlers understand what is in an image
      • Keyword usage within the content/text – use the keyword/phrase you are targeting a few times; use variations as well
    • Anchor text – using targeted keywords as the linked text (anchor text) to support other internal pages
    • Content
      • Build based on keyword research – shows you what users are actually looking for
      • Down-level experience enhances discoverability – avoid housing content inside Flash or JavaScript – these block crawlers form finding the content
      • Keep out of rich media and images – don’t use images to house your content either
      • Create enough content to fully meet the visitor’s expectations.  There are no hard and fast rules on the number of words per page, but providing more relevant content is usually safe.
      • Produce new content frequently – crawlers respond to you posting fresh content by visiting more frequently
      • Make it unique – don’t reuse content from other sources – critical – content must be unique in its final form on your page
      • Content management – using 301s to reclaim value from retiring content/pages – a 301 redirect can pass some value from the old URL to the new URL
      • <rel canonical> to help engines understand which page should be indexed and have value attributed to it
      • 404 error page management can help cleanse old pages from search engine indexes; 404 page should return a 404 code, not a 200 OK code. Reference.
    • Links
      • Plan for incoming &  outgoing link generation – create a plan around how to build links internally and externally
      • Internal & external link management – execute by building internal links between related content; consider social media to help build external links, or simply ask websites for them; paying for links is risky
      • Content selection – planning where to link to – be thoughtful and link to only directly related/relevant items of content internally and externally
      • Link promotion via social spaces – these can drive direct traffic to you, and help users discover content to link to for you
      • Managing anchor text properly – carefully plan which actual words will be linked – use targeted keywords wherever possible

Avoid

Cloaking

This is when you show one version of a webpage to a search crawler like Bingbot, and another to normal visitors.

Link schemes – link farms, three way linking, etc.

Such schemes are intended to inflate the number of links pointed at a website.  While they may succeed in increasing the number, they fail to bring quality links to the site, netting no positive gains.

Social media schemes

Like farms are similar to link farms in that they seek to artificially exploit a network effect to game the algorithm.  The reality is these are easy to see in action and their value is deprecated. Auto follows encourage follower growth on social sites such as Twitter.  They work by automatically following anyone who follows you.  Over time this creates a scenario where the number of followers you have is more or less the same as the number of people following you.  This does not indicate you have a strong influence.  Following relatively few people while having a high follower count would tend to indicate a stronger influential voice.

Meta refresh redirects

These redirects reside in the code of a website and are programmed for a preset time interval.  They automatically redirect a visitor when the time expires, redirecting them to other content. Rather than using meta refresh redirects, we suggest you use a normal 301 redirect.

Duplicate content

Duplicating content across multiple URLs can lead to Bing losing trust in some of those URLs over time.  This issue should be managed by fixing the root cause of the problem.  The rel=canonical element can also be used but should be seen as a secondary solution to that of fixing the core problem. If excessive parameterization is causing duplicate content issue, we encourage you to use the Ignore URL Parameters tool.

Source: http://www.bing.com/webmaster/help/webmaster-guidelines-30fba23a

1 comment :

  1. I was searching for this stuff very long. Good to know about bing webmaster tools, thanks for writing up.

    ReplyDelete