
Can Duplicate content across multiple search engines impede Rankings?

Latest post 10-15-2009 5:41 PM by Andrew Bernero. 1 replies.
10-15-2009 1:07 PM
Advertiser
Joined on 04-13-2009
Posts 1
Can Duplicate content across multiple search engines impede Rankings?

I got this email today and it was news to me. Any input from the gurus out there? Below is the email:


Google: Cross-Domain Canonical Tag This Year

Duplicate content is a common occurrence on the web and in many cases can hurt search engine rankings. While the search engines may not always technically penalize webmasters for duplicate content, there are still a lot of ways it can hurt.

WebProNews is covering the Search Marketing Expo (SMX) East in New York, where representatives from the three major search engines (Google, Yahoo, and Bing) discussed how their respective web properties handle duplicate content issues. Following are some takeaways from each.

Duplicate Content in Google

The way Google handles duplicate content has been discussed a lot recently. This is largely due to a video uploaded by Google's Greg Grothaus, in which he discusses at length how Google handles a variety of duplicate content issues.

Joachim Kupke, Sr. Software Engineer on Google's Indexing Team, reiterated much of what Grothaus said. He also said that Google has a ton of infrastructure for eliminating duplicate content:

- redirects
- detection of recurrent URL patterns (the ability to 'learn' recurrent url patterns to find duplicated content)
- actual contents
- most recently crawled version
- earlier content
- contents minus things that don’t change on a site

Kupke said to avoid dynamic URLs when possible (although Google is "rather good" at eliminating dupes). If all else fails, use the canonical link element. Kupke calls this a "Swiss Army Knife" for duplicate content issues.
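To make the canonical link element concrete, here is a minimal sketch of what it looks like in a page's head and how a script might read it back out. The sample page, the example.com URL, and the helper names (CanonicalFinder, find_canonicals) are all made up for illustration; this says nothing about how Google itself parses pages.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> found in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr_map = dict(attrs)
        # rel can hold several space-separated tokens; compare case-insensitively.
        rel_tokens = (attr_map.get("rel") or "").lower().split()
        if "canonical" in rel_tokens and attr_map.get("href"):
            self.canonicals.append(attr_map["href"])


def find_canonicals(html):
    """Return the canonical hrefs declared in the given HTML text."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonicals


# Hypothetical page: a dynamic URL variant that names its preferred version.
SAMPLE_PAGE = """
<html><head>
<title>Blue widgets</title>
<link rel="canonical" href="http://www.example.com/widgets/blue">
</head><body>...</body></html>
"""

print(find_canonicals(SAMPLE_PAGE))
```

Running a check like this over your own pages is one way to confirm that every duplicate variant declares the URL you actually want indexed.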



Google says the canonical link element has been tremendously successful. It didn't even exist a year ago, and its use has grown exponentially. It has had a huge impact on Google's canonicalization decisions, and 2 out of 3 times, the canonical tag actually alters the organic decision in Google.

Google says a common mistake is designating a 404 as canonical, and this is typically caused by unnecessary relative links. So, avoid changing rel="canonical" designations, and avoid designating permanent redirects as canonical.
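The relative-link problem is easy to reproduce. The sketch below uses Python's standard urljoin to show how a relative canonical href resolves against the page it sits on; the URLs and the function names are hypothetical, and the point is only that a relative value can land on a path that does not exist.

```python
from urllib.parse import urljoin, urlsplit


def resolve_canonical(page_url, canonical_href):
    """Resolve a canonical href the way a relative link is resolved."""
    return urljoin(page_url, canonical_href)


def is_absolute(href):
    """True when the href carries its own scheme and host."""
    parts = urlsplit(href)
    return bool(parts.scheme and parts.netloc)


# Hypothetical page URL with a dynamic parameter.
page_url = "http://www.example.com/shop/widgets/blue.html?color=blue"

# A relative href is resolved against the page's directory, /shop/widgets/,
# so "widgets/blue.html" ends up doubled and probably points at a 404.
print(resolve_canonical(page_url, "widgets/blue.html"))

# An absolute href resolves to itself no matter where the page lives.
print(resolve_canonical(page_url, "http://www.example.com/shop/widgets/blue.html"))
```

Writing the canonical as a full absolute URL sidesteps this kind of surprise.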

Also, do not use disallow directives in robots.txt to annotate duplicate content. Doing so makes it harder to detect dupes, and disallowed 404s are a nuisance. There is an exception, however: interstitial login pages may be a good candidate to "robot out," according to Kupke.
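As a rough illustration of that advice, the sketch below uses Python's standard robots.txt parser on a made-up robots.txt that blocks only a login page and leaves duplicate-prone URLs crawlable. The rules, paths, and the build_parser helper are assumptions for the example, not recommendations from Google.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: only the interstitial login page is "roboted out".
ROBOTS_TXT = """\
User-agent: *
Disallow: /login
"""


def build_parser(robots_txt):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# The login interstitial is blocked from crawling.
print(parser.can_fetch("*", "http://www.example.com/login?next=/account"))

# A duplicate-prone variant stays crawlable, so a crawler can still read the
# page and whatever rel="canonical" it declares.
print(parser.can_fetch("*", "http://www.example.com/widgets/blue?sort=price"))
```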

Kupke says that canonical works, but indexing takes time. "Be patient and we WILL use your designated canonicals." Cleaning up an existing part of the index takes even longer, and this may leave dupes serving for a while despite rel=canonical, Kupke adds.

At SMX, Google announced that cross-domain rel=canonical support is coming this year. For example, if a Chicago Tribune article also appears on the New York Times site and the rel=canonical on that copy points to the Chicago Tribune, then Google will credit only the Chicago Tribune with the content.

Duplicate Content in Bing

As far as how Bing views duplicate content, intention is key. If your intent is to manipulate the search engine, you will be penalized.

Sasi Parthasarathy, Program Manager at Bing, says to consolidate all versions of a page under one URL. "Less is more, in terms of duplicate content." If possible, use only one URL per piece of content.

Bing doesn't support the canonical link element as a ranking factor yet, but support is coming, and Bing still recommends using it. Bing says that usage of canonical tags has increased in the past 6 months, but adoption issues still exist. According to Parthasarathy, 30% of canonical tags point to the same domain (which is fine), and 9% point to other domains. The latter could be a mistake or it could be manipulative. Bing says it will look at other factors to try to determine which it is.
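The same-domain versus other-domain split can be checked on your own pages with a few lines. This is only a sketch: it compares hostnames after dropping a leading "www.", which is a simplifying assumption (it treats other subdomains as different hosts), and the function names and URLs are invented for the example.

```python
from urllib.parse import urljoin, urlsplit


def bare_host(url):
    """Lowercased hostname with a leading 'www.' removed (simplification)."""
    host = urlsplit(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def canonical_scope(page_url, canonical_href):
    """Label a canonical as 'same-host' or 'cross-host' relative to its page."""
    target = urljoin(page_url, canonical_href)
    return "same-host" if bare_host(page_url) == bare_host(target) else "cross-host"


# Hypothetical examples.
print(canonical_scope("http://www.example.com/a?x=1", "/a"))
print(canonical_scope("http://www.example.com/a", "http://other.example.org/a"))
```

A cross-host result is not necessarily wrong, but it is the case worth reviewing by hand.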

Bing says canonical tags are hints and not directives. "Use it with caution," and not as an alternative to good web design.

With regard to www vs. non-www, just pick one and stick with it consistently. Remove default filenames at the end of your URLs. Bing also says that 301 redirects are your best friend for redirecting, and recommends using rel="nofollow" on useless pages and robots.txt to keep out content you don't want crawled.
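A minimal sketch of the "pick one host, drop default filenames" advice follows. The preferred host, the alias list, and the set of default filenames are assumptions chosen for the example; a real site would substitute its own values and issue the 301 from its web server.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumed values for illustration only.
PREFERRED_HOST = "www.example.com"
HOST_ALIASES = {"example.com", "www.example.com"}
DEFAULT_FILENAMES = {"index.html", "index.htm", "index.php", "default.aspx"}


def preferred_url(url):
    """Map a URL onto the single form the site wants indexed."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host in HOST_ALIASES:
        host = PREFERRED_HOST
    path = parts.path
    directory, _, filename = path.rpartition("/")
    if filename.lower() in DEFAULT_FILENAMES:
        path = directory + "/"  # /products/index.html becomes /products/
    if not path:
        path = "/"
    return urlunsplit((parts.scheme, host, path, parts.query, parts.fragment))


def needs_301(url):
    """True when the requested URL differs from its preferred form."""
    return url != preferred_url(url)


print(preferred_url("http://example.com/products/index.html"))
print(needs_301("http://www.example.com/products/"))
```

Any request for which needs_301 is true would be answered with a permanent redirect to preferred_url(url).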

Duplicate Content in Yahoo

If everything goes according to plan, anyone worried about how Yahoo handles duplicate content will need to worry about how Bing handles it. Even so, Yahoo's Cris Pierry, Sr. Director of Search, offered a few additional tips.

Pierry says descriptive URLs should be easily readable, and it's not a good idea to change URLs every year. In addition, use canonical, avoid case sensitivity, and avoid session IDs and parameters.
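Session IDs and mixed-case paths are two common ways one page ends up with many URLs. The sketch below strips session-style parameters and lowercases the path; the parameter names are guesses at common conventions, and lowercasing is only safe on a site that serves all of its URLs in lowercase, so treat both as assumptions.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed session parameter names; adjust to whatever your platform uses.
SESSION_PARAMS = {"sessionid", "sid", "phpsessid", "jsessionid"}


def strip_session_params(url):
    """Drop session-style query parameters and lowercase the host and path."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in SESSION_PARAMS
    ]
    return urlunsplit((
        parts.scheme,
        parts.netloc.lower(),
        parts.path.lower(),  # assumes the site serves lowercase paths
        urlencode(kept),
        parts.fragment,
    ))


print(strip_session_params(
    "http://www.example.com/Widgets/Blue?PHPSESSID=abc123&color=blue"
))
```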

Pierry also says to use sitemaps and submit them to Yahoo Site Explorer. Improve indexing through proper robots.txt usage, and use Site Explorer to delete URLs that you don't want Yahoo to index. Finally, provide feeds to Yahoo Site Explorer, and report spam sites linking to you in Site Explorer.

Yahoo says metadata and SearchMonkey are enhancing presentation.

WebProNews reporter Mike McDonald contributed to this article from SMX East. 


10-15-2009 5:41 PM In reply to
Andrew Bernero
Joined on 03-19-2009
Posts 3
BlogAdministrator
Clickable Gurus
Re: Can Duplicate content across multiple search engines impede Rankings?

Fortunately, duplicate content should not be an issue when managing paid search listings: it should not affect an advertiser's position, average costs, Quality Score, etc.

Duplicate content can, however, become an issue for the position and indexing of your pages in the non-sponsored (organic) search results.

In the dark ages, one could manipulate rankings using tricks like Meta keyword tag stuffing and leveraging the content of a well-ranking page by duplicating it in order to receive multiple well-ranking page listings.

It didn’t take long for the search engine algorithms to respond and become more sophisticated, thereby penalizing some pages and Web sites for using duplicate content. There are examples of legitimate uses for duplicate content, but for SEO best practices, I would try to err on the side of caution and avoid using it if possible.

What this announcement means is that a new tag has been created that can be placed in the head of HTML documents to “point” the search engines (Google, Bing and Yahoo) to the page that you would like indexed for this content by specifying a URL. This works only within a Web site and its subdomains, not across different Web sites. For example, if you operate several sites and have duplicate content across them, you cannot use this tag to point one Web site to a page on a different site, only to a page within the same Web site.

Matt Cutts, one of the chief engineers at Google who authored many of their search technology patents, explains the canonical tag in greater detail here.
