Unmasking the truth about the Google duplicate content debate.

Sunday, December 20, 2009
By admin

Unmask­ing the truth about the Google dupli­cate con­tent debate.

Google files a patent that will set­tle the debate for good.

Some believe that dupli­cate con­tent will hurt your SEO efforts, while oth­ers dis­miss the claim as a myth. Who is right? Well the answer lies in the new patent Google just filed: Dupli­cate doc­u­ment detec­tion in a web crawler sys­tem. The patent explains how the search engine’s con­tent server inter­acts with a dupli­cate con­tent server. The patent cov­ers what dupli­cate con­tent is, how it is detected, and how it effects you.

I guess the myths are in place because of the inter­pre­ta­tion of the web­mas­ter guides out­line on dupli­cate con­tent. The answer was dif­fer­ent to some who believed the con­tent to mean one thing and oth­ers belived the con­tent meant some­thing else entirely. What is dupli­cate con­tent accord­ing to Googles patent? Well the patent difene this by stating:

“Dupli­cate doc­u­ments are doc­u­ments that have sub­stan­tially iden­ti­cal con­tent, and in some embod­i­ments wholly iden­ti­cal con­tent, but dif­fer­ent doc­u­ment addresses.”

The patent also details three sep­per­ate sce­nar­ios in which dupli­cate doc­u­ments are encoun­tered by a web crawler:

  1. Two pages, com­pris­ing any com­bi­na­tion of reg­u­lar web page(s) and tem­po­rary redi­rect page(s), are dupli­cate doc­u­ments if they share the same page con­tent, but have dif­fer­ent URLs.
  2. Two tem­po­rary redi­rect pages are dupli­cate doc­u­ments if they share the same tar­get URL, but have dif­fer­ent source URLs.
  3. A reg­u­lar web page and a tem­po­rary redi­rect page are dupli­cate doc­u­ments if the URL of the reg­u­lar web page is the tar­get URL of the tem­po­rary redi­rect page or the con­tent of the reg­u­lar web page is the same as that of the tem­po­rary redi­rect page.

A per­ma­nent redi­rect page is not directly involved in dupli­cate doc­u­ment detec­tion because the crawlers won’t down­load the con­tents of a redi­rect page.

Accord­ing to the appar­ent descrip­tion, Google’s web crawler con­sults the alike agree­able server to analy­sis if a begin page is a arche­type of addi­tion doc­u­ment. The algo­rithm again deter­mines which adap­ta­tion is the a lot of impor­tant version.

Google can use altered meth­ods to ascer­tain alike con­tent. For exam­ple, Google abil­ity yield “con­tent fin­ger­prints” and ana­lyze them if a new web page is found.

Inter­est­ingly, it’s not con­sis­tently the page with the accom­plished PageR­ank that is called as the a lot of impor­tant URL for the con­tent. The patent states; The patent states:

“In some embod­i­ments, a canon­i­cal page of an equiv­a­lence class is not nec­es­sar­ily the doc­u­ment that has the high­est score (e.g., the high­est page rank or other query-independent metric).”

© 2009, admin. Copy­right 2009. All rights reserved.

Tags: , ,

10 Responses to “Unmasking the truth about the Google duplicate content debate.”

  1. […] Read the orig­i­nal here: Unmask­ing the truth about the Google dupli­cate con­tent debate … […]

    #238
  2. […] Unmask­ing the truth about the Google dupli­cate con­tent debate … Cat­e­gories: SEO, SEO Tips and […]

    #245
  3. Garden Burgers mydragonlair.com

    Oh yeah I totally agree as well.

    #309
  4. News & Reviews sergeantsdogtags.com

    Thanks for the great post.

    #310
  5. buy levitra online rescueourplanet.com

    gr8 stuff to have a such thing…keep the good work on … 49111@gmail.com

    #325
  6. Yuri walkinginyourshoes.com

    Yur site is excvel­lent buddy..i like it..keep up the good work .…i rec­om­mend it to all

    #338
  7. FreeMind dorsetmassage.com

    Free Your Mind!!!

    #342
  8. KoolGuy sergeantsdogtags.com

    Kool site, suits a Kool guy like you. Keep in touch…

    #343
  9. Great site! I’m good sites fan keep up the good work!

    #385
  10. Bill silhouettelingerie.com

    gr8 stuff to have a such thing…keep the good work on .

    #386

Leave a Reply

CommentLuv Enabled

This site uses KeywordLuv. Enter YourName@YourKeywords in the Name field to take advantage.

Get Adobe Flash playerPlugin by wpburn.com wordpress themes