Do as I say…
Google made an announcement on their webmaster’s blog spawning a lot of speculation over the death of scraped content.
Notably from the post:
…These techniques are usually accomplished by abusing qlweb style catalogues or by scraping content from sources known for good, valid content, like Wikipedia or the Open Directory Project.
These methods violate Google’s webmaster guidelines. Purely scraped content, even from high quality sources, does not provide any added value to your users. It’s worthwhile to take the time to create original content that sets your site apart. This will keep your visitors coming back and will provide useful search results. …
I take exception to this post purely on the basis that in the second paragraph they seem to infer that wiki is a “high quality source”, just because a site ranks in the top 5 for just about every query in Google doesn’t mean its high quality. It does mean that they’ve successfully designed their site to get a ton of Google-Love but as many, many people have discussed, these wiki results are ruining Google’s quality.
I am however hoping that this is just a shot across the bow of webmasters, a warning that changes are coming. Hopefully they are going to be true to their word and start removing these RSS feed generated sites, wiki clones, affiliate sites, shopping comparison junk, etc.
I’ll believe it when I see some of the biggest offenders like Google Directory and Google News deindexed. Currently The Google Directory enjoys 783,000 indexed pages and Google news has 13,200 pages.
Other than slapping up a view of the pagerank and reducing the competition by removing links to the other search engines can you tell me the value that Google has added to these pages?


