I have been researching some duplicate content topics lately for a friend/colleague of mine and while trying to find some examples for him I accidentally came across some interesting issues that Google is having on their own domain/subdomain with content being duplicated. I understand that massive sites (like Google) with as many different domains/subdomains and services as they have might be tough to manage. I imagine that their duplicate pages could get overwhelming to manage for anyone really quickly. Since they don’t seem to be aware of the issues I happened across (maybe more), I decided to write this post to help them out with a couple of the things I found (if they listen, I will consider posting more about other services).

In addition to the duplicate content issues I found, I also wanted to bring to light the many Froogle pages that still remain in Google’s index, despite being replaced by Google Product Search back in April 2007. Right now, if you type froogle.com into your address bar or even froogle.google.com, you’ll be redirected to the new product service at www.google.com/products. This is common, especially when changing domains or file locations, and it is good practice to use these types of 301 redirects, as Google stated about a year ago on the Webmaster Central blog.