

Is a spammer exploiting your sitemap files?
Published by News
05-17-2007

A recent thread in a webmaster forum indicated that some search engine spammers might exploit the new XML sitemaps files. Has your sitemaps file been abused by spammers? Can using a sitemaps file harm your search engine rankings?
What is a sitemaps XML file?
The big search engines (Google, Yahoo, MSN and Ask) introduced the Sitemaps protocol earlier this year.
In its simplest form, a sitemap is an XML file that lists URLs for a site along with additional metadata about each URL: when it was last updated, how often it usually changes, how important it is, relative to other URLs in the site, etc.
That information helps search engines crawl your site more intelligently. The Sitemaps protocol is a standard that makes it easier to create a sitemap that can be parsed by all search engines.
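To make the format concrete, here is a minimal sketch in Python that reads a small sitemap with the standard library. The sample file and its example.com URLs are made up for illustration; the element names (urlset, url, loc, lastmod, changefreq, priority) and the namespace are the ones defined by the Sitemaps protocol. It also shows how little work any program, friendly or not, needs to list every page named in the file.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the Sitemaps protocol (sitemaps.org, version 0.9).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

# Hypothetical sample sitemap; the URLs and dates are invented.
SAMPLE_SITEMAP = """<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>http://www.example.com/</loc>
    <lastmod>2007-05-01</lastmod>
    <changefreq>daily</changefreq>
    <priority>1.0</priority>
  </url>
  <url>
    <loc>http://www.example.com/articles/sitemaps.html</loc>
    <lastmod>2007-04-12</lastmod>
    <changefreq>monthly</changefreq>
    <priority>0.5</priority>
  </url>
</urlset>
"""


def parse_sitemap(xml_text):
    """Return one dict per <url> entry with its optional metadata."""
    # Parse from bytes so the encoding declaration in the file is honoured.
    root = ET.fromstring(xml_text.encode("utf-8"))
    ns = {"sm": SITEMAP_NS}
    entries = []
    for url in root.findall("sm:url", ns):
        entries.append({
            "loc": url.findtext("sm:loc", namespaces=ns),
            # The three fields below are optional; missing ones come back as None.
            "lastmod": url.findtext("sm:lastmod", namespaces=ns),
            "changefreq": url.findtext("sm:changefreq", namespaces=ns),
            "priority": url.findtext("sm:priority", namespaces=ns),
        })
    return entries


if __name__ == "__main__":
    for entry in parse_sitemap(SAMPLE_SITEMAP):
        print(entry["loc"], entry["lastmod"], entry["changefreq"], entry["priority"])
```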
How can such a file harm your rankings?
Some webmasters reported problems with duplicate content after adding a sitemaps XML file to their web sites.
The content of their websites had been copied to many dubious websites that had nothing to do with the original sites. As a result, the original sites might have received ranking penalties due to duplicate content.
What happened?
Some search engine spammers used the sitemaps XML files to easily find content for their scraper sites.
A scraper site is a website that pulls all of its information from other websites using automated tools. The scraper software pulls content from other websites to create new web pages that are designed around special keywords. The scraped pages usually show AdSense ads with which the spammers hope to make money.
The new sitemaps XML files make it very easy for scraper tools to find content-rich pages. Although the original intention of the sitemaps files was to inform search engines about every single page of your web site, they can also be used to inform spam bots about your pages.
What can you do to avoid problems with your sitemaps file?
One possible solution is not to use any sitemaps file at all. In that case, scraper bots can still find your pages by following the normal links on your web site, but that is more difficult for them than reading your sitemaps file.
Another solution is to set up a sitemaps file and delete it as soon as the search engines have read it.
Do not use free sitemap generator tools. You don't know what they will do with your data and they might even use it to create scraper sites with your content.
Unfortunately, there's not much that you can do to stop

  #1  
By dmiller68 on 05-17-2007, 11:02 PM
Re: is a spammer exploiting your sitemap files ?

Yeah, site duplication has been around for a while, done in different ways. It makes sense that the sitemap could make it easier. I can't think of a way around it other than IP blocking. If you could get all the IPs of the big three search engines, you could block all other requests for the sitemap. The key question is whether we can get the IPs.
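One way to approach the IP question without keeping a list of addresses is the reverse DNS check that the search engines themselves describe for identifying their crawlers: look up the host name for the requesting IP, check that it belongs to a crawler domain, then resolve that host name again and confirm it points back to the same IP. The sketch below is a rough illustration in Python, not a tested blocking solution. The function names and the suffix list are assumptions made for this example, so check each search engine's own documentation before relying on them. It handles IPv4 only.

```python
import socket

# Assumed host name suffixes for the big crawlers; verify these against each
# search engine's current documentation before using them.
CRAWLER_SUFFIXES = (
    ".googlebot.com",
    ".google.com",
    ".search.msn.com",
    ".crawl.yahoo.net",
)


def _reverse(ip):
    # Reverse DNS: IP address -> primary host name.
    return socket.gethostbyaddr(ip)[0]


def _forward(host):
    # Forward DNS: host name -> list of IPv4 addresses.
    return socket.gethostbyname_ex(host)[2]


def is_verified_crawler(ip, reverse_lookup=None, forward_lookup=None):
    """Return True only if ip passes the reverse and forward DNS check.

    The lookup functions can be swapped out (for example in tests);
    by default the real DNS resolver is used.
    """
    reverse_lookup = reverse_lookup or _reverse
    forward_lookup = forward_lookup or _forward
    try:
        host = reverse_lookup(ip)
    except OSError:
        return False  # no reverse DNS record
    # The host name must END with a crawler domain; a name that merely
    # contains one (googlebot.com.evil.example) is rejected.
    if not host.lower().rstrip(".").endswith(CRAWLER_SUFFIXES):
        return False
    try:
        addresses = forward_lookup(host)
    except OSError:
        return False  # host name does not resolve
    # Anyone can fake a reverse record, so the name must map back to the IP.
    return ip in addresses
```

A request for the sitemap file would then be answered only when is_verified_crawler(remote_ip) returns True. Since DNS lookups are slow, caching the result per IP would be sensible on a busy site.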
  #2  
By magnaromagna on 05-18-2007, 01:46 AM
Re: is a spammer exploiting your sitemap files ?

Interesting and... bad news!
Checking the IP is a big effort (if you make a mistake, the bot cannot spider your site). I see only the "manual" solution: generate the sitemap, submit it to Google, then delete it. Start again after 5 to 7 days.
  #3  
By Brandon on 05-18-2007, 06:46 AM
Re: is a spammer exploiting your sitemap files ?

Quote:
Originally Posted by magnaromagna
Interesting and... bad news!
Checking the IP is a big effort (if you make a mistake, the bot cannot spider your site). I see only the "manual" solution: generate the sitemap, submit it to Google, then delete it. Start again after 5 to 7 days.
I've thought about that, but I just hate to do all that work.
  #4  
By joopss on 05-21-2007, 03:03 PM
Re: is a spammer exploiting your sitemap files ?

Thanks.
All times are GMT -6.
