11-02-2006, 08:45 PM   #1
Brandon Sheley
Community Manager
Join Date: Jul 2006
Location: Topeka, KS
Posts: 14,080
Blog Entries: 35
Yahoo! Search Crawler (Yahoo! Slurp) - Supporting wildcards in robots.txt

Quote:
I was going through my notes from Danny Sullivan's Open Feedback sessions that occur during the ‘Meet the Crawlers’ panel at Search Engine Strategies. One of the items on my list was a request for enhanced syntax in robots.txt to make it easier for webmasters to manage how search crawlers, including Slurp, access their content.

For those who may not be as familiar with search index terminology, webmasters use the robots.txt file to tell robots that visit their site, including search engine crawlers, which files should be crawled and which should not. You can read about our support for robots directives in the help for Yahoo! Slurp.
Well, we can scratch that one off the list, since we have just updated Yahoo! Slurp to recognize two additional symbols in the robots.txt directives: '*' and '$'. The semantics of these symbols follow what is already widely understood for robots.txt files.
'*' - matches a sequence of characters
You can now use '*' in robots directives for Yahoo! Slurp to wildcard match a sequence of characters in your URL. You can use this symbol in any part of the URL string you provide in the robots directive. For example,
User-Agent: Yahoo! Slurp
Allow: /public*/
Disallow: /*_print*.html
Disallow: /*?sessionid
The robots directives above will:
  • allow crawling of all directories that begin with 'public', such as '/public_html/' or '/public_graphs/'
  • disallow crawling of any URL that contains '_print' followed later by '.html', such as '/card_print.html' or '/store_print/product.html'
  • disallow crawling of any URL that contains '?sessionid', such as '/cart.php?sessionid=342bca31'
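As a rough illustration of the '*' matching described above, here is a minimal Python sketch. It is not Yahoo!'s implementation; the helper names `wildcard_to_regex` and `matches` are made up for this example, and it assumes a directive is matched as a prefix of the URL string with '*' standing for any sequence of characters.

```python
import re

def wildcard_to_regex(pattern):
    # Hypothetical helper: every '*' becomes '.*', all other
    # characters (including '?' and '.') are treated literally.
    pieces = [re.escape(piece) for piece in pattern.split("*")]
    return re.compile(".*".join(pieces))

def matches(pattern, url):
    # re.match anchors only at the start of the string, which gives
    # the prefix semantics the post describes.
    return wildcard_to_regex(pattern).match(url) is not None

for pattern, url in [
    ("/public*/", "/public_html/"),
    ("/*_print*.html", "/store_print/product.html"),
    ("/*?sessionid", "/cart.php?sessionid=342bca31"),
    ("/*_print*.html", "/card.html"),
]:
    print(pattern, url, matches(pattern, url))
```

The first three pairs come from the examples in the post; the last one is an extra URL that should not match, since it contains no '_print'.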
Note that a trailing '*' is redundant, since Slurp already treats every directive as a prefix match. So, the following two directives are equivalent:
User-Agent: Yahoo! Slurp
Disallow: /private*
Disallow: /private
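If you maintain a long robots.txt, one way to spot such redundant entries is to strip trailing '*' characters before comparing directives. The sketch below assumes prefix matching as described in the post; `normalize` and `dedupe` are hypothetical names for this example, not part of any crawler or library.

```python
def normalize(pattern):
    # Under prefix matching, trailing '*' characters add nothing.
    # A pattern made only of '*' is returned unchanged, so it never
    # turns into an empty directive, which would mean something else.
    return pattern.rstrip("*") or pattern

def dedupe(patterns):
    # Keep the first occurrence of each normalized pattern, in order.
    seen = []
    for pattern in patterns:
        normalized = normalize(pattern)
        if normalized not in seen:
            seen.append(normalized)
    return seen

print(dedupe(["/private*", "/private"]))
```

With the two directives from the post, the list collapses to the single entry '/private'.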

'$' – anchors at the end of the URL string
You can now also use '$' in robots directives for Slurp to anchor the match to the end of the URL string. Without this symbol, Yahoo! Slurp treats each directive as a prefix and matches every URL that begins with it. For example:
User-Agent: Yahoo! Slurp
Disallow: /*.gif$
Allow: /*?$
The robots directives above will:
  • Disallow all files ending in '.gif' across your entire site. Note that without the '$', this would disallow every file containing '.gif' anywhere in its path
  • Allow all files ending in '?' to be crawled. This does not automatically allow files that merely contain '?' somewhere in the URL string
As you can see, this symbol only makes sense at the end of the string. Hence, when we see it, we assume that your directive terminates there and any characters after that symbol are ignored.
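The '$' behavior, including the rule that anything after the symbol is ignored, can be sketched in Python as follows. This is only an illustration under the assumptions stated in the post, not Slurp's actual code; `compile_directive` and `matches` are hypothetical names.

```python
import re

def compile_directive(pattern):
    # Anything after the first '$' is dropped, as the post describes.
    anchored = "$" in pattern
    if anchored:
        pattern = pattern.split("$", 1)[0]
    # '*' becomes '.*'; every other character is matched literally.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    # With '$' the match must reach the end of the URL string (\Z);
    # without it the directive is only a prefix.
    return re.compile(body + (r"\Z" if anchored else ""))

def matches(pattern, url):
    return compile_directive(pattern).match(url) is not None

print(matches("/*.gif$", "/images/logo.gif"))
print(matches("/*.gif$", "/images/logo.gif?size=2"))
print(matches("/*.gif", "/images/logo.gif?size=2"))
```

The sample URLs are invented for the example. The second call shows the anchored directive rejecting a URL that merely contains '.gif', while the third shows the unanchored form still matching it.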
Oh, by the way, if you thought we didn't support the 'Allow' directive, as you can see from these examples, we do.
If you have any questions about the new syntax or any particular cases you are concerned about, please write in at the Site Explorer forums or read up in our help area.
Next time you see me at SES, you should ask me what else is on my list!

Priyank Garg
Product Manager, Yahoo! Search
[Source....]
Thought this was some great info about Yahoo
