Go Back   vBulletin Setup > vBulletinSetup Information > vBulletin SEO Tips and SEO Questions > Search Engine News

Reply 
 
LinkBack Thread Tools Display Modes
Old 10-05-2006, 08:35 AM   #1
Community Manager
Supporters
vBulletin Owner
vBSetup Mods
 
Brandon Sheley's Avatar
 
Join Date: Jul 2006
Location: Topeka, KS
Posts: 14,080
Blog Entries: 35
Brandon Sheley is a splendid one to beholdBrandon Sheley is a splendid one to beholdBrandon Sheley is a splendid one to beholdBrandon Sheley is a splendid one to beholdBrandon Sheley is a splendid one to beholdBrandon Sheley is a splendid one to behold
Send a message via AIM to Brandon Sheley Send a message via MSN to Brandon Sheley Send a message via Yahoo to Brandon Sheley
Talking Google Launches Code Search

Now Google is embarking on, perhaps, its most ambitious indexing venture yet: indexing countless billions of lines of code as part of the new Google Code Search.

Google Code Search on Google Labs gives users a place to search for publicly accessible source code. It looks like a regular Google Search page but instead of searching Web pages, it's going to search billions of lines of code.

"The two ways that source code lives on the Internet is in archives, things like Zip files, gzip, etc. And then in software-control repositories like SourceForge.net, Google's code hosting, and other places," Google product manager Tom Stocky told internetnews.com.
"We'll be crawling all of that."

Google isn't just going to index the Zip archive files. They're actually going to open up the files and index all the individual files within in.

In the case of software-control repositories like CVS and SVN, Google will go into the public access and index the individual files within them.

Google's regular Googlebot crawler is being used to find and identify the Zip files. In the case of software-control repositories, Stocky noted that it's a different kind of crawler that has to access the CVS or SVN server and speak in a different protocol to then get the information back.
The total task is staggering.

Stocky was unable to provide a figure, but he did note the Google Code Search has billions of lines of code.

"We're not getting more specific than that, but it is a significant number," Stocky noted.

Google Code Search will offer users a number of different ways to find the code they are looking for. Users can perform search queries based on software license, programming language and by file name.

"We also support regular expressions, so instead of searching for keywords you can search for patterns of words," Stocky explained. "For people that know how to use regular expressions well you can get really specific search and search over some really obscure stuff."

Google is also launching an API for Code Search as part of the launch. The API will utilize Google's GDATA API format.

At launch there will be no Google AdSense ads on the results pages and the Code Search results are not integrated into the main Google index.

One of the possible uses of Google Code Search is for developers to do searches for their own code and see where people are using it. It may also help to combat plagiarism and software license use infractions.

"If you own code and someone else is posting illegally, there is a process where we can remove it from the index," Stocky noted.

Most of the code indexed by Google Code Search is open source-licensed. Stocky noted that Google doesn't believe that much, if any, is proprietary since it's all posted in public places.

"In the case of CVS and SVN there is a password capability so we believe if someone didn't want it to be seen by the outside world they would either have a password or not post it to a public place," Stocky commented.
Google Code Search is available here or via the advanced search option from Google.com
__________________
Brandon Sheley / vBulletinSetup Staff
Check the
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
&
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
for the latest deals.
Looking for a place to
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
?

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
and other
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
.

To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
/
Read the->
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.

Are you on Twitter?
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
I'm offering a few
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
& here is my
To view links or images in signatures your post count must be 10 or greater. You currently have 0 posts.
experiment
Brandon Sheley is offline   Reply With Quote

Advertisement [Remove Advertisement]

Reply 
vBulletin Setup > vBulletinSetup Information > vBulletin SEO Tips and SEO Questions > Search Engine News

Tags
code, google, launches, search

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Twitter launches business guide, search widget Brandon Sheley Community Forum Management 3 07-24-2009 02:36 PM
Installing Google AdSense code. Lampwick Make Money with vBulletin 11 03-22-2008 10:18 AM
Microsoft launches Windows Live search engine Brandon Sheley Search Engine News 0 09-12-2006 12:04 PM
Yahoo! launches 'social search' in Britain with multimillion-pound ad campaign Brandon Sheley Search Engine News 0 09-03-2006 11:14 PM
Google Launches Project Hosting Brandon Sheley Search Engine News 1 08-09-2006 07:05 PM


All times are GMT -8. The time now is 10:32 AM.