< Back to Blog

Yahoo! Uses Distributed Computing to Speed up Search Index Processing
Tue, 26 Feb 2008 10:18:56 by Kerry Dye

After wading through the official blogs http://www.ysearchblog.com/archives/000521.html on this subject, which mainly deal with the fact it is the world's largest Hadoop installations, I've got a few interesting facts from the SEO point of view.

  • The Webmap (which is the database that feeds their algorithm) now generates in a third less time than it did before
  • It keeps track of roughly 1 trillion links
  • It uses 10,000 Linux cores (which doesn't mean 10,000 computers or even 10,000 processors, as processors are multi-core, but I guess it makes a nice round number)

So hopefully that's some information that might actually be of use when talking to someone non-technical or if you have a need to discuss it with a client.



Kerry Dye
Campaign Delivery Manager


Subscribe

Archives

Related Blogs
Search Engine Optimisation, Wikipedia, and wasted effort
Fri, 26 Jun 2009 16:40:17 by Joe Bursell
Redesigned Google Webmaster Tools available to more users
Wed, 17 Jun 2009 10:32:31 by Emily Mace
The future of SEO has BO
Wed, 17 Jun 2009 10:20:12 by Matt Hopkins
UKs most dangerous Search Terms
Wed, 10 Jun 2009 10:09:00 by Pete Handley
Bing an update on Microsoft's new search engine
Wed, 10 Jun 2009 09:23:16 by Emily Mace
SEO Speak: Teleporting
Tue, 9 Jun 2009 17:45:16 by Emily Mace
Search Engine Optimisation and Sandboxing
Tue, 9 Jun 2009 16:41:11 by Joe Bursell