Yahoo! Uses Distributed Computing to Speed up Search Index Processing
26th February 2008 by Kerry Dye
After wading through the official blogs http://www.ysearchblog.com/archives/000521.html
on this subject, which mainly deal with the fact it is the world’s largest Hadoop installations, I’ve got a few interesting facts from the SEO point of
view.
- The Webmap (which is the database that feeds their algorithm) now generates in a third less time
than it did before - It keeps track of roughly 1 trillion links
- It uses 10,000 Linux cores (which doesn’t mean 10,000 computers or even 10,000 processors, as processors are multi-core, but I guess it makes a nice round number)
So hopefully that’s some information that might actually be of use when talking to someone non-technical or if you have a need to discuss it with a client.
Related Posts
- What is the Supplemental Index?
- Yahoo Search Marketing Update for 2010
- Google announce that site speed may become a ranking factor
- Yahoo Opens Search Results
- More on the Supplemental Index
- How to get out of the Supplemental Index
- Yahoo Sponsored Local Search
- Problems for Yahoo! Google drop ad sharing deal and Yahoo ask Microsoft to come and buy them!