Recently there was a number of questions on this list about Hadoop in general (maturity, scale, benchmarks, etc) and Yahoo involvement in particular.
I combined some useful links below, please feel free to add: Presentations and Articles about Hadoop http://wiki.apache.org/lucene-hadoop/HadoopPresentations http://wiki.apache.org/lucene-hadoop/HadoopArticles Applications and organizations using Hadoop http://wiki.apache.org/lucene-hadoop/PoweredBy Hadoop and Distributed Computing at Yahoo! http://developer.yahoo.com/blogs/hadoop/ http://developer.yahoo.net/blog/archives/2007/07/yahoo-hadoop.html http://biz.yahoo.com/bw/071112/20071112005373.html Eric Baldeschwieler about Yahoo's involvement in Hadoop. http://developer.yahoo.com/blogs/hadoop/2007/11/hadoop-blog-welcome.html Sort benchmark on 2000 nodes http://www.nabble.com/Sort-benchmark-on-2000-nodes-td12494697.html Thanks, Konstantin