Apologies for responding to myself, but after some more testing I've concluded that we had a minor network bottleneck that was partially masking the real problem: not enough disks. My deductions, based on Ganglia metrics, are in a follow-up blog post:
http://gbif.blogspot.com/2012/03/hbase-performance-evaluation-continued.html

Cheers,
Oliver

On 2012-02-28, at 5:10 PM, Oliver Meyn (GBIF) wrote:

> Hi all,
>
> I've spent the last couple of weeks working with PerformanceEvaluation,
> trying to understand scan performance in our little cluster. I've written a
> blog post with the results and would really welcome any input you may have.
>
> http://gbif.blogspot.com/2012/02/performance-evaluation-of-hbase.html
>
> Cheers,
> Oliver

--
Oliver Meyn
Software Developer
Global Biodiversity Information Facility (GBIF)
+45 35 32 15 12
http://www.gbif.org