Re: PerformanceEvaluation results

2012-06-08 Thread Oliver Meyn (GBIF)
, lars hofhansl lhofha...@yahoo.com wrote: We should like to this from the reference guide. - Original Message - From: Stack st...@duboce.net To: user@hbase.apache.org Cc: Sent: Tuesday, March 20, 2012 9:17 AM Subject: Re: PerformanceEvaluation results On Tue, Mar 20, 2012 at 8

Re: PerformanceEvaluation results

2012-03-20 Thread Oliver Meyn (GBIF)
Apologies for responding to myself, but after some more testing I've concluded that we had a minor network bottleneck that was partially masking the real problem: not enough disks. Deductions based on ganglia metrics in a follow-up blog post:

Re: PerformanceEvaluation results

2012-03-20 Thread Stack
On Tue, Mar 20, 2012 at 8:53 AM, Oliver Meyn (GBIF) om...@gbif.org wrote: Apologies for responding to myself, but after some more testing I've concluded that we had a minor network bottleneck that was partially masking the real problem: not enough disks.  Deductions based on ganglia metrics

Re: PerformanceEvaluation results

2012-03-20 Thread lars hofhansl
We should like to this from the reference guide. - Original Message - From: Stack st...@duboce.net To: user@hbase.apache.org Cc: Sent: Tuesday, March 20, 2012 9:17 AM Subject: Re: PerformanceEvaluation results On Tue, Mar 20, 2012 at 8:53 AM, Oliver Meyn (GBIF) om...@gbif.org wrote

Re: PerformanceEvaluation results

2012-03-20 Thread Stack
On Tue, Mar 20, 2012 at 9:55 AM, lars hofhansl lhofha...@yahoo.com wrote: We should like to this from the reference guide. It would work well as a second case study to follow the explorsys one: http://hbase.apache.org/book.html#trouble.casestudy St.Ack

PerformanceEvaluation results

2012-02-28 Thread Oliver Meyn (GBIF)
Hi all, I've spent the last couple of weeks working with PerformanceEvaluation, trying to understand scan performance in our little cluster. I've written a blog post with the results and would really welcome any input you may have.

Re: PerformanceEvaluation results

2012-02-08 Thread Tim Robertson
Hey Stack, Because we run a couple clusters now, we're using templating for the *.site.xml etc. You'll find them in: http://code.google.com/p/gbif-common-resources/source/browse/cluster-puppet/modules/hadoop/templates/ The values for the HBase 3 node cluster come from:

Re: PerformanceEvaluation results

2012-02-07 Thread Lars Francke
Hi Stack, Hi everyone, I do feel the HBase project would benefit from some example metrics for various operations and hardware or else it will remain a difficult technology for some people to get into with confidence.  We'll blog our findings, and hopefully it might be of benefit to other

Re: PerformanceEvaluation results

2012-02-07 Thread Stack
On Tue, Feb 7, 2012 at 3:27 AM, Lars Francke lars.fran...@gmail.com wrote: [1] http://code.google.com/p/gbif-common-resources/source/browse/#svn%2Fcluster-puppet I don't see your hbase-site.xml up here Lars. Am I looking in the wrong place? Good on you, St.Ack

Re: PerformanceEvaluation results

2012-02-02 Thread Tim Robertson
Thanks all for the comments. Ganglia set up is in progress. We'll keep plugging away. I should mention that this is our first real dev cluster for evaluation, and production would likely be more like a 6-7+ node cluster of better machines, but for sure we are the small fry leprechauns Ted

Re: PerformanceEvaluation results

2012-02-02 Thread Stack
On Thu, Feb 2, 2012 at 8:00 AM, Tim Robertson timrobertson...@gmail.com wrote: I do feel the HBase project would benefit from some example metrics for various operations and hardware or else it will remain a difficult technology for some people to get into with confidence.  We'll blog our

PerformanceEvaluation results

2012-02-01 Thread Tim Robertson
Hi all, We have a 3 node cluster (CD3u2) with the following hardware: RegionServers (+DN + TT) CPU: 2x Intel(R) Xeon(R) CPU E5630 @ 2.53GHz (quad) Disks: 6x250G SATA 5.4K Memory: 24GB Master (+ZK, JT, NN) CPU: Intel(R) Xeon(R) CPU X3363 @ 2.83GHz, 2x6MB (quad) Disks: 2x500G SATA 7.2K

Re: PerformanceEvaluation results

2012-02-01 Thread Michael Segel
No. What tuning did you do? Why such a small cluster? Sorry, but when you start off with a bad hardware configuration, you can get Hadoop/HBase to work, but performance will always be sub-optimal. Sent from my iPhone On Feb 1, 2012, at 6:52 AM, Tim Robertson timrobertson...@gmail.com wrote:

Re: PerformanceEvaluation results

2012-02-01 Thread Tim Robertson
Thanks Michael, It's a small cluster, but is the hardware so bad? We are particularly interested in relatively low load for random read write (2000 transactions per second on 1k rows) but a decent full table scan speed, as we aim to mount Hive tables on HBase backed tables. Regarding tuning...

Re: PerformanceEvaluation results

2012-02-01 Thread Doug Meil
Hi there- These perf-tests on small clusters are fairly common questions on the dist-list, but it needs to be stressed that Hbase (and HDFS) doesn't begin to stretch it's legs until about 5 nodes. http://hbase.apache.org/book.html#arch.overview On 2/1/12 7:51 AM, Tim Robertson

Re: PerformanceEvaluation results

2012-02-01 Thread Stack
On Wed, Feb 1, 2012 at 4:51 AM, Tim Robertson timrobertson...@gmail.com wrote: We're setting up ganglia now to start debugging, but any suggestions on how to diagnose this would be greatly appreciated. Get Ganglia set up Tim and then lets chat. You've checked out the perf section in the

Re: PerformanceEvaluation results

2012-02-01 Thread Michel Segel
Tim, Here's the problem in a nutshell, With respect to hardware, you have 5.4k rpms ? 6 drive and 8 cores? Small slow drives, and still a ratio less than one when you compare drives to spindles. I appreciate that you want to maximize performance, but when it comes to tuning, you have to