I am on EC2, with HBase 0.94.2 and I can't find an explanation to this
phenomena:
I'm having these metrics on my RegionServers:

blockCacheHitRatio=87%
fsReadLatencyHistogramMean=  5886103.47
fsReadLatencyHistogramMedian=6280445
fsReadLatencyHistogram75th= 28117916.5
fsReadLatencyHistogram95th= 53674180.05

Now, I am wondering, if 87% of my reads are cached, what could be the
reason for 75% of the reads falling on 28.1ms - that is, I expect that the
75th percentile reads would be from cache - and with cache performance, not
to mention the 95th percentile. As far as I am concerned, the mean and
median are great for our use case but the tails are too wide and thick.

Secondly, I've noticed that when we're running mapreduce jobs, the number
of requestsPerSecond and all fs* and *requestsCount are peaking - afaik the
mapper (running cascading-maple) just reads directly from HDFS and should
not have any influence on the RPC's (for example) to the RegionServer and
the associated metrics, what is going on with the RS metrics ?

Thanks
Amit

Reply via email to