[ 
https://issues.apache.org/jira/browse/HBASE-2888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893851#action_12893851
 ] 

ryan rawson commented on HBASE-2888:
------------------------------------

here are some of the things i've identified as issues:

- HFile states, eg: fsReadLatency, is in milliseconds, and it should really be 
in microseconds.
- we should generate 99th and 95th percentile for many of the stats (eg: 
fsReadLatency) and publish it.  Perhaps a 1 and/or 5 minute 99th rolling 
percentile.
- The HFile metrics integration is a little weak, we use some volatiles and 
scrape them, for the enhanced 99th/95th pc stats we'll need access to the 
richer stats classes.  HFile depends on Hadoop and hbase.util so with a little 
moving of things around, hopefully it'll be possible to actually make better 
stats w/o having HFile depends on HRS (for example)

> Review all our metrics
> ----------------------
>
>                 Key: HBASE-2888
>                 URL: https://issues.apache.org/jira/browse/HBASE-2888
>             Project: HBase
>          Issue Type: Improvement
>          Components: master
>            Reporter: Jean-Daniel Cryans
>             Fix For: 0.90.0
>
>
> HBase publishes a bunch of metrics, some useful some wasteful, that should be 
> improved to deliver a better ops experience. Examples:
>  - Block cache hit ratio converges at some point and stops moving
>  - fsReadLatency goes down when compactions are running
>  - storefileIndexSizeMB is the exact same number once a system is serving 
> production load
> We could use new metrics too.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to