[ https://issues.apache.org/jira/browse/HBASE-19722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16491066#comment-16491066 ]
Andrew Purtell commented on HBASE-19722: ---------------------------------------- bq. Lossy counting algorithm designed the sweeping happens every "1 / errorRate" items arrived. For example, if e is 0.02, sweep() method will be called every 50 times. It's still done inline with the metrics update. How expensive is the sweep? Was thinking if done from a chore in another thread the average work per update would be less, but I suppose there would be locking while the sweep happens, so never mind. Thanks. 350 might be more than an operator would want. We don't have a precise definition for what would be a manageable number of counters because it would be a matter of opinion. 100 would be fine IMHO, 1000 not so much. Just some numbers I made up right now thinking about scraping the JMX directly. If an operator would query this information from an external metrics DB, they could sort and cut the list with the DB's query tools, so it doesn't really matter for those poeple. Need to consider others who might process the raw JMX data, though. > Implement a meta query statistics metrics source > ------------------------------------------------ > > Key: HBASE-19722 > URL: https://issues.apache.org/jira/browse/HBASE-19722 > Project: HBase > Issue Type: Sub-task > Reporter: Andrew Purtell > Assignee: Xu Cang > Priority: Major > Attachments: HBASE-19722.branch-1.v001.patch, > HBASE-19722.master.010.patch, HBASE-19722.master.011.patch, > HBASE-19722.master.012.patch, HBASE-19722.master.013.patch > > > Implement a meta query statistics metrics source, created whenever a > regionserver starts hosting meta, removed when meta hosting moves. Provide > views on top tables by request counts, top meta rowkeys by request count, top > clients making requests by their hostname. > Can be implemented as a coprocessor. -- This message was sent by Atlassian JIRA (v7.6.3#76005)