[ https://issues.apache.org/jira/browse/HBASE-4463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13113847#comment-13113847 ]
Karthik Ranganathan commented on HBASE-4463: -------------------------------------------- @Stack - we can find the exact amount of data we are writing to the dfs (only hfile blocks will contribute to this during compactions). So adding a threshold like this is not too hard... but there could be disk iops pressure (instead of network bandwidth) and detecting that would be hard. So we would still need to set off-peak time. I was trying to come up with a more generic solution but that involves setting up a feedback loop inside the regionserver - keep track of max, min and average latencies over the last k days (would have to store this in META or some other location as it needs to persist beyond restarts). Need to remove any spikes in the values. When we run an aggressive compaction, we need to make sure the latencies are still acceptable, otherwise dont run aggressive compactions. This is much harder to get right though. > Run more aggressive compactions during off peak hours > ----------------------------------------------------- > > Key: HBASE-4463 > URL: https://issues.apache.org/jira/browse/HBASE-4463 > Project: HBase > Issue Type: Improvement > Components: regionserver > Reporter: Karthik Ranganathan > Assignee: Karthik Ranganathan > > The number of iops on the disk and the top of the rack bandwidth utilization > at off peak hours is much lower than at peak hours depending on the > application usage pattern. We can utilize this knowledge to improve the > performance of the HBase cluster by increasing the compact selection ratio to > a much larger value during off-peak hours than otherwise - increasing > hbase.hstore.compaction.ratio (1.2 default) to > hbase.hstore.compaction.ratio.offpeak (5 default). This will help reduce the > average number of files per store. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira