Improve RegionSplitter Performance ---------------------------------- Key: HBASE-3610 URL: https://issues.apache.org/jira/browse/HBASE-3610 Project: HBase Issue Type: Improvement Reporter: Nicolas Spiegelberg Assignee: Nicolas Spiegelberg Priority: Trivial
When running RegionSplitter on a 100-node cluster with 900 regions (and plenty of data), the utility took around 72 hours to run. Analysis revealed two major bottlenecks: 1. We are serialized on the logical split (i.e. waiting for the split request to be registered). Parallelizing this step will align configured and actual outstanding splits. 2. Outstanding splits are modeled like a queue. Changing this to a list with a scanner will allow handling splits that finish out of order. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira