Improve RegionSplitter Performance
----------------------------------

                 Key: HBASE-3610
                 URL: https://issues.apache.org/jira/browse/HBASE-3610
             Project: HBase
          Issue Type: Improvement
            Reporter: Nicolas Spiegelberg
            Assignee: Nicolas Spiegelberg
            Priority: Trivial


When running RegionSplitter on a 100-node cluster with 900 regions (and plenty 
of data), the utility took around 72 hours to run.  Analysis revealed two major 
bottlenecks:

1. We are serialized on the logical split (i.e. waiting for the split request 
to be registered).  Parallelizing this step will align configured and actual 
outstanding splits.
2. Outstanding splits are modeled like a queue.  Changing this to a list with a 
scanner will allow handling splits that finish out of order.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to