[ https://issues.apache.org/jira/browse/HBASE-13959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14602146#comment-14602146 ]
Hadoop QA commented on HBASE-13959:
-----------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12741944/13959-suggest.txt
  against master branch at commit e6ed79219966ce0dac3bc748261fce9478aa7550.
  ATTACHMENT ID: 12741944

    {color:green}+1 @author{color}. The patch does not contain any @author tags.

    {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests.
        Please justify why no new tests are needed for this patch.
        Also please list what manual steps were performed to verify this patch.

    {color:green}+1 hadoop versions{color}. The patch compiles with all supported hadoop versions (2.4.0 2.4.1 2.5.0 2.5.1 2.5.2 2.6.0 2.7.0)

    {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.

    {color:green}+1 protoc{color}. The applied patch does not increase the total number of protoc compiler warnings.

    {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.

    {color:green}+1 checkstyle{color}. The applied patch does not increase the total number of checkstyle errors

    {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings.

    {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.

    {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100

    {color:green}+1 site{color}. The mvn post-site goal succeeds with this patch.

    {color:red}-1 core tests{color}.
The patch failed these unit tests:
    org.apache.hadoop.hbase.TestRegionRebalancing

Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/14578//testReport/
Release Findbugs (version 2.0.3) warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/14578//artifact/patchprocess/newFindbugsWarnings.html
Checkstyle Errors: https://builds.apache.org/job/PreCommit-HBASE-Build/14578//artifact/patchprocess/checkstyle-aggregate.html
Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/14578//console

This message is automatically generated.

> Region splitting takes too long because it uses a single thread in most common cases
> ------------------------------------------------------------------------------------
>
>                 Key: HBASE-13959
>                 URL: https://issues.apache.org/jira/browse/HBASE-13959
>             Project: HBase
>          Issue Type: Bug
>          Components: regionserver
>    Affects Versions: 0.98.12
>            Reporter: Hari Krishna Dara
>            Assignee: Hari Krishna Dara
>            Priority: Critical
>             Fix For: 0.98.14
>
>         Attachments: 13959-suggest.txt, HBASE-13959-2.patch, HBASE-13959-3.patch, HBASE-13959-4.patch, HBASE-13959.patch, region-split-durations-compared.png
>
>
> When storefiles need to be split as part of a region split, the current logic
> uses a threadpool whose size is set to the number of stores. Since the most
> common table setup involves only a single column family, this translates to
> a single store, so the threadpool runs with a single thread. However, in a
> write-heavy workload, there could be several tens of storefiles in a store at
> the time of splitting, and with a threadpool size of one, these files end up
> being split sequentially.
> With a bit of tracing, I noticed that it takes an average of 350ms to create
> a single reference file, and splitting each storefile involves creating two
> of these. With a storefile count of 20, this phase alone takes about 14s
> (20 storefiles x 2 reference files x 350ms), pushing the total time the
> region is offline to 18s or more. For environments that are set up to fail
> fast, this makes the client exhaust all retries and fail with
> NotServingRegionException.
> The fix should increase the concurrency of this operation.
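
The arithmetic in the description, and the effect of sizing the pool by storefile count rather than store count, can be sketched as follows. This is a minimal illustration, not the attached patch and not HBase code: the class `SplitConcurrencySketch`, its methods, and the `maxThreads` cap are all hypothetical names, and the parallel estimate assumes ideal scaling, which a real filesystem may not deliver.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch; none of these names come from HBase or from the patch.
class SplitConcurrencySketch {

    /** Estimated time when every reference file is created one after another. */
    public static long sequentialMillis(int storefiles, int refsPerStorefile, long millisPerRef) {
        return (long) storefiles * refsPerStorefile * millisPerRef;
    }

    /** Estimated time when storefiles are spread over `threads` workers (assumes ideal scaling). */
    public static long parallelMillis(int storefiles, int refsPerStorefile,
                                      long millisPerRef, int threads) {
        int rounds = (storefiles + threads - 1) / threads; // ceiling division
        return (long) rounds * refsPerStorefile * millisPerRef;
    }

    /** Pool size driven by storefile count, capped by an assumed limit, never below 1. */
    public static int poolSize(int storefiles, int maxThreads) {
        return Math.max(1, Math.min(storefiles, maxThreads));
    }

    /** Submits one task per storefile to a fixed pool and returns how many completed. */
    public static int splitAll(int storefiles, int maxThreads, Runnable perStorefileWork) {
        ExecutorService pool = Executors.newFixedThreadPool(poolSize(storefiles, maxThreads));
        try {
            List<Future<?>> futures = new ArrayList<>();
            for (int i = 0; i < storefiles; i++) {
                futures.add(pool.submit(perStorefileWork));
            }
            int done = 0;
            for (Future<?> f : futures) {
                f.get(); // blocks until the task finishes; surfaces any failure
                done++;
            }
            return done;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while splitting", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("a split task failed", e.getCause());
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        // Figures from the issue description: 20 storefiles, 2 references each, 350ms per reference.
        System.out.println("sequential ms: " + sequentialMillis(20, 2, 350));
        // The cap of 8 threads is an arbitrary value chosen for illustration.
        System.out.println("8 threads ms:  " + parallelMillis(20, 2, 350, 8));
        System.out.println("tasks done:    " + splitAll(20, 8, () -> { }));
    }
}
```

With the reported figures, the sequential estimate reproduces the 14s in the description, while a cap of 8 threads would bring the ideal-case estimate down to about 2.1s. Actual gains depend on how well reference-file creation parallelizes on the underlying filesystem.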