[ https://issues.apache.org/jira/browse/HBASE-20769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523157#comment-16523157 ]
Hadoop QA commented on HBASE-20769: ----------------------------------- | (/) *{color:green}+1 overall{color}* | \\ \\ || Vote || Subsystem || Runtime || Comment || | {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 14s{color} | {color:blue} Docker mode activated. {color} | || || || || {color:brown} Prechecks {color} || | {color:green}+1{color} | {color:green} hbaseanti {color} | {color:green} 0m 0s{color} | {color:green} Patch does not have any anti-patterns. {color} | | {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} | | {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 0s{color} | {color:green} The patch appears to include 1 new or modified test files. {color} | || || || || {color:brown} master Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 4m 57s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 32s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 19s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} shadedjars {color} | {color:green} 4m 27s{color} | {color:green} branch has no errors when building our shaded downstream artifacts. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 0m 43s{color} | {color:green} master passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 17s{color} | {color:green} master passed {color} | || || || || {color:brown} Patch Compile Tests {color} || | {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 5m 12s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} compile {color} | {color:green} 0m 31s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javac {color} | {color:green} 0m 31s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 17s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} | | {color:green}+1{color} | {color:green} shadedjars {color} | {color:green} 4m 27s{color} | {color:green} patch has no errors when building our shaded downstream artifacts. {color} | | {color:green}+1{color} | {color:green} hadoopcheck {color} | {color:green} 9m 52s{color} | {color:green} Patch does not cause any errors with Hadoop 2.7.4 or 3.0.0. {color} | | {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 0m 42s{color} | {color:green} the patch passed {color} | | {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 14s{color} | {color:green} the patch passed {color} | || || || || {color:brown} Other Tests {color} || | {color:green}+1{color} | {color:green} unit {color} | {color:green} 12m 22s{color} | {color:green} hbase-mapreduce in the patch passed. {color} | | {color:green}+1{color} | {color:green} asflicense {color} | {color:green} 0m 11s{color} | {color:green} The patch does not generate ASF License warnings. {color} | | {color:black}{color} | {color:black} {color} | {color:black} 45m 40s{color} | {color:black} {color} | \\ \\ || Subsystem || Report/Notes || | Docker | Client=17.05.0-ce Server=17.05.0-ce Image:yetus/hbase:b002b0b | | JIRA Issue | HBASE-20769 | | JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12929125/HBASE-20769.master.004.patch | | Optional Tests | asflicense javac javadoc unit findbugs shadedjars hadoopcheck hbaseanti checkstyle compile | | uname | Linux a030d521957a 3.13.0-139-generic #188-Ubuntu SMP Tue Jan 9 14:43:09 UTC 2018 x86_64 GNU/Linux | | Build tool | maven | | Personality | /home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/component/dev-support/hbase-personality.sh | | git revision | master / 4ba6242a62 | | maven | version: Apache Maven 3.5.4 (1edded0938998edf8bf061f1ceb3cfdeccf443fe; 2018-06-17T18:33:14Z) | | Default Java | 1.8.0_171 | | findbugs | v3.1.0-RC3 | | Test Results | https://builds.apache.org/job/PreCommit-HBASE-Build/13387/testReport/ | | Max. process+thread count | 3879 (vs. ulimit of 10000) | | modules | C: hbase-mapreduce U: hbase-mapreduce | | Console output | https://builds.apache.org/job/PreCommit-HBASE-Build/13387/console | | Powered by | Apache Yetus 0.7.0 http://yetus.apache.org | This message was automatically generated. > getSplits() has a out of bounds problem in TableSnapshotInputFormatImpl > ----------------------------------------------------------------------- > > Key: HBASE-20769 > URL: https://issues.apache.org/jira/browse/HBASE-20769 > Project: HBase > Issue Type: Bug > Affects Versions: 1.3.0, 1.4.0, 2.0.0 > Reporter: Jingyun Tian > Assignee: Jingyun Tian > Priority: Major > Fix For: 2.0.0 > > Attachments: HBASE-20769.master.001.patch, > HBASE-20769.master.002.patch, HBASE-20769.master.003.patch, > HBASE-20769.master.004.patch > > > When numSplits > 1, getSplits may create split that has start row smaller > than user specified scan's start row or stop row larger than user specified > scan's stop row. > {code} > byte[][] sp = sa.split(hri.getStartKey(), hri.getEndKey(), numSplits, > true); > for (int i = 0; i < sp.length - 1; i++) { > if (PrivateCellUtil.overlappingKeys(scan.getStartRow(), > scan.getStopRow(), sp[i], > sp[i + 1])) { > List<String> hosts = > calculateLocationsForInputSplit(conf, htd, hri, tableDir, > localityEnabled); > Scan boundedScan = new Scan(scan); > boundedScan.setStartRow(sp[i]); > boundedScan.setStopRow(sp[i + 1]); > splits.add(new InputSplit(htd, hri, hosts, boundedScan, > restoreDir)); > } > } > {code} > Since we split keys by the range of regions, when sp[i] < scan.getStartRow() > or sp[i + 1] > scan.getStopRow(), the created bounded scan may contain range > that over user defined scan. > fix should be simple: > {code} > boundedScan.setStartRow( > Bytes.compareTo(scan.getStartRow(), sp[i]) > 0 ? scan.getStartRow() : sp[i]); > boundedScan.setStopRow( > Bytes.compareTo(scan.getStopRow(), sp[i + 1]) < 0 ? scan.getStopRow() : sp[i > + 1]); > {code} > I will also try to add UTs to help discover this problem -- This message was sent by Atlassian JIRA (v7.6.3#76005)