[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16395591#comment-16395591 ]

BELUGA BEHR commented on HIVE-16855:

Errors appear to be unrelated.

> org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
> --
>
> Key: HIVE-16855
> URL: https://issues.apache.org/jira/browse/HIVE-16855
> Project: Hive
> Issue Type: Improvement
> Affects Versions: 2.1.1, 3.0.0
> Reporter: BELUGA BEHR
> Assignee: BELUGA BEHR
> Priority: Minor
> Attachments: HIVE-16855.1.patch, HIVE-16855.2.patch, HIVE-16855.3.patch
>
> # Improve (Simplify) Logging
> # Remove custom buffer size for {{BufferedInputStream}} and instead rely on JVM default which is often larger these days (8192)
> # Simplify looping logic

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
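As an illustration of item 2 in the issue description (this is not code from the patch), the sketch below compares a {{BufferedInputStream}} built with an explicit buffer size against one built with the single-argument constructor. The class name BufferSizeSketch, the helper RecordingStream, and the 1024-byte custom size are all hypothetical. The 8192 figure quoted in the description is, as far as I know, an implementation default rather than a documented guarantee, so the example measures the read size instead of assuming it.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

public class BufferSizeSketch {

    /** Hypothetical helper: records the largest bulk read requested from the wrapped stream. */
    static final class RecordingStream extends FilterInputStream {
        int largestRequest = 0;

        RecordingStream(InputStream in) {
            super(in);
        }

        @Override
        public int read(byte[] b, int off, int len) throws IOException {
            largestRequest = Math.max(largestRequest, len);
            return super.read(b, off, len);
        }
    }

    /**
     * Drains a payload one byte at a time through a BufferedInputStream and
     * returns the largest read the buffer asked of the underlying stream.
     * Pass null as customBufferSize to use the JVM's default buffer size.
     */
    static int largestRequest(int payloadSize, Integer customBufferSize) {
        RecordingStream raw =
            new RecordingStream(new ByteArrayInputStream(new byte[payloadSize]));
        try (InputStream in = (customBufferSize == null)
                ? new BufferedInputStream(raw)
                : new BufferedInputStream(raw, customBufferSize)) {
            while (in.read() != -1) {
                // Discard the data; only the read pattern matters here.
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return raw.largestRequest;
    }

    public static void main(String[] args) {
        System.out.println("custom (1024): " + largestRequest(100_000, 1024));
        System.out.println("JVM default:   " + largestRequest(100_000, null));
    }
}
```

If the default turns out to be larger than the custom size on the JVM in use, dropping the explicit size means fewer, larger reads against the underlying file, which is the motivation given in the description.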
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16392325#comment-16392325 ]

Hive QA commented on HIVE-16855:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12913655/HIVE-16855.3.patch

{color:red}ERROR:{color} -1 due to no test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 103 failed/errored test(s), 13348 tests executed
*Failed tests:*
{noformat}
TestJdbcWithMiniHA - did not produce a TEST-*.xml file (likely timed out) (batchId=239)
TestNegativeCliDriver - did not produce a TEST-*.xml file (likely timed out) (batchId=94)
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16392263#comment-16392263 ]

Hive QA commented on HIVE-16855:

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
|| || || || {color:brown} Prechecks {color} ||
| {color:blue}0{color} | {color:blue} findbugs {color} | {color:blue} 0m 0s{color} | {color:blue} Findbugs executables are not available. {color} |
| {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s{color} | {color:green} The patch does not contain any @author tags. {color} |
|| || || || {color:brown} master Compile Tests {color} ||
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 7m 9s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 0s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 36s{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 53s{color} | {color:green} master passed {color} |
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 1m 16s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 1s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green} 1m 1s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 0m 40s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} whitespace {color} | {color:green} 0m 0s{color} | {color:green} The patch has no whitespace issues. {color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 51s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:red}-1{color} | {color:red} asflicense {color} | {color:red} 0m 13s{color} | {color:red} The patch generated 49 ASF License warnings. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 13m 52s{color} | {color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| Optional Tests | asflicense javac javadoc findbugs checkstyle compile |
| uname | Linux hiveptest-server-upstream 3.16.0-4-amd64 #1 SMP Debian 3.16.36-1+deb8u1 (2016-09-03) x86_64 GNU/Linux |
| Build tool | maven |
| Personality | /data/hiveptest/working/yetus_PreCommit-HIVE-Build-9561/dev-support/hive-personality.sh |
| git revision | master / 73ccc44 |
| Default Java | 1.8.0_111 |
| asflicense | http://104.198.109.242/logs//PreCommit-HIVE-Build-9561/yetus/patch-asflicense-problems.txt |
| modules | C: ql U: ql |
| Console output | http://104.198.109.242/logs//PreCommit-HIVE-Build-9561/yetus.txt |
| Powered by | Apache Yetus http://yetus.apache.org |

This message was automatically generated.
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16392014#comment-16392014 ]

Sahil Takiar commented on HIVE-16855:

+1 pending Hive QA
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16391908#comment-16391908 ]

BELUGA BEHR commented on HIVE-16855:

[~stakiar] Posted a new patch with the {{Arrays#fill}} call added back in. Thanks!
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16391766#comment-16391766 ]

Sahil Takiar commented on HIVE-16855:

I agree the call to {{Arrays#fill}} is weird, but I don't see a strong reason to remove it; and removing it could have unexpected consequences. Besides that, the rest of the patch LGTM.
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16320638#comment-16320638 ]

BELUGA BEHR commented on HIVE-16855:

[~aihuaxu] [~pvary] Please consider this patch for inclusion into the project. Thanks.
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317013#comment-16317013 ]

BELUGA BEHR commented on HIVE-16855:

Also applies to 2.x branch
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16317007#comment-16317007 ]

BELUGA BEHR commented on HIVE-16855:

[~szita] :)
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16250349#comment-16250349 ]

Hive QA commented on HIVE-16855:

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12897352/HIVE-16855.2.patch

{color:red}ERROR:{color} -1 due to no test(s) being added or modified.

{color:red}ERROR:{color} -1 due to 13 failed/errored test(s), 11361 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[avro_partitioned_native] (batchId=6)
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver[dbtxnmgr_showlocks] (batchId=77)
org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[unionDistinct_1] (batchId=146)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[insert_values_orig_table_use_metadata] (batchId=162)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[llap_acid_fast] (batchId=157)
org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[sysdb] (batchId=156)
org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver.org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver (batchId=176)
org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver[explainanalyze_2] (batchId=102)
org.apache.hadoop.hive.cli.TestNegativeMinimrCliDriver.testCliDriver[ct_noperm_loc] (batchId=94)
org.apache.hadoop.hive.cli.TestSparkCliDriver.testCliDriver[subquery_multi] (batchId=111)
org.apache.hadoop.hive.cli.control.TestDanglingQOuts.checkDanglingQOut (batchId=206)
org.apache.hadoop.hive.ql.exec.tez.TestWorkloadManager.testApplyPlanQpChanges (batchId=281)
org.apache.hadoop.hive.ql.parse.TestReplicationScenarios.testConstraints (batchId=223)
{noformat}

Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/7792/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/7792/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-7792/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 13 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12897352 - PreCommit-HIVE-Build
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16246657#comment-16246657 ]

BELUGA BEHR commented on HIVE-16855:

The only thing I can't figure out here is:

{code:java}
private void loadDirectly(MapJoinTableContainer[] mapJoinTables, String inputFileName) throws Exception {
  ...
  MapJoinTableContainer[] tables = sink.getMapJoinTables();
  for (int i = 0; i < sink.getNumParent(); i++) {
    if (sink.getParentOperators().get(i) != null) {
      mapJoinTables[i] = tables[i];
    }
  }
  Arrays.fill(tables, null);
{code}

Why is the 'tables' array being filled with NULL values? It is poor encapsulation for the sink's internal array to be manipulated outside of the sink code itself, and there is no comment explaining why this is being done. It could be that the array is being NULL'ed out to allow for GC, but all of the object references are being copied into 'mapJoinTables', which is an argument to the method, so the referenced objects stay reachable and no GC will occur anyway. I've just removed this call.
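The reference-copying argument in the comment above can be sketched in isolation. This is a minimal illustration, not Hive code: ArrayFillSketch and its Sink class are hypothetical stand-ins, plain Object values replace MapJoinTableContainer, and the parent-operator null check from the quoted snippet is left out for brevity.

```java
import java.util.Arrays;

public class ArrayFillSketch {

    /** Hypothetical stand-in for the sink: it hands out its internal array, not a copy. */
    static final class Sink {
        private final Object[] tables;

        Sink(Object... tables) {
            this.tables = tables;
        }

        Object[] getTables() {
            return tables;
        }
    }

    /** Same shape as the quoted snippet: copy the references out, then null the sink's array. */
    static void copyThenClear(Sink sink, Object[] target) {
        Object[] tables = sink.getTables();
        for (int i = 0; i < tables.length; i++) {
            target[i] = tables[i];
        }
        // Clears the sink's own slots only; the objects are still reachable via target.
        Arrays.fill(tables, null);
    }

    public static void main(String[] args) {
        Sink sink = new Sink("t0", "t1");
        Object[] target = new Object[2];
        copyThenClear(sink, target);
        System.out.println(Arrays.toString(target));           // prints [t0, t1]
        System.out.println(Arrays.toString(sink.getTables())); // prints [null, null]
    }
}
```

The objects remain strongly reachable through the caller's array, so clearing the sink's slots does not by itself free them. What it does change is the sink's visible state, which is one way removing the call could have side effects elsewhere.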
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239656#comment-16239656 ]

BELUGA BEHR commented on HIVE-16855:

[~aihuaxu] :)
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217444#comment-16217444 ]

BELUGA BEHR commented on HIVE-16855:

[~ngangam] Please consider this simple improvement.
[jira] [Commented] (HIVE-16855) org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
[ https://issues.apache.org/jira/browse/HIVE-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16066462#comment-16066462 ]

BELUGA BEHR commented on HIVE-16855:

Test errors unrelated