[ https://issues.apache.org/jira/browse/HIVE-17037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091394#comment-16091394 ]
Hive QA commented on HIVE-17037: -------------------------------- Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12877769/HIVE-17037.02.patch {color:green}SUCCESS:{color} +1 due to 1 test(s) being added or modified. {color:red}ERROR:{color} -1 due to 36 failed/errored test(s), 11075 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestBeeLineDriver.testCliDriver[insert_overwrite_local_directory_1] (batchId=238) org.apache.hadoop.hive.cli.TestBeeLineDriver.testCliDriver[materialized_view_create_rewrite] (batchId=238) org.apache.hadoop.hive.cli.TestMiniLlapCliDriver.testCliDriver[llap_smb] (batchId=143) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[columnstats_part_coltype] (batchId=157) org.apache.hadoop.hive.cli.TestMiniLlapLocalCliDriver.testCliDriver[vector_if_expr] (batchId=145) org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver.testCliDriver[spark_dynamic_partition_pruning] (batchId=167) org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver.testCliDriver[spark_dynamic_partition_pruning_2] (batchId=169) org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver.testCliDriver[spark_explainuser_1] (batchId=168) org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver.testCliDriver[spark_use_op_stats] (batchId=167) org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver.testCliDriver[spark_use_ts_stats_for_mapjoin] (batchId=168) org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver.testCliDriver[spark_vectorized_dynamic_partition_pruning] (batchId=167) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query10] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query14] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query16] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query23] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query2] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query31] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query32] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query33] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query35] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query45] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query56] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query58] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query60] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query64] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query69] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query77] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query78] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query83] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query92] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query94] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query95] (batchId=233) org.apache.hadoop.hive.cli.TestPerfCliDriver.testCliDriver[query97] (batchId=233) org.apache.hive.hcatalog.api.TestHCatClient.testPartitionRegistrationWithCustomSchema (batchId=178) org.apache.hive.hcatalog.api.TestHCatClient.testPartitionSpecRegistrationWithCustomSchema (batchId=178) org.apache.hive.hcatalog.api.TestHCatClient.testTableSchemaPropagation (batchId=178) {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-Build/6071/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/6071/console Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-6071/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 36 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12877769 - PreCommit-HIVE-Build > Extend join algorithm selection to avoid unnecessary input data shuffle > ----------------------------------------------------------------------- > > Key: HIVE-17037 > URL: https://issues.apache.org/jira/browse/HIVE-17037 > Project: Hive > Issue Type: Improvement > Components: Physical Optimizer > Affects Versions: 3.0.0 > Reporter: Jesus Camacho Rodriguez > Assignee: Jesus Camacho Rodriguez > Attachments: HIVE-17037.01.patch, HIVE-17037.02.patch, > HIVE-17037.patch > > > As an example, consider the following query: > {code:sql} > SELECT * > FROM ( > SELECT a.value > FROM src1 a > JOIN src1 b > ON (a.value = b.value) > GROUP BY a.value > ) a > JOIN src > ON (a.value = src.value); > {code} > Currently, the plan generated for Tez will contain an unnecessary shuffle > operation between the subquery and the join, since the records produced by > the subquery are already sorted by the value. > This issue is to extend join algorithm selection to be able to shuffle only > some of the inputs for a given join and avoid unnecessary shuffle operations. -- This message was sent by Atlassian JIRA (v6.4.14#64029)