[jira] [Commented] (HIVE-8597) SMB join small table side should use the same set of serialized payloads across tasks
[ https://issues.apache.org/jira/browse/HIVE-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14187797#comment-14187797 ] Gunther Hagleitner commented on HIVE-8597: -- +1 for .14 and trunk SMB join small table side should use the same set of serialized payloads across tasks - Key: HIVE-8597 URL: https://issues.apache.org/jira/browse/HIVE-8597 Project: Hive Issue Type: Improvement Components: Tez Affects Versions: 0.14.0 Reporter: Siddharth Seth Assignee: Siddharth Seth Fix For: 0.14.0 Attachments: HIVE-8597.1.patch Each task sees all splits belonging to the bucket being processed by the task. At the moment, we end up using different instances of the same serialized split which adds unnecessary memory pressure. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-8597) SMB join small table side should use the same set of serialized payloads across tasks
[ https://issues.apache.org/jira/browse/HIVE-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14187806#comment-14187806 ] Vikram Dixit K commented on HIVE-8597: -- Committed to trunk and branch 0.14. Thanks Siddharth Seth. SMB join small table side should use the same set of serialized payloads across tasks - Key: HIVE-8597 URL: https://issues.apache.org/jira/browse/HIVE-8597 Project: Hive Issue Type: Improvement Components: Tez Affects Versions: 0.14.0 Reporter: Siddharth Seth Assignee: Siddharth Seth Fix For: 0.14.0 Attachments: HIVE-8597.1.patch Each task sees all splits belonging to the bucket being processed by the task. At the moment, we end up using different instances of the same serialized split which adds unnecessary memory pressure. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-8597) SMB join small table side should use the same set of serialized payloads across tasks
[ https://issues.apache.org/jira/browse/HIVE-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14184067#comment-14184067 ] Hive QA commented on HIVE-8597: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12676962/HIVE-8597.1.patch {color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 6577 tests executed *Failed tests:* {noformat} org.apache.hive.minikdc.TestJdbcWithMiniKdc.testNegativeTokenAuth {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/1454/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/1454/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-1454/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 1 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12676962 - PreCommit-HIVE-TRUNK-Build SMB join small table side should use the same set of serialized payloads across tasks - Key: HIVE-8597 URL: https://issues.apache.org/jira/browse/HIVE-8597 Project: Hive Issue Type: Improvement Components: Tez Affects Versions: 0.14.0 Reporter: Siddharth Seth Assignee: Siddharth Seth Fix For: 0.14.0 Attachments: HIVE-8597.1.patch Each task sees all splits belonging to the bucket being processed by the task. At the moment, we end up using different instances of the same serialized split which adds unnecessary memory pressure. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-8597) SMB join small table side should use the same set of serialized payloads across tasks
[ https://issues.apache.org/jira/browse/HIVE-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14183755#comment-14183755 ] Vikram Dixit K commented on HIVE-8597: -- LGTM +1. +1 for 0.14 as well. SMB join small table side should use the same set of serialized payloads across tasks - Key: HIVE-8597 URL: https://issues.apache.org/jira/browse/HIVE-8597 Project: Hive Issue Type: Improvement Components: Tez Affects Versions: 0.14.0 Reporter: Siddharth Seth Assignee: Siddharth Seth Fix For: 0.14.0 Attachments: HIVE-8597.1.patch Each task sees all splits belonging to the bucket being processed by the task. At the moment, we end up using different instances of the same serialized split which adds unnecessary memory pressure. -- This message was sent by Atlassian JIRA (v6.3.4#6332)