[ https://issues.apache.org/jira/browse/TEZ-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15553495#comment-15553495 ]
TezQA commented on TEZ-3452: ---------------------------- {color:green}+1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12832042/TEZ-3452.3.patch against master revision 149db1b. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 3.0.1) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in . Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/2022//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/2022//console This message is automatically generated. > Auto-reduce parallelism calculation can overflow with large inputs > ------------------------------------------------------------------ > > Key: TEZ-3452 > URL: https://issues.apache.org/jira/browse/TEZ-3452 > Project: Apache Tez > Issue Type: Bug > Reporter: Jonathan Eagles > Assignee: Jonathan Eagles > Attachments: TEZ-3452.1.patch, TEZ-3452.2.patch, TEZ-3452.3.patch > > > Overflow can occur when the numTasks is high (say 45000) and outputSize is > high (say 311TB) and slow start is set to 1.0. > {code:title=ShuffleVertexManager} > for (Map.Entry<String, SourceVertexInfo> vInfo : getBipartiteInfo()) { > SourceVertexInfo srcInfo = vInfo.getValue(); > if (srcInfo.numTasks > 0 && srcInfo.numVMEventsReceived > 0) { > // this assumes that 1 vmEvent is received per completed task - > TEZ-2961 > expectedTotalSourceTasksOutputSize += > (srcInfo.numTasks * srcInfo.outputSize) / > srcInfo.numVMEventsReceived; > } > } > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)