[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-16 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790778#comment-14790778 ] Bikas Saha commented on TEZ-2834: - If this cluster has the latest YARN then the AM logs can be s

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-16 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790783#comment-14790783 ] Gopal V commented on TEZ-2834: -- [~bikassaha]: YARN-4149? That was fixed last night, it's not de

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-16 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790819#comment-14790819 ] Bikas Saha commented on TEZ-2834: - The preemption code logs are all debug. This issue cannot

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-16 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790823#comment-14790823 ] Bikas Saha commented on TEZ-2834: - Was the cluster fully occupied when this was happening? M

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-18 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876689#comment-14876689 ] Bikas Saha commented on TEZ-2834: - Can you please try with the attached patch using master b

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-18 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876693#comment-14876693 ] Gopal V commented on TEZ-2834: -- Rolling the patch into the weekend tests - do you want to flip

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-18 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876696#comment-14876696 ] Bikas Saha commented on TEZ-2834: - The current periodic logging and the logs from this pat

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-18 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876745#comment-14876745 ] Rajesh Balamohan commented on TEZ-2834: --- Job completed with the patch. RM sometimes re

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-18 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876894#comment-14876894 ] Bikas Saha commented on TEZ-2834: - [~rajesh.balamohan] Please review! > tez app hangs at la

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-18 Thread TezQA (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14876918#comment-14876918 ] TezQA commented on TEZ-2834: {color:red}-1 overall{color}. Here are the results of testing the

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-19 Thread TezQA (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14877331#comment-14877331 ] TezQA commented on TEZ-2834: {color:green}+1 overall{color}. Here are the results of testing th

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-20 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14900118#comment-14900118 ] Bikas Saha commented on TEZ-2834: - [~rajesh.balamohan] Please review when convenient! > tez

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-20 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14900142#comment-14900142 ] Rajesh Balamohan commented on TEZ-2834: --- lgtm. +1 Very minor: {noformat} !highestPri

[jira] [Commented] (TEZ-2834) tez app hangs at large scale (~30TB)

2015-09-21 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14901120#comment-14901120 ] Bikas Saha commented on TEZ-2834: - compareTo will not handle null and NPE is highestWaitingR