[jira] [Updated] (TEZ-1345) Add checks to guarantee all init events are written to recovery to consider vertex initialized

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1345: Attachment: Tez-1345-12.patch Minor update on the patch. * Since using ConcurrentHashMap. there's no need to

[jira] [Updated] (TEZ-1524) getDAGStatus seems to fork out the entire JVM

2014-09-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1524: - Attachment: TEZ-1524.2.patch > getDAGStatus seems to fork out the entire JVM > -

[jira] [Updated] (TEZ-1345) Add checks to guarantee all init events are written to recovery to consider vertex initialized

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1345: Attachment: Tez-1345-11.patch > Add checks to guarantee all init events are written to recovery to consider >

[jira] [Commented] (TEZ-1345) Add checks to guarantee all init events are written to recovery to consider vertex initialized

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129680#comment-14129680 ] Jeff Zhang commented on TEZ-1345: - [~hitesh] Attach the new patch * Remove vertexName in V

[jira] [Updated] (TEZ-1570) Setting InputInitializer parallelism to 0 should not crash AM

2014-09-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated TEZ-1570: -- Summary: Setting InputInitializer parallelism to 0 should not crash AM (was: Setting InputInitial

[jira] [Commented] (TEZ-1357) Display better diagnostics when AM fails to launch

2014-09-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129664#comment-14129664 ] Rajesh Balamohan commented on TEZ-1357: --- Created TEZ-1570 to track the second issue se

[jira] [Created] (TEZ-1570) Setting InputInitializer parallelism to 0 crashes AM

2014-09-10 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created TEZ-1570: - Summary: Setting InputInitializer parallelism to 0 crashes AM Key: TEZ-1570 URL: https://issues.apache.org/jira/browse/TEZ-1570 Project: Apache Tez Issue T

[jira] [Commented] (TEZ-850) Recovery unit tests

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129641#comment-14129641 ] Jeff Zhang commented on TEZ-850: [~hitesh] Update the patch removing the "//TODO " Maybe it i

[jira] [Updated] (TEZ-850) Recovery unit tests

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-850: --- Attachment: Tez-850-6.patch > Recovery unit tests > --- > > Key: TEZ-850 >

[jira] [Updated] (TEZ-1494) DAG hangs waiting for ShuffleManager.getNextInput()

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-1494: - Fix Version/s: (was: 0.6.0) > DAG hangs waiting for ShuffleManager.getNextInput() >

[jira] [Commented] (TEZ-1494) DAG hangs waiting for ShuffleManager.getNextInput()

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129627#comment-14129627 ] Hitesh Shah commented on TEZ-1494: -- Setting the fix version to the lowest version the patch

[jira] [Commented] (TEZ-1357) Display better diagnostics when AM fails to launch

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129625#comment-14129625 ] Hitesh Shah commented on TEZ-1357: -- [~rajesh.balamohan] Good finds. Lets split this jira in

[jira] [Commented] (TEZ-850) Recovery unit tests

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129623#comment-14129623 ] Hitesh Shah commented on TEZ-850: - Question on "//TODO something wrong" in the patch - could

[jira] [Commented] (TEZ-850) Recovery unit tests

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129618#comment-14129618 ] Hitesh Shah commented on TEZ-850: - [~zjffdu] they are showing up in Intellij. I am not sure w

[jira] [Commented] (TEZ-850) Recovery unit tests

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129615#comment-14129615 ] Jeff Zhang commented on TEZ-850: [~hitesh] Attach the patch. BTW, could you let me know what

[jira] [Updated] (TEZ-850) Recovery unit tests

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-850: --- Attachment: Tez-850-5.patch > Recovery unit tests > --- > > Key: TEZ-850 >

[jira] [Commented] (TEZ-850) Recovery unit tests

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129596#comment-14129596 ] Hitesh Shah commented on TEZ-850: - There were minor warnings for example: - instead of us

[jira] [Commented] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129592#comment-14129592 ] Jeff Zhang commented on TEZ-1559: - Attach the new patch. bq. Recovery data is meant to be i

[jira] [Updated] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1559: Attachment: Tez-1559-3.patch > Add system tests for AM recovery > > >

[jira] [Created] (TEZ-1569) Add tests for preemption

2014-09-10 Thread Bikas Saha (JIRA)
Bikas Saha created TEZ-1569: --- Summary: Add tests for preemption Key: TEZ-1569 URL: https://issues.apache.org/jira/browse/TEZ-1569 Project: Apache Tez Issue Type: Test Reporter: Bikas Sa

[jira] [Comment Edited] (TEZ-850) Recovery unit tests

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129566#comment-14129566 ] Jeff Zhang edited comment on TEZ-850 at 9/11/14 3:13 AM: - Attach the

[jira] [Commented] (TEZ-850) Recovery unit tests

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129566#comment-14129566 ] Jeff Zhang commented on TEZ-850: Attach the new patch. * rebase it * remove MockVertexImpl w

[jira] [Updated] (TEZ-850) Recovery unit tests

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-850: --- Attachment: Tez-850-4.patch > Recovery unit tests > --- > > Key: TEZ-850 >

[jira] [Commented] (TEZ-1534) Make client side configs available to AM and tasks

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129534#comment-14129534 ] Hitesh Shah commented on TEZ-1534: -- Mostly looks good except for: {code} +Configuratio

[jira] [Commented] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129530#comment-14129530 ] Hitesh Shah commented on TEZ-1559: -- bq. VertexId is generated internally in tez which is no

[jira] [Commented] (TEZ-1567) Avoid blacklisting nodes if the cluster size is 1

2014-09-10 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129490#comment-14129490 ] Bikas Saha commented on TEZ-1567: - The threshold logic was inherited from MR. There might be

[jira] [Commented] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129472#comment-14129472 ] Jeff Zhang commented on TEZ-1559: - [~hitesh] Attach the new patch. * add counter verificat

[jira] [Updated] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1559: Attachment: Tez-1559-2.patch > Add system tests for AM recovery > > >

[jira] [Commented] (TEZ-1357) Display better diagnostics when AM fails to launch

2014-09-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129423#comment-14129423 ] Rajesh Balamohan commented on TEZ-1357: --- Adding couple of scenarios that I encountered

[jira] [Commented] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129409#comment-14129409 ] Jeff Zhang commented on TEZ-1559: - bq. in the future, it will be helpful to reviewers if whi

[jira] [Updated] (TEZ-1494) DAG hangs waiting for ShuffleManager.getNextInput()

2014-09-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated TEZ-1494: -- Fix Version/s: 0.6.0 0.5.1 > DAG hangs waiting for ShuffleManager.getNextInput(

[jira] [Updated] (TEZ-1568) Add system test for propagation of diagnostics for errors

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1568: Issue Type: Sub-task (was: Bug) Parent: TEZ-1240 > Add system test for propagation of diagnostics for

[jira] [Created] (TEZ-1568) Add system test for propagation of diagnostics for errors

2014-09-10 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-1568: --- Summary: Add system test for propagation of diagnostics for errors Key: TEZ-1568 URL: https://issues.apache.org/jira/browse/TEZ-1568 Project: Apache Tez Issue Type: Bu

[jira] [Commented] (TEZ-1567) Avoid blacklisting nodes if the cluster size is 1

2014-09-10 Thread Siddharth Seth (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129395#comment-14129395 ] Siddharth Seth commented on TEZ-1567: - Yes, this should consider the threshold to figure

[jira] [Updated] (TEZ-1534) Make client side configs available to AM and tasks

2014-09-10 Thread Siddharth Seth (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth updated TEZ-1534: Attachment: TEZ-1534.2.txt Updated patch - which does a blanket allow all, and removes all filtering.

[jira] [Updated] (TEZ-1564) State machine error: Invalid event: T_SCHEDULE at SCHEDULED

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-1564: - Target Version/s: 0.5.1 > State machine error: Invalid event: T_SCHEDULE at SCHEDULED >

[jira] [Updated] (TEZ-1564) State machine error: Invalid event: T_SCHEDULE at SCHEDULED

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-1564: - Priority: Critical (was: Major) > State machine error: Invalid event: T_SCHEDULE at SCHEDULED > ---

[jira] [Comment Edited] (TEZ-1567) Avoid blacklisting nodes if the cluster size is 1

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129360#comment-14129360 ] Hitesh Shah edited comment on TEZ-1567 at 9/10/14 11:35 PM: Mayb

[jira] [Comment Edited] (TEZ-1567) Avoid blacklisting nodes if the cluster size is 1

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129360#comment-14129360 ] Hitesh Shah edited comment on TEZ-1567 at 9/10/14 11:34 PM: Mayb

[jira] [Updated] (TEZ-1567) Avoid blacklisting nodes if the cluster size is 1

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-1567: - Labels: newbie (was: ) > Avoid blacklisting nodes if the cluster size is 1 > --

[jira] [Commented] (TEZ-1567) Avoid blacklisting nodes if the cluster size is 1

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129360#comment-14129360 ] Hitesh Shah commented on TEZ-1567: -- Maybe generalize this to say that blacklisting is disab

[jira] [Created] (TEZ-1567) Avoid blacklisting nodes if the cluster size is 1

2014-09-10 Thread Siddharth Seth (JIRA)
Siddharth Seth created TEZ-1567: --- Summary: Avoid blacklisting nodes if the cluster size is 1 Key: TEZ-1567 URL: https://issues.apache.org/jira/browse/TEZ-1567 Project: Apache Tez Issue Type: Bu

[jira] [Comment Edited] (TEZ-1534) Make client side configs available to AM and tasks

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129312#comment-14129312 ] Hitesh Shah edited comment on TEZ-1534 at 9/10/14 11:05 PM: Comm

[jira] [Commented] (TEZ-1534) Make client side configs available to AM and tasks

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129312#comment-14129312 ] Hitesh Shah commented on TEZ-1534: -- Comments: {code} +if (allowedPrefixes.length == 0)

[jira] [Updated] (TEZ-1157) Optimize broadcast :- Tasks pertaining to same job in same machine should not download multiple copies of broadcast data

2014-09-10 Thread Gopal V (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gopal V updated TEZ-1157: - Attachment: TEZ-1157.6.patch > Optimize broadcast :- Tasks pertaining to same job in same machine should not > down

[jira] [Comment Edited] (TEZ-1539) Allow a FIRE_ONCE_ON_SUCCESS model for events generated by user code

2014-09-10 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129245#comment-14129245 ] Bikas Saha edited comment on TEZ-1539 at 9/10/14 10:28 PM: --- Settin

[jira] [Commented] (TEZ-1539) Allow a FIRE_ONCE_ON_SUCCESS model for events generated by user code

2014-09-10 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129245#comment-14129245 ] Bikas Saha commented on TEZ-1539: - Setting aside the discussion on the change in semantics a

[jira] [Assigned] (TEZ-1433) Invalid credentials can be used when a DAG is submitted to a session which has timed out

2014-09-10 Thread Siddharth Seth (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth reassigned TEZ-1433: --- Assignee: Siddharth Seth (was: Jonathan Eagles) > Invalid credentials can be used when a DAG i

[jira] [Updated] (TEZ-1534) Make client side configs available to AM and tasks

2014-09-10 Thread Siddharth Seth (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth updated TEZ-1534: Attachment: TEZ-1534.1.txt Patch allows additional parameters to be filtered through - selective list

[jira] [Assigned] (TEZ-1534) Make client side configs available to AM and tasks

2014-09-10 Thread Siddharth Seth (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth reassigned TEZ-1534: --- Assignee: Siddharth Seth > Make client side configs available to AM and tasks > ---

[jira] [Commented] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129161#comment-14129161 ] Hitesh Shah commented on TEZ-1559: -- Comments: - in the future, it will be helpful to rev

[jira] [Commented] (TEZ-850) Recovery unit tests

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129117#comment-14129117 ] Hitesh Shah commented on TEZ-850: - Comments on patch: - needs a minor rebase - MockVert

[jira] [Commented] (TEZ-1433) Invalid credentials can be used when a DAG is submitted to a session which has timed out

2014-09-10 Thread Jonathan Eagles (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129104#comment-14129104 ] Jonathan Eagles commented on TEZ-1433: -- I'm not going to be able to get to this soon. C

[jira] [Created] (TEZ-1566) Reduce log verbosity

2014-09-10 Thread Siddharth Seth (JIRA)
Siddharth Seth created TEZ-1566: --- Summary: Reduce log verbosity Key: TEZ-1566 URL: https://issues.apache.org/jira/browse/TEZ-1566 Project: Apache Tez Issue Type: Bug Reporter: Siddh

[jira] [Created] (TEZ-1565) Add tests for checking custom vertex managers like auto-reduce parallelism in recovery

2014-09-10 Thread Hitesh Shah (JIRA)
Hitesh Shah created TEZ-1565: Summary: Add tests for checking custom vertex managers like auto-reduce parallelism in recovery Key: TEZ-1565 URL: https://issues.apache.org/jira/browse/TEZ-1565 Project: Apa

[jira] [Commented] (TEZ-1524) getDAGStatus seems to fork out the entire JVM

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129072#comment-14129072 ] Hitesh Shah commented on TEZ-1524: -- Comment on patch: {code} +ACLManager aclManager =

[jira] [Updated] (TEZ-1542) Local Mode crashes on concurrentModificationException

2014-09-10 Thread Chen He (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chen He updated TEZ-1542: - Attachment: (was: TEZ-1542-3.patch) > Local Mode crashes on concurrentModificationException > --

[jira] [Updated] (TEZ-1542) Local Mode crashes on concurrentModificationException

2014-09-10 Thread Chen He (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chen He updated TEZ-1542: - Attachment: TEZ-1542-3.patch > Local Mode crashes on concurrentModificationException > -

[jira] [Updated] (TEZ-1542) Local Mode crashes on concurrentModificationException

2014-09-10 Thread Chen He (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chen He updated TEZ-1542: - Attachment: TEZ-1542-3.patch shutdown threadpool after test. > Local Mode crashes on concurrentModificationExceptio

[jira] [Updated] (TEZ-1524) getDAGStatus seems to fork out the entire JVM

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-1524: - Assignee: Gopal V > getDAGStatus seems to fork out the entire JVM >

[jira] [Updated] (TEZ-1542) Local Mode crashes on concurrentModificationException

2014-09-10 Thread Chen He (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chen He updated TEZ-1542: - Attachment: TEZ-1542-2.patch add concurrent unit test > Local Mode crashes on concurrentModificationException > ---

[jira] [Commented] (TEZ-1494) DAG hangs waiting for ShuffleManager.getNextInput()

2014-09-10 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128856#comment-14128856 ] Bikas Saha commented on TEZ-1494: - lgtm. lets get this in for 0.5.1 and continue to investig

[jira] [Commented] (TEZ-1539) Allow a FIRE_ONCE_ON_SUCCESS model for events generated by user code

2014-09-10 Thread Siddharth Seth (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128849#comment-14128849 ] Siddharth Seth commented on TEZ-1539: - The Initializer events are generated by tasks and

[jira] [Commented] (TEZ-1562) DAGImpl commitOrAbortOutputs takes long time (300+ seconds) for reducer vertex with 4000+ tasks

2014-09-10 Thread Bikas Saha (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128829#comment-14128829 ] Bikas Saha commented on TEZ-1562: - https://issues.apache.org/jira/browse/MAPREDUCE-4815 > D

[jira] [Commented] (TEZ-1526) LoadingCache for TezTaskID slow for large jobs

2014-09-10 Thread Siddharth Seth (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128826#comment-14128826 ] Siddharth Seth commented on TEZ-1526: - [~jeagles] - the loading cache gives us the benef

[jira] [Resolved] (TEZ-1519) TezTaskRunner should not initialize TezConfiguration in TezChild

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah resolved TEZ-1519. -- Resolution: Fixed Fix Version/s: 0.5.1 Committed to master and branch 0.5. Thanks [~pramachandran

[jira] [Commented] (TEZ-1519) TezTaskRunner should not initialize TezConfiguration in TezChild

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128676#comment-14128676 ] Hitesh Shah commented on TEZ-1519: -- +1. Committing shortly. > TezTaskRunner should not in

[jira] [Commented] (TEZ-1563) TezClient.submitDAGSession alters DAG local resources regardless of DAG submission

2014-09-10 Thread Josh Elser (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128675#comment-14128675 ] Josh Elser commented on TEZ-1563: - Thanks for the link. Your patch on TEZ-1433 appears that

[jira] [Commented] (TEZ-1345) Add checks to guarantee all init events are written to recovery to consider vertex initialized

2014-09-10 Thread Hitesh Shah (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128670#comment-14128670 ] Hitesh Shah commented on TEZ-1345: -- Mostly looks good. 2 comments: - any reason for usin

[jira] [Commented] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14128583#comment-14128583 ] Jeff Zhang commented on TEZ-1559: - Attach the patch. * Add unit test covering the case in t

[jira] [Updated] (TEZ-1559) Add system tests for AM recovery

2014-09-10 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1559: Attachment: Tez-1559.patch > Add system tests for AM recovery > > >

[jira] [Updated] (TEZ-1543) Shuffle Errors on heavy load (causing task retries)

2014-09-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated TEZ-1543: -- Summary: Shuffle Errors on heavy load (causing task retries) (was: Shuffle Errors on heavy load)

[jira] [Updated] (TEZ-1543) Shuffle Errors on heavy load

2014-09-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated TEZ-1543: -- Attachment: TEZ-1543.1.patch with_patch.svg syn_app_with_issue.svg

[jira] [Assigned] (TEZ-1543) Shuffle Errors on heavy load

2014-09-10 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan reassigned TEZ-1543: - Assignee: Rajesh Balamohan > Shuffle Errors on heavy load > >