[jira] [Comment Edited] (TEZ-2307) Possible wrong error message when submitting new dag

2016-01-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15122733#comment-15122733 ] Jeff Zhang edited comment on TEZ-2307 at 1/29/16 1:56 AM: -- bq. I t

[jira] [Updated] (TEZ-2307) Possible wrong error message when submitting new dag

2016-01-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2307: Attachment: TEZ-2307-5.patch > Possible wrong error message when submitting new dag >

[jira] [Commented] (TEZ-2307) Possible wrong error message when submitting new dag

2016-02-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15126295#comment-15126295 ] Jeff Zhang commented on TEZ-2307: - Upload a new patch. [~sseth] Please help review. The fail

[jira] [Updated] (TEZ-2307) Possible wrong error message when submitting new dag

2016-02-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2307: Attachment: TEZ-2307-6.patch > Possible wrong error message when submitting new dag >

[jira] [Commented] (TEZ-2307) Possible wrong error message when submitting new dag

2016-02-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15127417#comment-15127417 ] Jeff Zhang commented on TEZ-2307: - Thanks [~sseth] Upload new patch to address comments, wil

[jira] [Updated] (TEZ-2307) Possible wrong error message when submitting new dag

2016-02-01 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2307: Attachment: TEZ-2307-7.patch rebase the patch to trigger the prebuild > Possible wrong error message when sub

[jira] [Created] (TEZ-3097) Flaky test: TestCommit.testDAGCommitStartedEventFail_OnDAGSuccess

2016-02-04 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-3097: --- Summary: Flaky test: TestCommit.testDAGCommitStartedEventFail_OnDAGSuccess Key: TEZ-3097 URL: https://issues.apache.org/jira/browse/TEZ-3097 Project: Apache Tez Issue

[jira] [Created] (TEZ-3099) Compilation fails of TestExternalTezServicesErrors on hadoop-2.2

2016-02-05 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-3099: --- Summary: Compilation fails of TestExternalTezServicesErrors on hadoop-2.2 Key: TEZ-3099 URL: https://issues.apache.org/jira/browse/TEZ-3099 Project: Apache Tez Issue

[jira] [Updated] (TEZ-3099) Compilation failure of TestExternalTezServicesErrors on hadoop-2.

2016-02-05 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3099: Summary: Compilation failure of TestExternalTezServicesErrors on hadoop-2. (was: Compilation fails of TestExt

[jira] [Updated] (TEZ-3099) Compilation failure of TestExternalTezServicesErrors on hadoop-2.2

2016-02-05 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3099: Summary: Compilation failure of TestExternalTezServicesErrors on hadoop-2.2 (was: Compilation failure of Test

[jira] [Updated] (TEZ-3099) Compilation fails of TestExternalTezServicesErrors on hadoop-2.2

2016-02-05 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3099: Description: \cc [~sseth] https://builds.apache.org/job/Tez-Build-Hadoop-2.2/242/console {noformat} [ERROR]

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Labels: Recovery (was: ) > Running task hangs due to missing event to initialize input >

[jira] [Created] (TEZ-3124) Running task hangs due to missing event to initialize input

2016-02-17 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-3124: --- Summary: Running task hangs due to missing event to initialize input Key: TEZ-3124 URL: https://issues.apache.org/jira/browse/TEZ-3124 Project: Apache Tez Issue Type:

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Summary: Running task hangs due to missing event to initialize input in recovery (was: Running task hangs due

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Description: {noformat} 2016-02-09 04:48:42 Starting to run new task attempt: attempt_1454993155302_0001_1_00

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Description: {noformat} 2016-02-09 04:48:42 Starting to run new task attempt: attempt_1454993155302_0001_1_00

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Attachment: a.log > Running task hangs due to missing event to initialize input in recovery >

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15151735#comment-15151735 ] Jeff Zhang commented on TEZ-3124: - tasksNotYetScheduled is accessed by both dispatcher threa

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Attachment: TEZ-3124-1.patch > Running task hangs due to missing event to initialize input in recovery > -

[jira] [Comment Edited] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15151735#comment-15151735 ] Jeff Zhang edited comment on TEZ-3124 at 2/18/16 4:52 AM: -- Attach o

[jira] [Issue Comment Deleted] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Comment: was deleted (was: Attach one patch to fix it. [~bikassaha] Please help review it. * tasksNotYetSche

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15151804#comment-15151804 ] Jeff Zhang commented on TEZ-3124: - Attach one patch to fix it. [~bikassaha] Please help revi

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Attachment: TEZ-3124-2.patch Add more logging > Running task hangs due to missing event to initialize input i

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Description: {noformat} 2016-02-09 04:48:42 Starting to run new task attempt: attempt_1454993155302_0001_1_00

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Attachment: TEZ-3124-3.patch > Running task hangs due to missing event to initialize input in recovery > -

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15156668#comment-15156668 ] Jeff Zhang commented on TEZ-3124: - Find the root cause, this is due to there's multiple Vert

[jira] [Comment Edited] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15156668#comment-15156668 ] Jeff Zhang edited comment on TEZ-3124 at 2/22/16 9:22 AM: -- Find the

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15158021#comment-15158021 ] Jeff Zhang commented on TEZ-3124: - The failed tests are TestFaultTolerance and TestDAGImpl.t

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Attachment: TEZ-3124-4.patch > Running task hangs due to missing event to initialize input in recovery > -

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15158781#comment-15158781 ] Jeff Zhang commented on TEZ-3124: - [~bikassaha] [~hitesh] The timeout is due to too many tes

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159919#comment-15159919 ] Jeff Zhang commented on TEZ-3124: - RecoveryParser will scan the recovery files from the olde

[jira] [Comment Edited] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159919#comment-15159919 ] Jeff Zhang edited comment on TEZ-3124 at 2/24/16 12:07 AM: --- Recove

[jira] [Comment Edited] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160005#comment-15160005 ] Jeff Zhang edited comment on TEZ-3124 at 2/24/16 1:13 AM: -- bq. Can

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160005#comment-15160005 ] Jeff Zhang commented on TEZ-3124: - bq. Can this happen? Even if no, why add the side effect

[jira] [Updated] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3124: Attachment: TEZ-3124-5.patch Update the patch. Only logging VertexInitializedEvent once. > Running task hang

[jira] [Commented] (TEZ-3124) Running task hangs due to missing event to initialize input in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160355#comment-15160355 ] Jeff Zhang commented on TEZ-3124: - Committed to master > Running task hangs due to missing

[jira] [Updated] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3137: Description: {noformat} 2016-02-19 02:33:18,917 [INFO] [Dispatcher thread {Central}] |impl.VertexImpl|: verte

[jira] [Created] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-3137: --- Summary: Tez task failed with illegal state exception in recovery Key: TEZ-3137 URL: https://issues.apache.org/jira/browse/TEZ-3137 Project: Apache Tez Issue Type: Bug

[jira] [Commented] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15166424#comment-15166424 ] Jeff Zhang commented on TEZ-3137: - StartWhileInitializingTransition is for the root vertex w

[jira] [Updated] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3137: Attachment: TEZ-3137-1.patch > Tez task failed with illegal state exception in recovery >

[jira] [Commented] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15166576#comment-15166576 ] Jeff Zhang commented on TEZ-3137: - [~hitesh] Trivial patch, please help review. This is only

[jira] [Updated] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3137: Attachment: TEZ-3137-2.patch > Tez task failed with illegal state exception in recovery >

[jira] [Updated] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3137: Attachment: (was: TEZ-3137-2.patch) > Tez task failed with illegal state exception in recovery > -

[jira] [Commented] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15166617#comment-15166617 ] Jeff Zhang commented on TEZ-3137: - Upload another patch, recoveredState is NEW by default, n

[jira] [Comment Edited] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15166617#comment-15166617 ] Jeff Zhang edited comment on TEZ-3137 at 2/25/16 2:44 AM: -- Upload a

[jira] [Updated] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3137: Attachment: TEZ-3137-2.patch > Tez task failed with illegal state exception in recovery >

[jira] [Updated] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3137: Attachment: TEZ-3137-3.patch > Tez task failed with illegal state exception in recovery >

[jira] [Commented] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15166684#comment-15166684 ] Jeff Zhang commented on TEZ-3137: - The previous patch (2) is incorrect. cause TestDAGRecover

[jira] [Commented] (TEZ-3137) Tez task failed with illegal state exception in recovery

2016-02-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15166698#comment-15166698 ] Jeff Zhang commented on TEZ-3137: - Committed to branch-0.7 > Tez task failed with illegal s

[jira] [Updated] (TEZ-2863) Container, node, and logs not available in UI for tasks that fail to launch

2016-02-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2863: Attachment: TEZ-2863.3.patch.addendum TEZ-2863.3-branch-0.7.patch.addendum > Container, node,

[jira] [Commented] (TEZ-2863) Container, node, and logs not available in UI for tasks that fail to launch

2016-02-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15171388#comment-15171388 ] Jeff Zhang commented on TEZ-2863: - [~jeagles] I attached 2 addendum patch based on your pat

[jira] [Commented] (TEZ-2863) Container, node, and logs not available in UI for tasks that fail to launch

2016-02-29 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15173002#comment-15173002 ] Jeff Zhang commented on TEZ-2863: - [~jeagles] Just quick go through your new patch, seems yo

[jira] [Updated] (TEZ-2686) TestFaultTolerance fails frequently

2016-03-14 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2686: Assignee: Zhiyuan Yang > TestFaultTolerance fails frequently > > >

[jira] [Commented] (TEZ-2686) TestFaultTolerance fails frequently

2016-03-14 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15194477#comment-15194477 ] Jeff Zhang commented on TEZ-2686: - [~aplusplus] Assign to you, thanks for taking this. > T

[jira] [Commented] (TEZ-2958) Recovered TA, whose commit cannot be recovered, should move to killed state

2016-03-16 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15196990#comment-15196990 ] Jeff Zhang commented on TEZ-2958: - [~jlowe] The patch mostly looks good to me. One concern i

[jira] [Commented] (TEZ-2958) Recovered TA, whose commit cannot be recovered, should move to killed state

2016-03-18 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15198653#comment-15198653 ] Jeff Zhang commented on TEZ-2958: - Thanks [~jlowe] +1 > Recovered TA, whose commit cannot b

[jira] [Resolved] (TEZ-2022) java.lang.IllegalStateException: Vertex: got invalid start event

2016-03-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang resolved TEZ-2022. - Resolution: Duplicate Fixed in TEZ-3137 > java.lang.IllegalStateException: Vertex: got invalid start event

[jira] [Commented] (TEZ-3213) Uncaught exception during vertex recovery leads to invalid state transition loop

2016-04-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15259407#comment-15259407 ] Jeff Zhang commented on TEZ-3213: - Thanks [~ebadger]. The patch lgtm. It is required for DA

[jira] [Commented] (TEZ-3213) Uncaught exception during vertex recovery leads to invalid state transition loop

2016-04-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15260183#comment-15260183 ] Jeff Zhang commented on TEZ-3213: - It is not necessary to do that in master. There's no RECO

[jira] [Commented] (TEZ-3239) ShuffleVertexManager recovery issue when auto parallelism is enabled

2016-05-02 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15267849#comment-15267849 ] Jeff Zhang commented on TEZ-3239: - [~mingma] Are you in master or 0.7 ? It's a known issue f

[jira] [Commented] (TEZ-3239) ShuffleVertexManager recovery issue when auto parallelism is enabled

2016-05-02 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15267916#comment-15267916 ] Jeff Zhang commented on TEZ-3239: - [~mingma] Could you attach the AM log ? > ShuffleVertexM

[jira] [Updated] (TEZ-3258) Jvm Checker has a bug for JVM options

2016-05-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3258: Assignee: Azuryy(Chijiong) > Jvm Checker has a bug for JVM options > - > >

[jira] [Updated] (TEZ-3258) Jvm Checker has a bug for JVM options

2016-05-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-3258: Assignee: Fengdong Yu (was: Azuryy(Chijiong)) > Jvm Checker has a bug for JVM options > -

[jira] [Resolved] (TEZ-1577) Recover attempt information when recovering from task desired state

2016-05-18 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang resolved TEZ-1577. - Resolution: Won't Fix > Recover attempt information when recovering from task desired state > --

[jira] [Commented] (TEZ-1577) Recover attempt information when recovering from task desired state

2016-05-18 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15290342#comment-15290342 ] Jeff Zhang commented on TEZ-1577: - Yes, we can close it as won't-fix. Currently task attempt

[jira] [Commented] (TEZ-3273) In one vexter has some task failed,DAG will stuck forever.

2016-05-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15299834#comment-15299834 ] Jeff Zhang commented on TEZ-3273: - Could you attach the yarn app log ? And what version of t

[jira] [Comment Edited] (TEZ-3273) In one vexter has some task failed,DAG will stuck forever.

2016-05-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15299834#comment-15299834 ] Jeff Zhang edited comment on TEZ-3273 at 5/25/16 10:34 AM: --- Could

[jira] [Commented] (TEZ-3273) In one vexter has some task failed,DAG will stuck forever.

2016-05-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15301572#comment-15301572 ] Jeff Zhang commented on TEZ-3273: - This is RM log, you need to look for its app log. Either

[jira] [Commented] (TEZ-3273) In one vexter has some task failed,DAG will stuck forever.

2016-05-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15301598#comment-15301598 ] Jeff Zhang commented on TEZ-3273: - Use the instruction here to enable log aggregation. http

[jira] [Commented] (TEZ-2846) Flaky test: TestCommit.testVertexCommit_OnDAGSuccess

2016-06-07 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15319813#comment-15319813 ] Jeff Zhang commented on TEZ-2846: - +1 > Flaky test: TestCommit.testVertexCommit_OnDAGSucces

[jira] [Commented] (TEZ-3343) sqoop import can't success

2016-07-12 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15374251#comment-15374251 ] Jeff Zhang commented on TEZ-3343: - Could you attach the yarn app log ? You can ask this kind

[jira] [Commented] (TEZ-3343) sqoop import can't success

2016-07-14 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15376598#comment-15376598 ] Jeff Zhang commented on TEZ-3343: - Are you using CDH ? MRVersion is only in CDH but not in a

[jira] [Commented] (TEZ-3343) sqoop import can't success

2016-07-14 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15376647#comment-15376647 ] Jeff Zhang commented on TEZ-3343: - Please try to build tez using CDH. "mvn clean package -D

[jira] [Commented] (TEZ-3379) Tez analyzer: Move sysout to log4j

2016-07-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15395036#comment-15395036 ] Jeff Zhang commented on TEZ-3379: - +1 > Tez analyzer: Move sysout to log4j > --

[jira] [Commented] (TEZ-3416) ArrayIndexOutOfBoundsException happens in ScatterGatherEdgeManager after DAG recovery

2016-08-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15431837#comment-15431837 ] Jeff Zhang commented on TEZ-3416: - [~aplusplus] The app log is not completed ? I only see th

[jira] [Commented] (TEZ-3416) ArrayIndexOutOfBoundsException happens in ScatterGatherEdgeManager after DAG recovery

2016-08-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15434231#comment-15434231 ] Jeff Zhang commented on TEZ-3416: - If Reducer3 use NoOpVertexManager, that means it has alre

[jira] [Commented] (TEZ-3416) ArrayIndexOutOfBoundsException happens in ScatterGatherEdgeManager after DAG recovery

2016-08-23 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15434236#comment-15434236 ] Jeff Zhang commented on TEZ-3416: - What is in the VertexConfigurationDoneEvent of Reducer 3

[jira] [Commented] (TEZ-3572) wrong MR spell in INSTALL.md

2017-01-09 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-3572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15814088#comment-15814088 ] Jeff Zhang commented on TEZ-3572: - [~ferhui] MRR is correct. That means mapper->reducer->red

[jira] [Assigned] (TEZ-1064) Restore dagName Set for duplicate detection in recovered AMs.

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang reassigned TEZ-1064: --- Assignee: Jeff Zhang (was: Hitesh Shah) > Restore dagName Set for duplicate detection in recovered AMs

[jira] [Updated] (TEZ-1064) Restore dagName Set for duplicate detection in recovered AMs.

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1064: Attachment: Tez-1064.patch Attach the patch > Restore dagName Set for duplicate detection in recovered AMs.

[jira] [Updated] (TEZ-1064) Restore dagName Set for duplicate detection in recovered AMs.

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1064: Attachment: Tez-1064-2.patch > Restore dagName Set for duplicate detection in recovered AMs. > -

[jira] [Updated] (TEZ-1064) Restore dagName Set for duplicate detection in recovered AMs.

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1064: Attachment: (was: Tez-1064-2.patch) > Restore dagName Set for duplicate detection in recovered AMs. > --

[jira] [Updated] (TEZ-1064) Restore dagName Set for duplicate detection in recovered AMs.

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1064: Attachment: Tez-1064-2.patch > Restore dagName Set for duplicate detection in recovered AMs. > -

[jira] [Commented] (TEZ-1064) Restore dagName Set for duplicate detection in recovered AMs.

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14077143#comment-14077143 ] Jeff Zhang commented on TEZ-1064: - [~hitesh] I update the patch with the following change: *

[jira] [Created] (TEZ-1325) Make dagSummaryDataMap ordered to help find the lastCompletedDAG

2014-07-28 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-1325: --- Summary: Make dagSummaryDataMap ordered to help find the lastCompletedDAG Key: TEZ-1325 URL: https://issues.apache.org/jira/browse/TEZ-1325 Project: Apache Tez Issue

[jira] [Updated] (TEZ-1325) Make dagSummaryDataMap (RecoverParser) ordered to help find the lastCompletedDAG

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1325: Summary: Make dagSummaryDataMap (RecoverParser) ordered to help find the lastCompletedDAG (was: Make dagSumm

[jira] [Updated] (TEZ-1325) Make dagSummaryDataMap (RecoverParser) LinkedHashMap to help find the lastCompletedDAG

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1325: Summary: Make dagSummaryDataMap (RecoverParser) LinkedHashMap to help find the lastCompletedDAG (was: Make d

[jira] [Updated] (TEZ-1325) Change dagSummaryDataMap in RecoverParser to LinkedHashMap to help find the lastCompletedDAG

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1325: Summary: Change dagSummaryDataMap in RecoverParser to LinkedHashMap to help find the lastCompletedDAG (was:

[jira] [Commented] (TEZ-1325) Change dagSummaryDataMap in RecoverParser to LinkedHashMap to help find the lastCompletedDAG

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14077194#comment-14077194 ] Jeff Zhang commented on TEZ-1325: - in the current code, dagSummaryDataMap in RecoverParser i

[jira] [Updated] (TEZ-1325) Change dagSummaryDataMap in RecoverParser to LinkedHashMap to help find the lastCompletedDAG

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1325: Issue Type: Sub-task (was: Bug) Parent: TEZ-15 > Change dagSummaryDataMap in RecoverParser to Linked

[jira] [Created] (TEZ-1326) AMStartedEvent should not be recovery event

2014-07-28 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-1326: --- Summary: AMStartedEvent should not be recovery event Key: TEZ-1326 URL: https://issues.apache.org/jira/browse/TEZ-1326 Project: Apache Tez Issue Type: Bug Affects

[jira] [Updated] (TEZ-1326) AMStartedEvent should not be recovery event

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1326: Issue Type: Sub-task (was: Bug) Parent: TEZ-15 > AMStartedEvent should not be recovery event > -

[jira] [Updated] (TEZ-1326) AMStartedEvent should not be recovery event

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1326: Attachment: Tez-1326.patch > AMStartedEvent should not be recovery event > --

[jira] [Created] (TEZ-1327) Rename AMLaunchedEvent to AMInitializedEvent

2014-07-28 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-1327: --- Summary: Rename AMLaunchedEvent to AMInitializedEvent Key: TEZ-1327 URL: https://issues.apache.org/jira/browse/TEZ-1327 Project: Apache Tez Issue Type: Bug

[jira] [Updated] (TEZ-1327) Rename AMLaunchedEvent to AMInitializedEvent

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1327: Issue Type: Sub-task (was: Bug) Parent: TEZ-15 > Rename AMLaunchedEvent to AMInitializedEvent >

[jira] [Resolved] (TEZ-1327) Rename AMLaunchedEvent to AMInitializedEvent

2014-07-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang resolved TEZ-1327. - Resolution: Won't Fix Close this issue. Because I find the launchTime in AMLaunchedEvent is the time when

[jira] [Assigned] (TEZ-1342) tez.am.client.am.port-range not taking effect

2014-07-30 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang reassigned TEZ-1342: --- Assignee: Jeff Zhang > tez.am.client.am.port-range not taking effect >

[jira] [Updated] (TEZ-1342) tez.am.client.am.port-range not taking effect

2014-07-30 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1342: Attachment: Tez-1342.patch Attach the patch. We should pass the port-range property name instead of the prop

[jira] [Assigned] (TEZ-1065) DAGStatus.getVertexStatus and other vertex related API's should maintain vertex order

2014-07-30 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang reassigned TEZ-1065: --- Assignee: Jeff Zhang (was: Rekha Joshi) > DAGStatus.getVertexStatus and other vertex related API's sho

[jira] [Commented] (TEZ-1065) DAGStatus.getVertexStatus and other vertex related API's should maintain vertex order

2014-07-30 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14079645#comment-14079645 ] Jeff Zhang commented on TEZ-1065: - Will work on this. > DAGStatus.getVertexStatus and other

<    10   11   12   13   14   15   16   17   18   19   >