[jira] [Updated] (TEZ-2460) Temporary solution for issue due to YARN-2560

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2460: Attachment: TEZ-2460-2.patch Upload new to address the issue in comment. (Add timeout property in TezConfigu

[jira] [Updated] (TEZ-2460) Temporary solution for issue due to YARN-2560

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2460: Attachment: TEZ-2460-3.patch Minor update on the patch (add comment for the new property) > Temporary solutio

[jira] [Commented] (TEZ-2460) Temporary solution for issue due to YARN-2560

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551728#comment-14551728 ] Jeff Zhang commented on TEZ-2460: - Committed to master & branch-0.7 > Temporary solution fo

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-7.patch Retrigger the prebuild > Refactor DAGAppMaster to state machine based >

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-7.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Commented] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551825#comment-14551825 ] Jeff Zhang commented on TEZ-1273: - [~hitesh] Please help review it. * Please check the atta

[jira] [Comment Edited] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551825#comment-14551825 ] Jeff Zhang edited comment on TEZ-1273 at 5/20/15 5:17 AM: -- [~hitesh

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: DAGAppMaster_4.pdf > Refactor DAGAppMaster to state machine based > --

[jira] [Comment Edited] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551825#comment-14551825 ] Jeff Zhang edited comment on TEZ-1273 at 5/20/15 5:17 AM: -- [~hitesh

[jira] [Updated] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2456: Description: Currently we don't have spec for the recovery event logging. Recovery would be fragile to code c

[jira] [Updated] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2456: Description: Currently we don't have spec for the recovery event logging. Recovery would be fragile to code c

[jira] [Commented] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2015-05-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14551846#comment-14551846 ] Jeff Zhang commented on TEZ-2456: - bq. VertexFinishedEvent should be logged before DAGFinish

[jira] [Updated] (TEZ-2249) Wait for all task attempt finished before moving Task to finished state

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2249: Attachment: TEZ-2249-1.patch > Wait for all task attempt finished before moving Task to finished state > -

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-7.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-7.patch > Refactor DAGAppMaster to state machine based >

[jira] [Comment Edited] (TEZ-2249) Wait for all task attempt finished before moving Task to finished state

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552290#comment-14552290 ] Jeff Zhang edited comment on TEZ-2249 at 5/20/15 1:13 PM: -- [~bikass

[jira] [Comment Edited] (TEZ-2249) Wait for all task attempt finished before moving Task to finished state

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552290#comment-14552290 ] Jeff Zhang edited comment on TEZ-2249 at 5/20/15 1:13 PM: -- [~bikass

[jira] [Commented] (TEZ-2249) Wait for all task attempt finished before moving Task to finished state

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552290#comment-14552290 ] Jeff Zhang commented on TEZ-2249: - [~bikassaha] Please help review. 2 new scenarios are added

[jira] [Commented] (TEZ-2249) Wait for all task attempt finished before moving Task to finished state

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553344#comment-14553344 ] Jeff Zhang commented on TEZ-2249: - [~hitesh], but if we don't kill the speculated attempt, t

[jira] [Commented] (TEZ-2249) Wait for all task attempt finished before moving Task to finished state

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553426#comment-14553426 ] Jeff Zhang commented on TEZ-2249: - [~hitesh] As I understand it, when vertex commit happens

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-7.patch > Refactor DAGAppMaster to state machine based >

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-7.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-7.patch > Refactor DAGAppMaster to state machine based >

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-7.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Commented] (TEZ-391) SharedEdge - Support for passing same output from a vertex as input to two different vertices

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553693#comment-14553693 ] Jeff Zhang commented on TEZ-391: The following shows the different edge types we may need to

[jira] [Updated] (TEZ-391) SharedEdge - Support for passing same output from a vertex as input to two different vertices

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-391: --- Attachment: TEZ-391-WIP-7.patch > SharedEdge - Support for passing same output from a vertex as input to two > d

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-7.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-20 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-7.patch > Refactor DAGAppMaster to state machine based >

[jira] [Updated] (TEZ-2474) The old taskNum is logged incorrect when parallelism is changed

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2474: Description: {noformat} 2015-05-21 15:04:33,103 INFO [App Shared Pool - #0] impl.VertexImpl: Vertex vertex_14

[jira] [Created] (TEZ-2474) The old taskNum is logged incorrect when parallelism is changed

2015-05-21 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-2474: --- Summary: The old taskNum is logged incorrect when parallelism is changed Key: TEZ-2474 URL: https://issues.apache.org/jira/browse/TEZ-2474 Project: Apache Tez Issue

[jira] [Updated] (TEZ-2474) The old taskNum is logged incorrect when parallelism is changed

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2474: Target Version/s: 0.7.1 > The old taskNum is logged incorrect when parallelism is changed > -

[jira] [Updated] (TEZ-2474) The old taskNum is logged incorrect when parallelism is changed

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2474: Attachment: TEZ-2474-1.patch [~bikassaha] [~rajesh.balamohan] Please help review. > The old taskNum is logge

[jira] [Updated] (TEZ-2474) The old taskNum is logged incorrectly when parallelism is changed

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2474: Summary: The old taskNum is logged incorrectly when parallelism is changed (was: The old taskNum is logged i

[jira] [Commented] (TEZ-2474) The old taskNum is logged incorrectly when parallelism is changed

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553886#comment-14553886 ] Jeff Zhang commented on TEZ-2474: - Thanks [~rajesh.balamohan] Committed to master & branch-0

[jira] [Comment Edited] (TEZ-391) SharedEdge - Support for passing same output from a vertex as input to two different vertices

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553693#comment-14553693 ] Jeff Zhang edited comment on TEZ-391 at 5/21/15 8:54 AM: - [~bikassaha

[jira] [Comment Edited] (TEZ-391) SharedEdge - Support for passing same output from a vertex as input to two different vertices

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553693#comment-14553693 ] Jeff Zhang edited comment on TEZ-391 at 5/21/15 9:04 AM: - [~bikassaha

[jira] [Comment Edited] (TEZ-391) SharedEdge - Support for passing same output from a vertex as input to two different vertices

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553693#comment-14553693 ] Jeff Zhang edited comment on TEZ-391 at 5/21/15 9:03 AM: - [~bikassaha

[jira] [Updated] (TEZ-391) SharedEdge - Support for passing same output from a vertex as input to two different vertices

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-391: --- Attachment: (was: TEZ-391-WIP-7.patch) > SharedEdge - Support for passing same output from a vertex as input

[jira] [Updated] (TEZ-391) SharedEdge - Support for passing same output from a vertex as input to two different vertices

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-391: --- Attachment: TEZ-391-WIP-7.patch > SharedEdge - Support for passing same output from a vertex as input to two > d

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-7.patch > Refactor DAGAppMaster to state machine based >

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-7.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Commented] (TEZ-2476) Spurious System.err messages in LogicalIOProcessorRuntimeTask

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14555290#comment-14555290 ] Jeff Zhang commented on TEZ-2476: - System.err messages are also in DAGAppMaster for DAG compl

[jira] [Commented] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14555353#comment-14555353 ] Jeff Zhang commented on TEZ-1273: - bq. Should there be 2 events - RECOVER and RECOVER_FAILED

[jira] [Created] (TEZ-2477) Session stats should be recovered

2015-05-21 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-2477: --- Summary: Session stats should be recovered Key: TEZ-2477 URL: https://issues.apache.org/jira/browse/TEZ-2477 Project: Apache Tez Issue Type: Bug Reporter:

[jira] [Updated] (TEZ-2477) Session stats should be recovered

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2477: Issue Type: Sub-task (was: Bug) Parent: TEZ-15 > Session stats should be recovered > ---

[jira] [Commented] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-21 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1483#comment-1483 ] Jeff Zhang commented on TEZ-1273: - || || NEW || INITED || RECOVERING || RUNNING || IDLE || T

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-8.patch > Refactor DAGAppMaster to state machine based >

[jira] [Comment Edited] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1483#comment-1483 ] Jeff Zhang edited comment on TEZ-1273 at 5/22/15 9:21 AM: -- || || NE

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-8.patch > Refactor DAGAppMaster to state machine based >

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-8.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Commented] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14556052#comment-14556052 ] Jeff Zhang commented on TEZ-1273: - Upload new patch. * Add new state TERMINATING * use volat

[jira] [Comment Edited] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14556052#comment-14556052 ] Jeff Zhang edited comment on TEZ-1273 at 5/22/15 12:15 PM: --- Upload

[jira] [Issue Comment Deleted] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Comment: was deleted (was: {color:red}-1 overall{color}. Here are the results of testing the latest attachme

[jira] [Commented] (TEZ-2370) Add stages information to RM UI for debugging / visibility on job progress

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14557913#comment-14557913 ] Jeff Zhang commented on TEZ-2370: - [~harisekhon] Starting from 0.6.0, Tez provides a UI for job d

[jira] [Commented] (TEZ-2398) Flaky test: TestFaultTolerance

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14557914#comment-14557914 ] Jeff Zhang commented on TEZ-2398: - [~rajesh.balamohan] Any logs on this ? > Flaky test: Tes

[jira] [Commented] (TEZ-2475) Tez local mode hanging in big testsuite

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14557976#comment-14557976 ] Jeff Zhang commented on TEZ-2475: - [~fs111] It looks like TezChild hangs there for getting t

[jira] [Commented] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14557994#comment-14557994 ] Jeff Zhang commented on TEZ-2304: - In this log, there are only recovery events for attempt_14

[jira] [Comment Edited] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14557994#comment-14557994 ] Jeff Zhang edited comment on TEZ-2304 at 5/25/15 6:28 AM: -- In this

[jira] [Updated] (TEZ-2311) AM can hang if kill received while recovering from previous attempt

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2311: Labels: Recovery (was: ) > AM can hang if kill received while recovering from previous attempt >

[jira] [Updated] (TEZ-2322) Succeeded count wrong for Pig on Tez job, decreased 380 => 181

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2322: Labels: Recovery (was: ) > Succeeded count wrong for Pig on Tez job, decreased 380 => 181 > -

[jira] [Updated] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2304: Labels: Recovery (was: ) > InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery > ---

[jira] [Updated] (TEZ-2456) Refactor recovery event logging to ensure it meet the recovery event spec

2015-05-24 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2456: Labels: Recovery (was: ) > Refactor recovery event logging to ensure it meet the recovery event spec > --

[jira] [Updated] (TEZ-2483) Tez should close task if processor fail

2015-05-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2483: Attachment: TEZ-2483-2.patch Thanks [~daijy] I uploaded a new patch with a minor change. close() can only be invo

[jira] [Comment Edited] (TEZ-2483) Tez should close task if processor fail

2015-05-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14558625#comment-14558625 ] Jeff Zhang edited comment on TEZ-2483 at 5/26/15 2:32 AM: -- Thanks [

[jira] [Commented] (TEZ-2307) DAGAppMaster may still be in RUNNING when DAG is finished

2015-05-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14558699#comment-14558699 ] Jeff Zhang commented on TEZ-2307: - [~mitdesai] It is not easy to reproduce it. It happens ra

[jira] [Commented] (TEZ-1954) Multiple instances of Inconsistent synchronization in org.apache.tez.dag.app.DAGAppMaster.

2015-05-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560185#comment-14560185 ] Jeff Zhang commented on TEZ-1954: - I believe things will change after TEZ-1273. > Multiple

[jira] [Commented] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560187#comment-14560187 ] Jeff Zhang commented on TEZ-2304: - bq. Maybe createAttempt could be changed to use the last

[jira] [Commented] (TEZ-2475) Tez local mode hanging in big testsuite

2015-05-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560220#comment-14560220 ] Jeff Zhang commented on TEZ-2475: - [~sseth] Is it related to TEZ-1802 ? I see the following

[jira] [Commented] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560243#comment-14560243 ] Jeff Zhang commented on TEZ-2488: - [~hitesh] Here the DAG specifies the memory request which is be

[jira] [Updated] (TEZ-2483) Tez should close task if processor fail

2015-05-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2483: Attachment: TEZ-2483-3.patch > Tez should close task if processor fail > -

[jira] [Updated] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-26 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2488: Attachment: TEZ-2488-1.patch > Tez AM crashes if a submitted DAG is configured to use invalid resource > size

[jira] [Commented] (TEZ-2483) Tez should close task if processor fail

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560565#comment-14560565 ] Jeff Zhang commented on TEZ-2483: - Thanks for your review, [~rajesh.balamohan] [~sseth] , up

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-9.patch > Refactor DAGAppMaster to state machine based >

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: DAGAppMaster_5.pdf > Refactor DAGAppMaster to state machine based > --

[jira] [Updated] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2488: Target Version/s: 0.7.1 > Tez AM crashes if a submitted DAG is configured to use invalid resource > sizes. >

[jira] [Updated] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2488: Attachment: TEZ-2488-2.patch > Tez AM crashes if a submitted DAG is configured to use invalid resource > size

[jira] [Commented] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560884#comment-14560884 ] Jeff Zhang commented on TEZ-2488: - Upload a new patch to fix the test failure. [~hitesh] Ple

[jira] [Comment Edited] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560884#comment-14560884 ] Jeff Zhang edited comment on TEZ-2488 at 5/27/15 12:38 PM: --- Upload

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: (was: TEZ-1273-9.patch) > Refactor DAGAppMaster to state machine based > -

[jira] [Updated] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2488: Target Version/s: 0.5.4, 0.6.2, 0.7.1 (was: 0.7.1) > Tez AM crashes if a submitted DAG is configured to use i

[jira] [Updated] (TEZ-1273) Refactor DAGAppMaster to state machine based

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-1273: Attachment: TEZ-1273-9.patch > Refactor DAGAppMaster to state machine based >

[jira] [Commented] (TEZ-2483) Tez should close task if processor fail

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562192#comment-14562192 ] Jeff Zhang commented on TEZ-2483: - Thanks [~rajesh.balamohan] Committed to master, branch-0.

[jira] [Created] (TEZ-2494) Check resource size when VertexManager change that

2015-05-27 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-2494: --- Summary: Check resource size when VertexManager change that Key: TEZ-2494 URL: https://issues.apache.org/jira/browse/TEZ-2494 Project: Apache Tez Issue Type: Improveme

[jira] [Commented] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.

2015-05-27 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562281#comment-14562281 ] Jeff Zhang commented on TEZ-2488: - Thanks [~hitesh] Committed to master, 0.5, 0.6, 0.7 Crea

[jira] [Assigned] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang reassigned TEZ-2304: --- Assignee: Jeff Zhang > InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery > -

[jira] [Updated] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2304: Attachment: TEZ-2304-1.patch > InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery >

[jira] [Commented] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14562815#comment-14562815 ] Jeff Zhang commented on TEZ-2304: - Upload patch for it, [~hitesh] Please help review it. *

[jira] [Updated] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery

2015-05-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2304: Attachment: TEZ-2304-2.patch > InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery >

[jira] [Commented] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-28 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564210#comment-14564210 ] Jeff Zhang commented on TEZ-2391: - Looking at this issue, can reproduce it on ubuntu. It is

[jira] [Updated] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-29 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2391: Attachment: TEZ-2391-1.patch > TestVertexImpl timing out at times on jenkins builds > ---

[jira] [Commented] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-29 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564591#comment-14564591 ] Jeff Zhang commented on TEZ-2391: - e.g. TestVertexImpl#testInputInitializerEvents fails ver

[jira] [Comment Edited] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-29 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564591#comment-14564591 ] Jeff Zhang edited comment on TEZ-2391 at 5/29/15 11:15 AM: --- e.g.

[jira] [Comment Edited] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-29 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564591#comment-14564591 ] Jeff Zhang edited comment on TEZ-2391 at 5/29/15 12:22 PM: --- e.g.

[jira] [Comment Edited] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-29 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564591#comment-14564591 ] Jeff Zhang edited comment on TEZ-2391 at 5/29/15 12:24 PM: --- e.g.

[jira] [Updated] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2391: Attachment: TEZ-2391-2.patch > TestVertexImpl timing out at times on jenkins builds > ---

[jira] [Created] (TEZ-2507) mapreduce.{map|reduce}.java.opts should override tez.task.launch.cmd-opts

2015-05-31 Thread Jeff Zhang (JIRA)
Jeff Zhang created TEZ-2507: --- Summary: mapreduce.{map|reduce}.java.opts should override tez.task.launch.cmd-opts Key: TEZ-2507 URL: https://issues.apache.org/jira/browse/TEZ-2507 Project: Apache Tez

[jira] [Updated] (TEZ-2507) mapreduce.{map|reduce}.java.opts should override tez.task.launch.cmd-opts

2015-05-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2507: Description: Otherwise it may cause JVM option conflicts. Here's the issue reported by [~r7raul] {noformat} I chan

[jira] [Updated] (TEZ-2507) mapreduce.{map|reduce}.java.opts should override tez.task.launch.cmd-opts

2015-05-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2507: Description: Otherwise it may cause JVM option conflicts. Here's the issue reported by [~r7raul] {noformat} I chan

[jira] [Commented] (TEZ-2507) mapreduce.{map|reduce}.java.opts should override tez.task.launch.cmd-opts

2015-05-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14566943#comment-14566943 ] Jeff Zhang commented on TEZ-2507: - 2 things need to be done for this issue: * Make tez.task.

[jira] [Commented] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds

2015-05-31 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14566966#comment-14566966 ] Jeff Zhang commented on TEZ-2391: - Upload one new patch to solve 2 new issues in TestVertexI
