Jeff Zhang created TEZ-2041:
-------------------------------

             Summary: TEZ-UI May need to provide extra info about recovery on tez-ui
                 Key: TEZ-2041
                 URL: https://issues.apache.org/jira/browse/TEZ-2041
             Project: Apache Tez
          Issue Type: Bug
          Components: UI
            Reporter: Jeff Zhang
            Priority: Minor

There is one special case in which a task may be associated with two succeeded task attempts after recovery. If the first succeeded task attempt cannot be recovered, another task attempt is scheduled, so the same task ends up with two succeeded task attempts. This may confuse users of tez-ui, so tez-ui could show extra information about recovery. This would also require extra information in the history event.

{code}
2015-02-05 17:11:14,999 INFO [Dispatcher thread: Central] impl.TaskImpl: Trying to recover task, taskId=task_1423126607079_0004_1_04_000000, recoveredState=SUCCEEDED
2015-02-05 17:11:14,999 INFO [Dispatcher thread: Central] impl.TaskImpl: Recovering data for task from previous DAG attempt, taskId=task_1423126607079_0004_1_04_000000, output=scope-43
2015-02-05 17:11:14,999 INFO [Dispatcher thread: Central] impl.TaskImpl: Task recovery not supported by committer, failing task attempt, taskId=task_1423126607079_0004_1_04_000000, attemptId=attempt_1423126607079_0004_1_04_000000_0, output=scope-43
2015-02-05 17:11:15,000 INFO [Dispatcher thread: Central] impl.TaskImpl: task_1423126607079_0004_1_04_000000 Task Transitioned from NEW to RUNNING due to event T_RECOVER
{code}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)