[ 
https://issues.apache.org/jira/browse/TEZ-1141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181974#comment-14181974
 ] 

Bikas Saha commented on TEZ-1141:
---------------------------------

Should the times match the assert with failedAttempts+1
{code}+    Assert.assertEquals(failedAttempts + 1, mockTask.failedAttempts);
+    verify(mockTask.getVertex(), 
times(1)).incrementFailedTaskAttemptCount();{code}

Maybe we can enhance one or two of the simple TestFaultTolerance tests to check 
that the count is being correctly transmitted e2e.

+1.

> DAGStatus.Progress should include number of failed attempts
> -----------------------------------------------------------
>
>                 Key: TEZ-1141
>                 URL: https://issues.apache.org/jira/browse/TEZ-1141
>             Project: Apache Tez
>          Issue Type: Improvement
>    Affects Versions: 0.5.0
>            Reporter: Bikas Saha
>            Assignee: Hitesh Shah
>         Attachments: TEZ-1141.1.patch, TEZ-1141.2.patch
>
>
> Currently its impossible to know whether a job is seeing a lot of issues and 
> failures because we only report running tasks. Eventually the job fails but 
> before that we have no indication that a bunch of task failures have been 
> happening.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to