[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16135171#comment-16135171
 ] 

Peter Bacsko edited comment on MAPREDUCE-6892 at 8/21/17 1:25 PM:
------------------------------------------------------------------

[~haibochen] I uploaded patch v4. I fixed everything you mentioned, but there 
are some things which I didn't change:

1) Checkstyle, Javadoc errors: we would end up with a messy, inconsistent code 
style if we decide to eliminate these. I kept the style of the existing code, 
which unfortunately introduced new warnings. Refactoring/re-indenting the 
existing code would be way too much change and it's unnecessary IMHO.

2) Job20LineHistoryEventEmitter - I kept the "-1" values. I was looking for the 
code which generates the stuff that this class parses. I couldn't find it and I 
believe it does not exist anymore. However I found this on an old branch: 
https://github.com/apache/hadoop/blob/branch-1.2/src/mapred/org/apache/hadoop/mapred/JobHistory.java

I think this was the class which generated the events that Apache Rumen is 
meant to parse. This makes me wonder whether it makes sense to have Rumen on 
the trunk. Looking at the latest changes, the vast majority of the source files 
haven't been touched for 5-6 years (and newer changes are completely unrelated 
to its functionality).


was (Author: pbacsko):
[~haibochen] I uploaded patch v4. I fixed everything you mentioned, but there 
are some things which I didn't change:

1) Checkstyle, Javadoc errors: we would end up with a messy, inconsistent code 
style if we decide to eliminate these. I kept the style of the existing code, 
which unfortunately introduced new warnings. Refactoring/re-indenting the 
existing code would be way too much change and it's unnecessary IMHO.

2) Job20LineHistoryEventEmitter - I kept the "-1" values. I was looking for the 
code which generates the stuff that this class parses. I couldn't find it and I 
believe it does not exist anymore. However I found this on an old branch: 
https://github.com/apache/hadoop/blob/branch-1.2/src/mapred/org/apache/hadoop/mapred/JobHistory.java

I think this was the class which generated the events that Apache Rumen is 
meant parse. This makes me wonder whether it makes sense to have Rumen on the 
trunk. Looking at the latest changes, the vast majority of the source files 
haven't been touched for 5-6 years (and newer changes are completely unrelated 
to its functionality).

> Issues with the count of failed/killed tasks in the jhist file
> --------------------------------------------------------------
>
>                 Key: MAPREDUCE-6892
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6892
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: client, jobhistoryserver
>            Reporter: Peter Bacsko
>            Assignee: Peter Bacsko
>         Attachments: MAPREDUCE-6892-001.patch, MAPREDUCE-6892-002.PATCH, 
> MAPREDUCE-6892-003.patch, MAPREDUCE-6892-004.patch
>
>
> Recently we encountered some issues with the value of failed tasks. After 
> parsing the jhist file, {{JobInfo.getFailedMaps()}} returned 0, but actually 
> there were failures. 
> Another minor thing is that you cannot get the number of killed tasks 
> (although this can be calculated).
> The root cause is that {{JobUnsuccessfulCompletionEvent}} contains only the 
> successful map/reduce task counts. Number of failed (or killed) tasks are not 
> stored.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org

Reply via email to