[ https://issues.apache.org/jira/browse/MAPREDUCE-3678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13471221#comment-13471221 ]
Harsh J commented on MAPREDUCE-3678:
------------------------------------

Hi,

If no one has any objections to these INFO log additions, I'll commit this in a couple of days. This helps projects such as Pig, Hive, etc. without any changes on their end.

> The Map tasks logs should have the value of input split it processed
> --------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3678
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3678
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: nodemanager, tasktracker
>    Affects Versions: 1.0.0, 2.0.0-alpha
>            Reporter: Bejoy KS
>            Assignee: Harsh J
>         Attachments: MAPREDUCE-3678-branch-1.patch, MAPREDUCE-3678.patch
>
>
> It would be easier to debug some corner cases in tasks if we knew which
> input split was processed by that task. The MapReduce task tracker log should
> include this. In the jobdetails web UI, the split should also be
> displayed along with the Split Locations.
> Sample as
> Input Split
> hdfs://myserver:9000/userdata/sampleapp/inputdir/file1.csv - <split no>/<offset from beginning of file>
> This would be very helpful for nailing down data quality issues in large
> data volume processing.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira