[
https://issues.apache.org/jira/browse/HADOOP-1374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12562252#action_12562252
]
Arun C Murthy commented on HADOOP-1374:
---------------------------------------
Forgot to add: we have features which enable a reduce to give feedback to the
JT about maps from which it is failing to fetch outputs; which leads to the map
being killed and being restarted (HADOOP-1158). In 0.15.0 it takes a bit of
time for the errant map to get killed and re-run somwhere else, so you might
just have to wait for sometime (at times upto 30mins) before this occurs...
0.16.0 has improves upon that further (HADOOP-1984).
> TaskTracker falls into an infinite loop.
> ----------------------------------------
>
> Key: HADOOP-1374
> URL: https://issues.apache.org/jira/browse/HADOOP-1374
> Project: Hadoop Core
> Issue Type: Bug
> Components: mapred
> Affects Versions: 0.12.3
> Reporter: Konstantin Shvachko
> Assignee: Arun C Murthy
> Attachments: DataNode1.log, DataNode2.log, JobTracker.log,
> NameNode.log, TaskTracker1.log, TaskTracker2.log, TestDFSIO.log
>
>
> All maps had been completed successfully. I had only one reduce task during
> which
> TaskTracker infinitely outputs:
> 07/05/15 19:35:41 INFO mapred.TaskTracker: task_0001_r_000000_0 0.16666667%
> reduce > copy (4 of 8 at 0.00 MB/s) >
> 07/05/15 19:35:42 INFO mapred.TaskTracker: task_0001_r_000000_0 0.16666667%
> reduce > copy (4 of 8 at 0.00 MB/s) >
> 07/05/15 19:35:43 INFO mapred.TaskTracker: task_0001_r_000000_0 0.16666667%
> reduce > copy (4 of 8 at 0.00 MB/s) >
> 07/05/15 19:35:44 INFO mapred.TaskTracker: task_0001_r_000000_0 0.16666667%
> reduce > copy (4 of 8 at 0.00 MB/s) >
> 07/05/15 19:35:45 INFO mapred.TaskTracker: task_0001_r_000000_0 0.16666667%
> reduce > copy (4 of 8 at 0.00 MB/s) >
> JobTracker does not log anything about task task_0001_r_000000_0 except for
> 07/05/15 19:49:01 INFO mapred.JobTracker: Adding task 'task_0001_r_000000_0'
> to tip tip_0001_r_000000, for tracker 'tracker_my-host.com:50050'
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.