Hello all,
We were having some trouble with the tasktracker on one of our machines
while doing a fetch and had to restart the tasktracker. Is it possible
to restart the task that was running at the time? We were doing a rather
large fetch, and it was in the reduce phase when it errored (exception is
below). Is it possible to reuse the data that was already mapped and just
restart the reduce job, or are we going to have to redo the entire fetch?
Exception:
060120 125418 task_r_24x406 copy failed: task_m_2hi2zg from 127.0.0.2:61640
java.io.IOException: timed out waiting for response
at org.apache.nutch.ipc.Client.call(Client.java:296)
at org.apache.nutch.ipc.RPC$Invoker.invoke(RPC.java:127)
at $Proxy2.getFile(Unknown Source)
at org.apache.nutch.mapred.ReduceTaskRunner.prepare(ReduceTaskRunner.java:94)
at org.apache.nutch.mapred.TaskRunner.run(TaskRunner.java:62)
Thanks for any info.
-Matt Zytaruk