Sorry, I didn't notice the "xxxxxxxx" masks.

The "xxxxxxxx" in xxxxxxxx/crawl_generate/part-00002
stands for
/user/justin/crawl/MERGEDsegments

Thanks,
Justin

Justin Yao wrote:
Hi,

I encountered an error when merging segments using nightly build #736.
I have 3 nodes and all servers have CentOS 5.2 installed.
Every time I try to merge segments using the command
"nutch mergesegs crawl/MERGEDsegments -dir crawl/segments",
it fails with the error message
"Task attempt_200903031109_0007_r_000002_0 failed to report status for
603 seconds. Killing!"
and later
"org.apache.hadoop.ipc.RemoteException:
org.apache.hadoop.hdfs.protocol.AlreadyBeingCreatedException: failed to
create file xxxxxxxx/crawl_generate/part-00002 for
DFSClient_attempt_200903031109_0007_r_000002_2 on client 10.6.180.2
because current leaseholder is trying to recreate file"

Even after I raised "mapred.task.timeout" to "3600000" (1 hour), it still fails.

I've tried builds #743, #736, and #723. None of them succeeded.
Has anyone encountered this problem before? Any suggestions would be appreciated.

Thanks,
