Re: FAILED_UNCLEAN?

Nathan Marz Wed, 25 Feb 2009 01:22:14 -0800

This is on Hadoop 0.19.1. The first time I saw it happen, the job washung. That is, 5 map tasks were "running", but looking at each taskthere was the FAILED_UNCLEAN task attempt and no other task attempts.I reran it again, the job failed immediately, and some of the taskshad FAILED_UNCLEAN.

There is one job that runs in parallel with this job, but it's of thesame priority. The other job had failed when the job I'm describinggot hung.



On Feb 24, 2009, at 10:46 PM, Amareshwari Sriramadasu wrote:

Nathan Marz wrote:
I have a large job operating on over 2 TB of data, with about 50000input splits. For some reason (as yet unknown), tasks startedfailing on two of the machines (which got blacklisted). 13 mappersfailed in total. Of those 13, 8 of the tasks were able to executeon another machine without any issues. 5 of the tasks *did not* getre-executed on another machine, and their status is marked as"FAILED_UNCLEAN". Anyone have any idea what's going on? Why isn'tHadoop running these tasks on other machines?
Has the job failed/killed or Succeded when you see this situation ?Once the job completes, the unclean attempts will not get scheduled.If not, are there other jobs of higher priority running at the sametime preventing the cleanups to be launched?
What version of Hadoop are you using? latest trunk?

Thanks
Amareshwari
Thanks,
Nathan Marz

Re: FAILED_UNCLEAN?

Reply via email to