Re: Task process exit with nonzero status of 1

2009-09-24 Thread Edward Capriolo
On Wed, Sep 23, 2009 at 2:06 PM, Marc Limotte wrote: > I'm seeing this error when I try to run my job. > > java.io.IOException: Task process exit with nonzero status of 1. >    at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:418) > > From what I can find by doing some Google searches, t

RE: Task process exit with nonzero status of 1

2009-09-24 Thread Marc Limotte
@hadoop.apache.org Subject: Re: Task process exit with nonzero status of 1 On Wed, Sep 23, 2009 at 2:06 PM, Marc Limotte wrote: > I'm seeing this error when I try to run my job. > > java.io.IOException: Task process exit with nonzero status of 1. >at org.apache.hadoop.mapred.TaskRunner.ru

Re: Task process exit with nonzero status of 1

2009-09-24 Thread Todd Lipcon
er 24, 2009 7:50 AM > To: common-user@hadoop.apache.org > Subject: Re: Task process exit with nonzero status of 1 > > On Wed, Sep 23, 2009 at 2:06 PM, Marc Limotte wrote: > > I'm seeing this error when I try to run my job. > > > > java.io.IOExcep

RE: Task process exit with nonzero status of 1

2009-09-24 Thread Marc Limotte
Sent: Thursday, September 24, 2009 10:19 AM To: common-user@hadoop.apache.org Subject: Re: Task process exit with nonzero status of 1 Hi Marc, Exit status 1 usually means some kind of controlled exit by the mapreduce child task. Things like JVM crashes usually are indicated by other exit codes (134 s

Re: Task process exit with nonzero status of 1

2009-09-24 Thread Todd Lipcon
09-24 18:16:43,091 INFO org.apache.hadoop.mapred.TaskRunner: > attempt_200909221656_0006_r_00_1 done; removing files. > 2009-09-24 18:16:43,092 INFO org.apache.hadoop.mapred.TaskTracker: > addFreeSlot : current free slots : 2 > 2009-09-24 18:17:07,057 INFO org.apache.hadoop.mapred.Tas

Re: Task process exit with nonzero status of 1

2009-09-24 Thread Koji Noguchi
tasks it ran: 0 > 2009-09-24 18:16:43,091 INFO org.apache.hadoop.mapred.TaskRunner: > attempt_200909221656_0006_r_00_1 done; removing files. > 2009-09-24 18:16:43,092 INFO org.apache.hadoop.mapred.TaskTracker: addFreeSlot > : current free slots : 2 > 2009-09-24 18:17:07,057 INFO org.a

RE: Task process exit with nonzero status of 1

2009-09-24 Thread Marc Limotte
c.com] Sent: Thursday, September 24, 2009 11:37 AM To: common-user@hadoop.apache.org Subject: Re: Task process exit with nonzero status of 1 > > A little more background. This job was working fine for weeks, running > > hourly, and then failed on Saturday morning and hasn't worked s

RE: Task process exit with nonzero status of 1

2009-09-24 Thread Marc Limotte
> 2009-09-24 18:16:43,091 INFO org.apache.hadoop.mapred.TaskRunner: > attempt_200909221656_0006_r_00_1 done; removing files. > 2009-09-24 18:16:43,092 INFO org.apache.hadoop.mapred.TaskTracker: > addFreeSlot : current free slots : 2 > 2009-09-24 18:17:07,057 INFO org.apache.hadoop.

RE: Task process exit with nonzero status of 1

2009-09-24 Thread Marc Limotte
arc Limotte [mailto:mlimo...@feeva.com] Sent: Thursday, September 24, 2009 2:19 PM To: common-user@hadoop.apache.org Cc: Deept Kumar Subject: RE: Task process exit with nonzero status of 1 Added DEBUG, but don't see anything interesting. The only new tasktracker log entries are about receiving a hea

Re: Task process exit with nonzero status of 1

2009-09-24 Thread Vinod KV
thing that is failing. Marc -Original Message- From: Marc Limotte [mailto:mlimo...@feeva.com] Sent: Thursday, September 24, 2009 2:19 PM To: common-user@hadoop.apache.org Cc: Deept Kumar Subject: RE: Task process exit with nonzero status of 1 Added DEBUG, but don't see anything in

RE: Task process exit with nonzero status of 1

2009-10-09 Thread Feng, Ao
request, if there is no elegant solution at this time? Thanks, Ao -Original Message- From: Marc Limotte [mailto:mlimo...@feeva.com] Sent: Thursday, September 24, 2009 11:24 AM To: common-user@hadoop.apache.org Subject: RE: Task process exit with nonzero status of 1 Hi Todd. No userlogs

Re: Task process exit with nonzero status of 1

2009-10-09 Thread Frank Singleton
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Feng, Ao wrote: > I probably know what the problem it, as we are encountering the same issue on > our prod cluster. Every once a while jobs start failing on the same task > trackers, and the only error message is this exit status 1. > > Go to the us

RE: Task process exit with nonzero status of 1

2009-10-27 Thread Marc Limotte
Just an FYI, found the solution to this problem. Apparently, it's an OS limit on the number of sub-directories that can be created in another directory. In this case, we had 31998 sub-directories under hadoop/userlogs/, so any new tasks would fail in Job Setup. >From the unix command line, mkd

Re: Task process exit with nonzero status of 1 - deleting userlogshelps

2010-06-16 Thread Manhee Jo
t, I could solve the problem. Hope this help. Regards, Manhee - Original Message - From: "Edward Capriolo" To: Sent: Tuesday, June 15, 2010 2:47 AM Subject: Re: Task process exit with nonzero status of 1 - deleting userlogshelps On Mon, Jun 14, 2010 at 1:15 PM, Johannes Zil

Re: Task process exit with nonzero status of 1 - deleting userlogshelps

2010-06-17 Thread Johannes Zillmann
er ulimit, I could solve the problem. > Hope this help. > > > Regards, > Manhee > > - Original Message - From: "Edward Capriolo" > To: > Sent: Tuesday, June 15, 2010 2:47 AM > Subject: Re: Task process exit with nonzero status of 1 - deleting

Re: Task process exit with nonzero status of 1 - deleting userlogs helps

2010-06-14 Thread Edward Capriolo
On Mon, Jun 14, 2010 at 1:15 PM, Johannes Zillmann wrote: > Hi, > > i have running a 4-node cluster with hadoop-0.20.2. Now i suddenly run into > a situation where every task scheduled on 2 of the 4 nodes failed. > Seems like the child jvm crashes. There are no child logs under > logs/userlogs. T

Re: Task process exit with nonzero status of 1 - deleting userlogs helps

2010-06-16 Thread Johannes Zillmann
Hi Edward, i copied the userlogs folder which caused the error. Two things which is speak against the too-many files theory. a) i can add new files to this folder (touch userlogsOLD/a, etc... ) b) the sysctl fs.file-max shows 817874 whereas the file count on the first level of userlogsOLD is 31

Re: Task process exit with nonzero status of 1 - deleting userlogs helps

2010-06-16 Thread Amareshwari Sri Ramadasu
The issue is fixed in branch 0.21 through http://issues.apache.org/jira/browse/MAPREDUCE-927. Now, the attempt directories are moved inside job directory. So, userlogs directory will have only job directories. Thanks Amareshwari On 6/16/10 12:47 PM, "Johannes Zillmann" wrote: Hi Edward, i cop