+1, jstack is crucial to solve these kinds of issues. Also, which scheduler
are you using?

Thanks
-Todd

On Thu, Jun 17, 2010 at 2:38 PM, Ted Yu <yuzhih...@gmail.com> wrote:

> Is upgrading to hadoop-0.20.2+228 possible ?
>
> Use jstack to get stack trace of job tracker process when this happens
> again.
> Use jmap to get shared object memory maps or heap memory details.
>
> On Thu, Jun 17, 2010 at 2:00 PM, Li, Tan <t...@shopping.com> wrote:
>
> > Folks,
> >
> > I need some help on job tracker.
> > I am running a two hadoop clusters (with 30+ nodes) on Ubuntu. One is
> with
> > version 0.19.1 (apache) and the other one is with version 0.20. 1+169.68
> > (Cloudera).
> >
> > I have the same problem with both the clusters: the job tracker hangs
> > almost once a day.
> > Symptom: The job tracker web page can not be loaded, the command "hadoop
> > job -list" hangs and jobtracker.log file stops being updated.
> > No useful information can I find in the job tracker log file.
> > The symptom is gone after I restart the job tracker and the cluster runs
> > fine for another 20+ hour period. And then the symptom comes back.
> >
> > I do not have serious problem with HDFS.
> >
> > Any ideas about the causes? Any configuration parameter that I can change
> > to reduce the chances of the problem?
> > Any tips for diagnosing and troubleshooting?
> >
> > Thanks!
> >
> > Tan
> >
> >
> >
> >
>



-- 
Todd Lipcon
Software Engineer, Cloudera

Reply via email to