I've been playing with a 6-node cluster to test the feasibility of using
Hadoop to deal with a large data set (7 TB) we have been struggling to
comprehend. I have understood all along that I would need a much
larger cluster if we decided to go ahead with Hadoop, but I must say that
my initial reaction was that there was a lot of communication overhead.
I suspect the change you have described would have led to a much
different first impression.
Doug Cutting wrote:
Currently, pseudo-distributed mode is *much* slower than "local" mode.
It makes sense that running a trivial task on 100 nodes might take
longer than running it standalone, but running it on one node over
localhost should not be that much slower. In part this is due to task
JVM startup time, but I think the larger part of the blame lies with
heartbeat intervals.
The tasktracker polls for new tasks only every heartbeat interval. When
running small jobs in small clusters, this interval dominates
performance. But in larger clusters a short heartbeat interval would
overload the jobtracker. Perhaps the tasktracker should instead get its
heartbeat interval from the jobtracker. The jobtracker could return a
small interval when few tasktrackers are known, and a larger interval
when lots of tasktrackers are known. This would make small clusters
more responsive.
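A minimal sketch of the idea: the jobtracker scales the interval it hands
back with the number of tasktrackers it currently knows about, clamped
between a floor and a ceiling. Everything here is illustrative, not actual
Hadoop API; the class name, the bounds, and the per-second heartbeat budget
are all assumptions.

```java
// Hypothetical sketch: jobtracker-chosen heartbeat interval that grows
// with cluster size. Names and constants are illustrative assumptions,
// not real Hadoop code.
public class ClusterHeartbeat {
    // Assumed bounds: poll every 300 ms in a tiny cluster, cap at 10 s.
    static final int MIN_INTERVAL_MS = 300;
    static final int MAX_INTERVAL_MS = 10_000;
    // Assumed budget: the jobtracker can absorb ~100 heartbeats per second.
    static final int HEARTBEATS_PER_SECOND = 100;

    /** Interval the jobtracker would return with each heartbeat reply. */
    static int intervalFor(int knownTaskTrackers) {
        // Spread the trackers' heartbeats across the per-second budget,
        // then clamp to the configured floor and ceiling.
        int interval = knownTaskTrackers * 1000 / HEARTBEATS_PER_SECOND;
        return Math.max(MIN_INTERVAL_MS, Math.min(MAX_INTERVAL_MS, interval));
    }

    public static void main(String[] args) {
        System.out.println(intervalFor(1));     // pseudo-distributed: floor, 300 ms
        System.out.println(intervalFor(100));   // mid-size cluster: 1000 ms
        System.out.println(intervalFor(5000));  // large cluster: capped at 10 s
    }
}
```

With something like this, a pseudo-distributed setup would poll at the
floor and feel responsive, while a large cluster would back off toward the
ceiling and keep jobtracker load roughly constant.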
One could use a similar mechanism in dfs.
This is a very low priority issue that I just wanted to get out of my head.
Doug