Re: dfs datanode heartbeats and getBlockwork requests

Eric Baldeschwieler Mon, 03 Apr 2006 23:52:50 -0700

If we moved to a scheme where the name node was just given a smallnumber of blocks with each heartbeat, there would be no reason to notstart reporting blocks immediately, would there? Or the name node torespond to the heartbeat with the block range it wanted nextheartbeat...


On Apr 3, 2006, at 2:42 PM, Doug Cutting wrote:

Hairong Kuang wrote:
Currently dfs datanodes send heartbeats and getBlockwork requeststo thenamenode at the same frequency (once every 3 seconds) aftercertain startuptime. Is there any design reason that we need two seperatemessages insteadof one? I am thinking that if we let a sendHeartbeat requestreturn theblocks to be deleted or replicated, we are able to cut the networktraffic
in dfs.
No, that sounds like a reasonable change to me.
The startup delay will be need to be somehow re-implemented.Perhaps we could simply change this to a timer in the namenode onstartup, so that it waits a while on startup before giving anyblockwork. We might then have issues if, e.g, the namenode'sethernet cable were yanked for a few minutes. When it is re-connected, the namenode will start issuing lots of uneededreplication requests. Having a delay in blockwork at the datanodeeach time it establishes a new connection to the namenode solvesthat problem. Are there other cases that the current startupblockwork delay is handling?
Doug

Re: dfs datanode heartbeats and getBlockwork requests

Reply via email to