One thing to consider: if the node was down for a day, all of its blocks 
could have been re-replicated to other datanodes.
When the machine is brought back, these blocks become over-replicated and the 
NameNode decides to delete the excess replicas.
You should check the logs of both the DataNode and the NameNode to see whether 
that is the case.
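
To make the sequence concrete, here is a rough toy model in Python. It is only a
sketch, not HDFS code: the node names, block names, replication factor of 3 and
every function below are made up for illustration. It also assumes the surplus
copy is always dropped from the returning node, whereas the real NameNode picks
which excess replica to delete using its own placement and free-space rules.

```python
REPLICATION = 3  # assumed target replication factor (hypothetical cluster)


def mark_dead(block_map, node):
    """Forget every replica held by a node that was declared dead."""
    for holders in block_map.values():
        holders.discard(node)


def re_replicate(block_map, live_nodes, replication=REPLICATION):
    """Copy under-replicated blocks to live nodes that do not hold them yet."""
    for holders in block_map.values():
        for target in sorted(live_nodes - holders):
            if len(holders) >= replication:
                break
            holders.add(target)


def block_report(block_map, node, blocks_on_disk):
    """A returning node reports the blocks that are still on its disk."""
    for block in blocks_on_disk:
        block_map[block].add(node)


def over_replicated(block_map, replication=REPLICATION):
    """Return the blocks that now have more replicas than the target."""
    return sorted(b for b, h in block_map.items() if len(h) > replication)


def simulate():
    nodes = {"dn1", "dn2", "dn3", "dn4"}
    block_map = {
        "blk_1": {"dn1", "dn2", "dn3"},
        "blk_2": {"dn1", "dn3", "dn4"},
        "blk_3": {"dn2", "dn3", "dn4"},
    }
    on_dn1 = [b for b, h in block_map.items() if "dn1" in h]

    mark_dead(block_map, "dn1")               # dn1 crashes and is declared dead
    re_replicate(block_map, nodes - {"dn1"})  # missing copies are rebuilt elsewhere
    block_report(block_map, "dn1", on_dn1)    # dn1 returns with its old blocks

    surplus = over_replicated(block_map)
    # Toy policy (an assumption): remove the surplus copy from the returning node.
    for block in surplus:
        block_map[block].discard("dn1")
    return surplus, block_map
```

Calling simulate() in this model reports blk_1 and blk_2 as over-replicated once
dn1 rejoins, and dn1 ends up holding no blocks, which matches the nearly empty
disk described in the question.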

Boris.


On 4/12/11 7:46 AM, "felix gao" <gre1...@gmail.com> wrote:

What reason/condition would cause a datanode’s blocks to be removed? Our 
cluster had one of its datanodes crash because of bad RAM. After the system 
was upgraded and the datanode/tasktracker brought online the next day, we 
noticed the amount of space utilized was minimal and the cluster was 
rebalancing blocks to the datanode. It would seem the prior blocks were 
removed. Was this because the datanode was declared dead? What are the 
criteria for a namenode to decide (assuming it’s the namenode) when a datanode 
should remove prior blocks?
