On 4/12/2011 10:46 AM, felix gao wrote:
What reason/condition would cause a datanode’s blocks to be removed?
Our cluster had one of its datanodes crash because of bad RAM.
After the system was upgraded and the datanode/tasktracker brought
online the next day we noticed the amount of space utilized was
minimal and the cluster was rebalancing blocks to the datanode. It
would seem the prior blocks were removed. Was this because the
datanode was declared dead? What are the criteria a NameNode uses
(assuming it is the NameNode that decides) to determine when a datanode
should remove its prior blocks?
1- Did you check the DataNode's logs?
2- Did you protect the NameNode's dfs.name.dir and dfs.name.edits.dir
directories?
In the first of these directories, the NameNode stores the file system
image, and in the second it writes the edit log (the journal). A good
practice is to keep both directories on RAID 1 or RAID 10 to guarantee
the consistency of your cluster.
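As a minimal hdfs-site.xml sketch of that practice (the paths below are hypothetical examples, not defaults): dfs.name.dir accepts a comma-separated list, and the NameNode writes a full copy of the image to every directory listed, so one copy can sit on a RAID volume and another on a separate mount:

```xml
<!-- hdfs-site.xml: example paths, adjust to your own mounts -->
<property>
  <name>dfs.name.dir</name>
  <!-- the NameNode mirrors the fsimage into each listed directory -->
  <value>/raid1/hadoop/name,/mnt/nfs/hadoop/name</value>
</property>
```

Losing one directory then still leaves an intact copy of the metadata in the other.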
Any data loss in these directories (dfs.name.dir and dfs.name.edits.dir)
will result in data loss in your HDFS. So the second good practice
is to have a Secondary NameNode set up, in case the primary
NameNode fails.
Another thing to keep in mind: when the NameNode fails, you have
to restart the JobTracker and the TaskTrackers after the NameNode
itself has been restarted.
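As a sketch of that restart order, assuming a 0.20-era deployment with the stock control scripts from $HADOOP_HOME/bin on the PATH (this is cluster-specific and not something to run blindly):

```
stop-mapred.sh    # stop the JobTracker and TaskTrackers
start-dfs.sh      # bring the NameNode (and DataNodes) back first
start-mapred.sh   # then restart the MapReduce daemons
```

The point is simply that the MapReduce daemons should come up only after HDFS is available again.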
Regards
--
Marcos Luís Ortíz Valmaseda
Software Engineer (Large-Scaled Distributed Systems)
University of Information Sciences,
La Habana, Cuba
Linux User # 418229