Hi,
I found a strange behavior in my cluster. The data nodes stop sending any
information randomly (no logs coming). So the namenode thinks its down. But
after some time ( approx 30 mints) the datanode nodes comes up and start
behaving properly. I tried finding any error log, but the datanode
Hi Rahul,
one possibility could be system time updations:
Can you check , System time changed in your system?
Since the heartbeats will depends on System times, that will effect sending the
heartbeats to NN.
Whihc version of hadoop are you using?
approximately how many blocks will be there in
When nodes are not reporting heartbeats, can you ssh into them?
Can they see the JT machine?
What does netstat -a show?
Cheers,
Joep
From: Rahul Das [rahul.h...@gmail.com]
Sent: Tuesday, August 02, 2011 11:21 PM
To: hdfs-user@hadoop.apache.org
Subject: Dananode