Hey Bill,

Have you tried the Ganglia or JMX stats from your namenode?

I.e., look here:

http://rcf.unl.edu/ganglia/?m=load_one&r=hour&s=descending&c=red-workers&h=hadoop-name&sh=1&hc=4&z=small

The dfs.FSNamesystem.UnderReplicatedBlocks metric should keep track of what you're looking for. You can query Ganglia or turn on JMX and use one of the JMX/Nagios connectors.

Brian

On Feb 10, 2009, at 5:05 PM, Bill Au wrote:

I am in the process of setting up remote monitoring of my Hadoop cluster. I
seems to me that the replication status can only be obtained from the
command line by the fsck command.  Has anyone though about adding
replication status to the NameNode web UI in dfshealth.jsp? Or is that something that I really shouldn't worry about since Hadoop will fix things
all by itself?

Bill

Reply via email to