Hi, I am new to Hadoop, and I apologies if this was answered before, or if this is not the right list for my question.
I am trying to do the following: 1- Read monitoring information from slave nodes in hadoop 2- Process the data to detect nodes failure (node crash, problems in requests ... etc) and decide if I need to restart the whole machine. 3- Restart the machine running the slave facing problems My question is for step 1- collecting monitoring information. I have checked Hadoop monitoring features. But currently you can forward the motioning data to files, or to Ganglia. Is there a way to query Hadoop nodes for motioning information (dfs, rcp, jvm, mapred), or push the monitoring information to my own code. Thank you for your help -sam