> On Feb. 4, 2015, 2:27 p.m., Jonathan Hurley wrote:
> > ambari-server/src/main/resources/common-services/HBASE/0.96.0.2.0/alerts.json,
> >  lines 115-118
> > <https://reviews.apache.org/r/30566/diff/1/?file=845735#file845735line115>
> >
> >     Warning not needed since it has the same value as Critical.
> 
> Yurii Shylov wrote:
>     Then we should also fix those HDFS alerts: namenode_hdfs_blocks_health
>     and namenode_directory_status. They also have warning and critical set
>     to the same value.

Hmmm - perhaps this is because the web client is expecting values for all 3 
states; OK, please add them back in. Then Ship It!


- Jonathan


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/30566/#review71013
-----------------------------------------------------------


On Feb. 5, 2015, 10:02 a.m., Yurii Shylov wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/30566/
> -----------------------------------------------------------
> 
> (Updated Feb. 5, 2015, 10:02 a.m.)
> 
> 
> Review request for Ambari, Jonathan Robie and Srimanth Gunturi.
> 
> 
> Bugs: AMBARI-9458
>     https://issues.apache.org/jira/browse/AMBARI-9458
> 
> 
> Repository: ambari
> 
> 
> Description
> -------
> 
> When a slave component, such as a DataNode, encounters some catastrophic 
> problem like a heap allocation error, and no longer can perform its work, the 
> NameNode marks this DataNode as being unhealthy.
> 
> The current alert definitions only check that the DataNode process is 
> alive, which it technically still is. We need to add new alert definitions 
> for:
> 
> - HDFS/DataNode (runs on NameNode, query is to NameNode JMX)
> - YARN/NodeManager (runs on ResourceManager, query is to ResourceManager JMX)
> - HBase/RegionServer (runs on HBase Master, queries HBase Master JMX)
> 
> These definitions will check for slaves that are in some sort of bad state. 
> Depending on the JMX structures that need to be queried, they can be either 
> METRIC or SCRIPT style alert definitions.
> 
> 
> Diffs
> -----
> 
>   ambari-server/src/main/resources/common-services/HBASE/0.96.0.2.0/alerts.json fa911e1 
>   ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/alerts.json b8a20ac 
>   ambari-server/src/main/resources/common-services/YARN/2.1.0.2.0/alerts.json dc4fafd 
>   ambari-server/src/main/resources/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanagers_summary.py PRE-CREATION 
> 
> Diff: https://reviews.apache.org/r/30566/diff/
> 
> 
> Testing
> -------
> 
> In progress
> 
> 
> Thanks,
> 
> Yurii Shylov
> 
>
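
For readers following the thread, the kind of check the quoted description 
asks for can be sketched roughly as below. This is only an illustration, not 
the code under review: the function name, the threshold values, and the choice 
of the NumLiveDataNodes / NumDeadDataNodes attributes are assumptions, and the 
JMX payload is passed in as a string so the sketch does not depend on a 
running NameNode.

```python
import json

# Hypothetical thresholds; the review does not state the real values.
WARNING_DEAD_NODES = 1
CRITICAL_DEAD_NODES = 2


def evaluate_datanode_health(jmx_payload):
    """Map a NameNode JMX JSON payload to an (state, message) pair.

    Assumes the payload has the usual {"beans": [...]} shape and that the
    first bean carries NumLiveDataNodes and NumDeadDataNodes attributes.
    """
    try:
        beans = json.loads(jmx_payload).get("beans", [])
    except ValueError:
        return ("UNKNOWN", "JMX response was not valid JSON")

    if not beans:
        return ("UNKNOWN", "JMX response contained no beans")

    dead = beans[0].get("NumDeadDataNodes")
    live = beans[0].get("NumLiveDataNodes")
    if dead is None or live is None:
        return ("UNKNOWN", "DataNode counts missing from JMX response")

    message = "DataNodes live: {0}, dead: {1}".format(live, dead)

    # Critical is tested first, so if both thresholds hold the same value
    # the warning branch can never be reached.
    if dead >= CRITICAL_DEAD_NODES:
        return ("CRITICAL", message)
    if dead >= WARNING_DEAD_NODES:
        return ("WARNING", message)
    return ("OK", message)


if __name__ == "__main__":
    sample = json.dumps(
        {"beans": [{"NumLiveDataNodes": 3, "NumDeadDataNodes": 1}]}
    )
    print(evaluate_datanode_health(sample))
```

In this sketch, setting WARNING_DEAD_NODES equal to CRITICAL_DEAD_NODES makes 
the WARNING state unreachable, which is the situation the first review comment 
points out; the definitions may still need to list all three states if the 
web client expects them.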
