Akira AJISAKA created HDFS-8929: ----------------------------------- Summary: Add a metric to expose the timestamp of the last journal Key: HDFS-8929 URL: https://issues.apache.org/jira/browse/HDFS-8929 Project: Hadoop HDFS Issue Type: New Feature Components: journal-node Reporter: Akira AJISAKA
If there are three JNs and only one JN is failing to journal, we can detect it by monitoring the difference of the last written transaction id among JNs from NN WebUI or JN metrics. However, it's difficult to define the threshold to alert because the increase rate of the number of transaction depends on how busy the cluster is. Therefore I'd like to propose a metric to expose the timestamp of the last journal. That way we can easily alert if a JN is failing to journal for some fixed period. -- This message was sent by Atlassian JIRA (v6.3.4#6332)