Re: Flink Prometheus metric doubt

Chesnay Schepler Thu, 02 Jan 2020 03:04:49 -0800

In practice the documentation is incorrect. While technically the metric_would_ emit -1 if the job is in a failed/finished state, the reality isthat at this point the metric is unregistered and no longer updated,since the owning component (the jobmanager) is shutting down.


I can't think of a workaround for this problem at the moment.


On 19/12/2019 11:56, Jesús Vásquez wrote:

Hi all, i'm monitoring Flink jobs using prometheus.
I have been trying to use the metricsflink_jobmanager_job_uptime/downtime in order to create an alert, thatfires when one of this values emits -1 since the doc says this is thebehavior of the metric when the job gets to a completed state.The thing is that i have tested the behavior when one of my job failsand the mentioned metrics never emit something different than zero.Finally the metric disappears after the job has failed.
Am i missing something or is this the expected behavior ?

Re: Flink Prometheus metric doubt

Reply via email to