yes, but running a HA prometheus doesn't let me see which prom-instance has the issues, as only 1 is failing
Op dinsdag 16 maart 2021 om 10:35:50 UTC+1 schreef Stuart Clark: > On 16/03/2021 09:04, Tom Liefheid wrote: > > Thanks for your answers, > > > > In my current setup, running prometheus in HA, i have 1 instance who > > can't scrape apps, but the other one can. I want to find out which one > > isn't able to scrape the apps, so i can restart it. i don't see > > anything in the logs that reflect the issues. it would be nice if we > > could 'translate' the output of the /targets page to some kind of > > metric, if that makes sense > All scrapes automatically produce the "up" metric, so a value of 0 would > indicate a failure (as you would see with red sections of the target > page). You should see labels for the job/target which is failing. It can > be a useful metric to alert on, and then look at logs/the target page to > try to figure out why the scrape is failing. > > -- > Stuart Clark > > -- You received this message because you are subscribed to the Google Groups "Prometheus Users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/9a2d6dec-e9bd-4cd5-9c41-f3c1c4a26c4en%40googlegroups.com.

