[ceph-users] RFI: Prometheus, Etc, Services - Optimum Number To Run

duluxoz Fri, 19 Jan 2024 21:42:26 -0800

Hi All,

In regards to the monitoring services on a Ceph Cluster (ie Prometheus,Grafana, Alertmanager, Loki, Node-Exported, Promtail, etc) how manyinstances should/can we run for fault tolerance purposes? I can't seemto recall that advice being in the doco anywhere (but of course, Iprobably missed it).

I'm concerned about HA on those services - will they continue to run ifthe Ceph Node they're on fails?

At the moment we're running only 1 instance of each in the cluster, butseveral Ceph Nodes are capable of running each - ie/eg 3 nodesconfigured but only count:1.

This is on the latest version of Reef using cephadmin (if it makes ahuge difference :-) ).

So any advice, etc, would be greatly appreciated, including if we shouldbe running any services not mentioned (not Mgr, Mon, OSD, or iSCSI,obviously :-) )


Cheers

Dulux-Oz
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

[ceph-users] RFI: Prometheus, Etc, Services - Optimum Number To Run

Reply via email to