On Tue, 2024-07-16 at 11:18 +0000, S Sathish S via Users wrote: > Hi Team, > > In our product we have 9 nodes pacemaker cluster setup non-DC nodes > reboot parallelly. Most of nodes join cluster properly and only one > node pacemaker and corosync service is not came up properly with > below error message. > > Error Message: > Error: error running crm_mon, is pacemaker running? > crm_mon: Connection to cluster failed: Connection refused
All that indicates is that Pacemaker is not responding. You'd have to look at the system log and/or pacemaker.log from that time to find out more. > > Query : Is it recommended to reboot parallelly of non-DC nodes ? As long as they are cleanly rebooted, there should be no fencing or other actual problems. However the cluster will lose quorum and have to stop all resources. If you reboot less than half of the nodes at one time and wait for them to rejoin before rebooting more, you would avoid that. > > Thanks and Regards, > S Sathish S > _______________________________________________ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot <kgail...@redhat.com> _______________________________________________ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/