Hi Ken,

On 2/26/20 7:30 AM, Ken Gaillot wrote:
The use case is a large organization with few cluster experts and many
junior system administrators who reboot hosts for OS updates during
planned maintenance windows, without any knowledge of what the host
does. The cluster runs services that have a preferred node and take a
very long time to start.

In this scenario, pacemaker's default behavior of moving the service to
a failover node when the node shuts down, and moving it back when the
node comes back up, results in needless downtime compared to just
leaving the service down for the few minutes needed for a reboot.

1. Do I understand it correctly that scenario will be when system gracefully reboots (pacemaker service is stopped by system shutting down) and also in case that users for example manually stop cluster but doesn't reboot the node - something like `pcs cluster stop`?

If you decide while the node is down that you need the resource to be
recovered, you can manually clear a lock with "crm_resource --refresh"
specifying both --node and --resource.

2. I'm interested how the situation will look like in the 'crm_mon' output or in 'crm_simulate'. Will there be some indication why the resources are not moving like 'blocked-shutdown-lock' or they will just appear as not moving (Stopped)?

Will this look differently from situation where for example the resource is just not allowed by constraint to run on other nodes?

Thanks for heads up

--
Ondrej Famera
_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users

ClusterLabs home: https://www.clusterlabs.org/

Reply via email to