Hello Murilo,

Complementing Swen's answer: if your host is still up and you can manage it, you could also put it in maintenance mode in ACS. This process evacuates (migrates to other hosts) every VM on the host, not only the ones that have HA enabled. Is this your situation? If not, could you provide more details about your configuration and the state of the environment?
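
If it helps, below is a minimal sketch of what both options (my maintenance-mode suggestion and Swen's declareHostAsDegraded) could look like through the API. It uses the community "cs" Python client purely as an example of a signed client; the endpoint, keys, and host UUID are placeholders you would replace with your own values.

    # pip install cs  -- a community CloudStack API client that signs requests for you
    from cs import CloudStack

    acs = CloudStack(endpoint="https://acs.example.com/client/api",
                     key="<api-key>", secret="<secret-key>")

    # Maintenance mode: ACS evacuates (live-migrates) every VM off the host.
    acs.prepareHostForMaintenance(id="<host-uuid>")

    # Swen's alternative for a Disconnected/Alert host: declare it degraded,
    # so ACS restarts the HA-enabled VMs on other hosts.
    acs.declareHostAsDegraded(id="<host-uuid>")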

Depending on your setup, HA might not work as expected. For VMware and XenServer, the process is expected to happen at the hypervisor level. For KVM, ACS does not support HA in the strict sense; what ACS supports is failover (although it is named HA in ACS), and this process works only when certain criteria are met. Furthermore, there are two ways to implement the failover for ACS + KVM: VM failover and host failover. In both cases, when ACS identifies that a host crashed or a VM suddenly stopped working, it starts the VM on another host.

In ACS + KVM, VM failover requires at least one NFS primary storage; the KVM Agent of every host writes its heartbeat to it. The VM failover is triggered only if the VM's compute offering has the property "Offer HA" enabled OR the global setting "force.ha" is enabled; VRs have their failover triggered independently of the offering or the global setting. In this approach, ACS checks the VM state periodically (sending commands to the KVM Agent) and triggers the failover if the VM meets the previously mentioned criteria AND the limit defined by the global settings "ping.interval" and "ping.timeout" has elapsed.

Bear in mind that, if you lose your host, ACS will trigger the failover; however, if you gracefully shut down the KVM Agent or the host, the Agent sends a disconnect command to the Management Server and ACS stops checking the VM state for that host. Therefore, if you lose the host while the service is down, the failover will not be triggered. Also, if a host loses access to the NFS primary storage used for the heartbeat while the VM uses some other primary storage, ACS might trigger the failover as well. As there is no STONITH/fencing in this scenario, the VM may still be running on the original host while ACS tries to start it on another host.
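
For reference, the offering flag and global settings mentioned above could be inspected or changed through the API as well. Again, this is only a sketch with the "cs" client and placeholder values; double-check the parameters against the API docs of your version.

    from cs import CloudStack

    acs = CloudStack(endpoint="https://acs.example.com/client/api",
                     key="<api-key>", secret="<secret-key>")

    # Timing settings that drive the failover decision.
    print(acs.listConfigurations(name="ping.interval"))
    print(acs.listConfigurations(name="ping.timeout"))

    # Enable the failover for every VM, regardless of its compute offering.
    acs.updateConfiguration(name="force.ha", value="true")

    # Or enable it per offering: "Offer HA" maps to the offerha flag.
    acs.createServiceOffering(name="2c-4g-ha", displaytext="2 vCPU, 4 GB, HA",
                              cpunumber=2, cpuspeed=2000, memory=4096,
                              offerha=True)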

In ACS + KVM, host failover requires configuring OOBM in ACS for each host that should be able to trigger the failover. In this approach, ACS monitors the Agent's state and triggers the failover if it cannot re-establish the connection. In this scenario, ACS shuts down the host via OOBM and starts the VMs on another host; therefore, it does not depend on an NFS primary storage. This behavior is driven by the "kvm.ha.*" global settings. Furthermore, be aware that stopping the Agent might trigger the failover; therefore, it is recommended to disable the failover feature while performing operations on the host (such as package upgrades or other maintenance procedures).
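
Below is a similar sketch for the host failover path: configuring and enabling OOBM, enabling the host HA framework, and disabling it again before planned maintenance. The driver and provider names ("ipmitool", "kvmhaprovider") and the BMC details are assumptions for the example; please verify them against the API docs for your version.

    from cs import CloudStack

    acs = CloudStack(endpoint="https://acs.example.com/client/api",
                     key="<api-key>", secret="<secret-key>")

    host_id = "<host-uuid>"

    # Configure and enable OOBM (IPMI here) so ACS can power off the host.
    acs.configureOutOfBandManagement(hostid=host_id, driver="ipmitool",
                                     address="<bmc-ip>", port="623",
                                     username="<bmc-user>", password="<bmc-password>")
    acs.enableOutOfBandManagementForHost(hostid=host_id)

    # Enable the host HA framework for this host (KVM provider).
    acs.configureHAForHost(hostid=host_id, provider="kvmhaprovider")
    acs.enableHAForHost(hostid=host_id)

    # The behavior is tuned through the "kvm.ha.*" global settings.
    print(acs.listConfigurations(keyword="kvm.ha"))

    # Before planned maintenance on the host, disable it again so stopping
    # the Agent does not trigger a fence/failover.
    acs.disableHAForHost(hostid=host_id)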

Best regards,
Daniel Salvador (gutoveronezi)

On 10/04/2024 03:52, m...@swen.io wrote:
What exactly do you mean? In which state is the host?
If a host is in state "Disconnected" or "Alert" you can declare a host as 
degraded via api (https://cloudstack.apache.org/api/apidocs-4.19/apis/declareHostAsDegraded.html) 
or UI (icon).
CloudStack will then start all VMs with HA enabled on other hosts, if storage is 
accessible.

Regards,
Swen

-----Original Message-----
From: Murilo Moura <a...@bigsys.com.br>
Sent: Wednesday, 10 April 2024 02:10
To: users@cloudstack.apache.org
Subject: Manual fence KVM Host

hey guys!

Is there any way to manually fence a KVM host and then automatically start the 
migration of VMs that have HA enabled?

