Hi,

Before I got sick I was trying to do the same thing and noticed the same as 
you, here it noticed the server being offline in 10 minutes or so, but even 
when the server got back it still thought all those vm’s were working while a 
virsh list on the hypervisor clearly showed it had no running vm’s.

I get the same when doing a shutdown -h  on a vm, it takes cloudstack up to 10 
minutes before it sees that the vm is offline, how can we shorten this time 
drastically? As now when I need to do a change to a vm I shut it down and then 
need to wait all that time before I can make the change and boot it again.

--
Jimmy

From: Jeroen Kleijer <jeroen.klei...@gmail.com>
Date: Saturday, 15 February 2025 at 08:40
To: users@cloudstack.apache.org <users@cloudstack.apache.org>
Subject: Hypervisor failure
Hi all,

We're running tests on our Apache CloudStack (4.19.1.2) environment where
our hypervisors are running KVM. We've noticed that when we pull the plug
on a hypervisor, it can take ACS up to an hour(!) before it finally notices
that the hypervisor is down and changes the state to DOWN and in the
meantime, it considers the VM's that were running on it still available.
This leads us to two questions:
1) which variables need to be tweaked to make ACS notice something like
this in just a couple of minutes instead of more than an hour?
2) why are these values so high? An hour before ACS defines an agent
offline seems very long.

Kind regards,

Jeroen Kleijer

Reply via email to