[ovirt-users] Re: Power failure makes cluster and hosted engine unusable

2021-04-01 Thread Thomas Hoberg
I am glad you got it done! I find that oVirt resembles an adventure game (with all its huge emotional rewards once you prevail) more than a streamlined machine that just works every time you push a button. Those are boring, sure, but really what I am looking for when the mission is to run

[ovirt-users] Re: Power failure makes cluster and hosted engine unusable

2021-04-01 Thread Seann G. Clark via Users
> Hi Thomas, Thanks for looking into this, the problem is really somewhere around this tasks file. However I just tried faking the memory values directly inside the tasks file to something way

[ovirt-users] Re: Power failure makes cluster and hosted engine unusable

2021-04-01 Thread Roman Bednar
Hi Thomas, Thanks for looking into this, the problem is really somewhere around this tasks file. However I just tried faking the memory values directly inside the tasks file to something way higher and everything looks fine. I think the problem resides in registering the output of the "free -m"
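For readers who want to check what "free -m" reports on their own host, here is a minimal Python sketch of pulling the "free" and "buff/cache" columns out of its output. It assumes the column layout printed by current procps-ng versions of free; the sample text and its numbers are made up for illustration, and parse_free_m is a hypothetical helper, not part of the ovirt.hosted_engine_setup role.

```python
# Hypothetical sample of `free -m` output; the numbers are invented for illustration.
SAMPLE = """\
              total        used        free      shared  buff/cache   available
Mem:         515655       20480      480000        1024       15175      492000
Swap:          4095           0        4095
"""


def parse_free_m(text):
    """Return the 'Mem:' row of `free -m` output as a {column: MiB} dict."""
    lines = text.strip().splitlines()
    if not lines:
        raise ValueError("empty free -m output")
    header = lines[0].split()  # column names from the first line
    for line in lines[1:]:
        if line.startswith("Mem:"):
            values = [int(v) for v in line.split()[1:]]
            return dict(zip(header, values))
    raise ValueError("no 'Mem:' row found in free -m output")


mem = parse_free_m(SAMPLE)
print(mem["free"], mem["buff/cache"])
```

Raising an error on empty or unexpected output, rather than falling back to a default, makes a failed capture visible instead of letting it flow into later arithmetic.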

[ovirt-users] Re: Power failure makes cluster and hosted engine unusable

2021-03-31 Thread Thomas Hoberg
Roman, I believe the bug is in /usr/share/ansible/roles/ovirt.hosted_engine_setup/tasks/pre_checks/validate_memory_size.yml

    - name: Set Max memory
      set_fact:
        max_mem: "{{ free_mem.stdout|int + cached_mem.stdout|int - he_reserved_memory_MB + he_avail_memory_grace_MB }}"

If these
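The arithmetic in that set_fact expression can be sketched in Python as follows. This is only an illustration under stated assumptions: jinja_int is a hypothetical stand-in for Jinja2's int filter (which falls back to a default of 0 when conversion fails), and the values 512 and 200 for he_reserved_memory_MB and he_avail_memory_grace_MB are placeholders, not confirmed defaults of the role. The thread does not establish that this is what actually goes wrong.

```python
def jinja_int(value, default=0):
    """Hypothetical stand-in for Jinja2's `int` filter: unparsable input yields `default`."""
    try:
        return int(value)
    except (TypeError, ValueError):
        try:
            return int(float(value))  # e.g. "12.5" becomes 12
        except (TypeError, ValueError):
            return default


def max_mem(free_stdout, cached_stdout, reserved_mb, grace_mb):
    """Mirror the expression: free + cached - reserved + grace (all in MB)."""
    return jinja_int(free_stdout) + jinja_int(cached_stdout) - reserved_mb + grace_mb


# Placeholder values, chosen only to show the effect of a bad capture.
ok = max_mem("480000", "15175", reserved_mb=512, grace_mb=200)
bad = max_mem("", "15175", reserved_mb=512, grace_mb=200)  # empty stdout counts as 0
print(ok, bad)
```

The point of the sketch is that a registered stdout which is empty or not a plain number does not raise an error in such an expression; it quietly counts as 0, so the resulting max_mem can end up far below what the host really has.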

[ovirt-users] Re: Power failure makes cluster and hosted engine unusable

2021-03-31 Thread Thomas Hoberg
Yup, that's a bug in the Ansible code that I've come across on hosts that had 512GB of RAM. I quite simply deleted the checks from the Ansible code and re-ran the wizard. I can't read YAML or Python or whatever it is that Ansible uses, but my impression is that things are 'cast' or converted into

[ovirt-users] Re: Power failure makes cluster and hosted engine unusable

2021-03-30 Thread Vincent Royer
Seann, if this happens again, try doing nothing (seriously). Each time I've had a power failure, the engine has taken a really long time to come back up. I don't know if it's by design or what. Host logs are flooded with errors, everything seemingly storage-related. However, my Gluster setup is on

[ovirt-users] Re: Power failure makes cluster and hosted engine unusable

2021-03-30 Thread Roman Bednar
Hi Seann,

On Mon, Mar 29, 2021 at 8:31 PM Seann G. Clark via Users wrote:
> All,
>
> After a power failure, and generator failure I lost my cluster, and the
> Hosted engine refused to restart after power was restored. I would expect,
> once storage comes up that the hosted engine comes