[ovirt-users] oVirt Engine VM On Rocky Linux

2022-09-20 Thread Matthew J Black
Hi Everybody (Hi Dr. Nick),

Has anyone attempted to migrate the oVirt Engine VM over to Rocky Linux (v8.6), 
and if so, any "gotchas" we need to know about?

Cheers

Dulux-Oz
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/HW7SBAUNP2BPOE5DNHXKR6NKX74ZPLCY/


[ovirt-users] oVirt & (Ceph) iSCSI

2022-09-20 Thread Matthew J Black
Hi Everybody (Hi Dr. Nick),

So, next question in my on-going saga: *somewhere* in the documentation I read 
that when using oVirt with multiple iSCSI paths (in my case, multiple Ceph 
iSCSI Gateways) we need to set up DM Multipath.

My question is: Is this still relevant information when using oVirt v4.5.2?

Relevant link referred to by the oVirt Documentation:
- 
https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/7/html-single/dm_multipath/

Cheers

Dulux-Oz
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/YFFQGNVKU2VSWGGYBKYAQBRU2NJTUYZ3/


[ovirt-users] Re: Self-hosted-engine timeout and recovering time

2022-09-20 Thread Yedidyah Bar David
On Wed, Sep 21, 2022 at 12:22 AM Marcos Sungaila
 wrote:
>
> Hi all,
>
> I have a cluster running the 4.4.10 release with 6 KVM hosts and 
> Self-Hosted-Engine.

What storage?

> I'm testing some network outage scenarios, and I faced strange behavior.

I suppose you have redundancy in your network.

It's important to clarify (for yourself, mainly) what exactly you
test, what's important, what's expected, etc.

> After disconnecting the KVM hosts hosting the SHE, there was a long timeout 
> until switching the Self-Hosted-Engine to another host as expected.

I suggest studying the ha-agent logs, /var/log/ovirt-hosted-engine-ha/agent.log.

Much of the relevant code is in ovirt_hosted_engine_ha/agent/states.py
(in the git repo, or under /usr/lib/python3.6/site-packages/ on your
machine).

> Also, there took a relatively long time to take over the HA VMs from the 
> failing server.

That's a separate issue, about which I personally know very little.
You might want to start a separate thread about it.

I do know, though, that if you keep the storage connected, the host
might be able to keep updating VM leases on the storage. See e.g.:

https://www.ovirt.org/develop/release-management/features/storage/vm-leases.html

I didn't check the admin guide, but I suppose it has some material about HA VMs.

> Is there a configuration where I can reduce the SHE timeout to make this 
> recover process faster?

IIRC there is nothing user-configurable.

You can see most relevant constants in
ovirt_hosted_engine_ha/agent/constants.py{,.in}.
Nothing stops you from changing them, but please note that this is
somewhat risky, and I strongly suggest to do very careful testing with
your new settings. It might make sense to try to methodically go
through all the possible state changes in the above state machine.

The general assumption is that network and storage, for critical
setups, are redundant, and that the engine itself is not considered
critical, in the sense that if it's dead, all your VMs are still
alive. And also, that it's more important to not corrupt VM disk
images (e.g. by starting the VM concurrently on two hosts) than to
keep the VM alive.

Best regards,
-- 
Didi
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/DLSI7CNZV2Z37CG2VL4U2WSXE2QGIQYS/


[ovirt-users] Self-hosted-engine timeout and recovering time

2022-09-20 Thread Marcos Sungaila
Hi all,

I have a cluster running the 4.4.10 release with 6 KVM hosts and 
Self-Hosted-Engine.
I'm testing some network outage scenarios, and I faced strange behavior. 
After disconnecting the KVM hosts hosting the SHE, there was a long timeout 
until switching the Self-Hosted-Engine to another host as expected.
Also, there took a relatively long time to take over the HA VMs from the 
failing server.
Is there a configuration where I can reduce the SHE timeout to make this 
recover process faster?

Regards, 
Marcos Sungaila
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/6C3XI3O2PJ7OMNDWIOGHYF64IQQBQSSN/


[ovirt-users] Re: Error during deployment of ovirt-engine

2022-09-20 Thread Jonas
Thanks for the detailed explanation! I got it to work by downgrading the 
package as you wrote.


On 9/19/22 08:43, Yedidyah Bar David wrote:

On Sat, Sep 17, 2022 at 4:15 PM  wrote:

I meant what exactly is broken and if there is a workaround or a planned fix.

The breakage happened due to:

- ansible-core recently moved to python 3.9

- oVirt needs several python libraries in its use of ansible, some of
which are packaged by oVirt. Those are not yet built for python 3.9.


I performed a deployment using hosted-engine. This goes on step further, but 
fails while waiting for the host:
[...]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Obtain SSO token using 
username/password credentials]
[ INFO  ] ok: [localhost]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Wait for the host to be up]
[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Notify the user about a 
failure]
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "Host is not 
up, please check logs, perhaps also o
n the engine machine"}
[...]

On the hosted-engine, I see in 
/var/log/ovirt-engine/host-deploy/ovirt-host-deploy-ansible-20220917140938-server-005.admin.int.rabe.ch-937b3c69-a4d5-45c7-af0f-b8705490ff2a.log:
[...]
2022-09-17 14:12:30 CEST - {
   "uuid" : "97891064-a215-4709-b944-ceba2d13b19f",
   "counter" : 391,
   "stdout" : "fatal: [server-005.admin.int.rabe.ch]: FAILED! => {\"msg\": \"The conditional check 
'cluster_switch == \\\"ovs\\\" or (ovn_central is defined and ovn_central | ipaddr)' failed. The error was: The ipaddr 
filter requires python's netaddr be installed on the ansible controller\\n\\nThe error appears to be in 
'/usr/share/ovirt-engine/ansible-runner-service-project/project/roles/ovirt-provider-ovn-driver/tasks/configure.yml': line 3, column 
5, but may\\nbe elsewhere in the file depending on the exact syntax problem.\\n\\nThe offending line appears to be:\\n\\n- block:\\n 
 - name: Install ovs\\n^ here\\n\"}",
[...]

But netaddr is installed:

python38-netaddr might be installed, but python39-netaddr is probably not.


[root@ovirt-engine-test host-deploy]# pip3 install netaddr
WARNING: Running pip install with root privileges is generally not a good idea. 
Try `pip3 install --user` instead.
Requirement already satisfied: netaddr in /usr/lib/python3.6/site-packages

Any ideas what I can do?

For the time being, try to downgrade ansible-core to 2.12, or the last
version requiring python 3.8.

We merged this patch to the engine master branch (to eventually become 4.5.3):

https://github.com/oVirt/ovirt-engine/pull/654

Going forward:

1. We hope to finish porting the engine to CS9 soon, and if this
proves stable, consider dropping support for CS8.

2. We might try, in the time being, to build the missing dependencies
for python 3.9. It's not clear if it's worth the effort.

Best regards,

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/TU7AJCH7UWZUCV7OI66IQKREXIOBUNBI/


[ovirt-users] Re: Snapshot task stuck at oVirt 4.4.8

2022-09-20 Thread Benny Zlotnik
Is it running on the vdsm host? By default live snapshot has a timeout
of 30 minutes

On Tue, Sep 20, 2022 at 11:34 AM  wrote:
>
> Hi,
>
> We're running oVirt 4.4.8 and one of our users tried to create a
> snapshot on a VM. The snapshot task got stuck (not sure why) and since
> then a "locked" icon is being shown on the VM. We need to remove this
> VM, but since it has a pending task, we're unable.
>
> The ovirt-engine log shows hundreds of events like:
>
> [2022-09-20 09:23:09,286+01 INFO
> [org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback]
> (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-27)
> [2769dad5-3ec3-4c46-90a2-924746ea8d97] Command 'CreateSnapshotForVm'
> (id: '4fcb6ab7-2cd7-4a0c-be97-f6979be25bb9') waiting on child command
> id: 'cbb7a2c0-2111-4958-a55d-d48bf2d8591b'
> type:'CreateLiveSnapshotForVm' to complete
>
> An ovirt-engine restart didn't make any difference.
>
> Is there a way to remove this task manually, even changing something in
> the DB?
>
> Thanks.
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/SWVFVEJFQ25N7OK2ZQQOX7FN7KXUS6JB/
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/JFKWPQIHNIKR2ECPHQOXKJ5EVEWKFBAT/


[ovirt-users] Re: Snapshot task stuck at oVirt 4.4.8

2022-09-20 Thread Andrei Verovski
Hi,


I also recommend upgrade to 4.5.x, it fixed a lot of bugs.


> On 20 Sep 2022, at 14:01, Jirka Simon  wrote:
> 
> Hi Nicolas,
> 
> 
> we had the same problem before we upgraded to 4.5.X
> 
> 
> you can check locks with  
> /usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh -t all  -qc
> 
> 
> and then unlock
> 
> 
> /usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh -t snapshot snapshot_id
> 
> 
> or sometime helped just power off and power on the VM and snapshot task 
> failed.
> 
> Jirka
> 
> 
> 
> On 9/20/22 10:26, nico...@devels.es  wrote:
>> Hi, 
>> 
>> We're running oVirt 4.4.8 and one of our users tried to create a snapshot on 
>> a VM. The snapshot task got stuck (not sure why) and since then a "locked" 
>> icon is being shown on the VM. We need to remove this VM, but since it has a 
>> pending task, we're unable. 
>> 
>> The ovirt-engine log shows hundreds of events like: 
>> 
>> [2022-09-20 09:23:09,286+01 INFO  
>> [org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] 
>> (EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-27) 
>> [2769dad5-3ec3-4c46-90a2-924746ea8d97] Command 'CreateSnapshotForVm' (id: 
>> '4fcb6ab7-2cd7-4a0c-be97-f6979be25bb9') waiting on child command id: 
>> 'cbb7a2c0-2111-4958-a55d-d48bf2d8591b' type:'CreateLiveSnapshotForVm' to 
>> complete 
>> 
>> An ovirt-engine restart didn't make any difference. 
>> 
>> Is there a way to remove this task manually, even changing something in the 
>> DB? 
>> 
>> Thanks. 
>> ___ 
>> Users mailing list -- users@ovirt.org  
>> To unsubscribe send an email to users-le...@ovirt.org 
>>  
>> Privacy Statement: https://www.ovirt.org/privacy-policy.html 
>>  
>> oVirt Code of Conduct: 
>> https://www.ovirt.org/community/about/community-guidelines/ 
>>  
>> List Archives: 
>> https://lists.ovirt.org/archives/list/users@ovirt.org/message/SWVFVEJFQ25N7OK2ZQQOX7FN7KXUS6JB/
>>  
>> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/3TXGF7RAKIGMFWW6FR3BBEB5AAC5RT5V/

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/HJNUWJSPW73Q5QFZASXRK3ULLFBN5IDH/


[ovirt-users] Re: VM Down With "Bad Volume Specification"

2022-09-20 Thread Benny Zlotnik
You could stop the disk copy by going to the host performing the copy
and killing the process running `qemu-img convert`, the operation
should then fail

On Mon, Sep 19, 2022 at 9:20 PM Clint Boggio  wrote:
>
> I'm thinking that I need to somehow stop the disk move, and then delete the 
> snapshot that existed prior to my attempt at moving the disk. Can someone 
> please tell me how to stop that disk move ?
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/J2U4M3C6EC3GRXN6FCNW4KSHTGLANGWL/
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5ZLLZXCERK6D2SVBTG3FSIUUZ6MV2UF6/


[ovirt-users] Re: Snapshot task stuck at oVirt 4.4.8

2022-09-20 Thread Jirka Simon

Hi Nicolas,

we had the same problem before we upgraded to 4.5.X

you can check locks with 
/usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh -t all  -qc


and then unlock

/usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh -t snapshot 
snapshot_id


or sometime helped just power off and power on the VM and snapshot task 
failed.


Jirka

On 9/20/22 10:26, nico...@devels.es wrote:

Hi,

We're running oVirt 4.4.8 and one of our users tried to create a 
snapshot on a VM. The snapshot task got stuck (not sure why) and since 
then a "locked" icon is being shown on the VM. We need to remove this 
VM, but since it has a pending task, we're unable.


The ovirt-engine log shows hundreds of events like:

[2022-09-20 09:23:09,286+01 INFO 
[org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] 
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-27) 
[2769dad5-3ec3-4c46-90a2-924746ea8d97] Command 'CreateSnapshotForVm' 
(id: '4fcb6ab7-2cd7-4a0c-be97-f6979be25bb9') waiting on child command 
id: 'cbb7a2c0-2111-4958-a55d-d48bf2d8591b' 
type:'CreateLiveSnapshotForVm' to complete


An ovirt-engine restart didn't make any difference.

Is there a way to remove this task manually, even changing something 
in the DB?


Thanks.
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/SWVFVEJFQ25N7OK2ZQQOX7FN7KXUS6JB/___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/3TXGF7RAKIGMFWW6FR3BBEB5AAC5RT5V/


[ovirt-users] Snapshot task stuck at oVirt 4.4.8

2022-09-20 Thread nicolas

Hi,

We're running oVirt 4.4.8 and one of our users tried to create a 
snapshot on a VM. The snapshot task got stuck (not sure why) and since 
then a "locked" icon is being shown on the VM. We need to remove this 
VM, but since it has a pending task, we're unable.


The ovirt-engine log shows hundreds of events like:

[2022-09-20 09:23:09,286+01 INFO  
[org.ovirt.engine.core.bll.SerialChildCommandsExecutionCallback] 
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-27) 
[2769dad5-3ec3-4c46-90a2-924746ea8d97] Command 'CreateSnapshotForVm' 
(id: '4fcb6ab7-2cd7-4a0c-be97-f6979be25bb9') waiting on child command 
id: 'cbb7a2c0-2111-4958-a55d-d48bf2d8591b' 
type:'CreateLiveSnapshotForVm' to complete


An ovirt-engine restart didn't make any difference.

Is there a way to remove this task manually, even changing something in 
the DB?


Thanks.
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/SWVFVEJFQ25N7OK2ZQQOX7FN7KXUS6JB/