The problem appears to be MTU related, I may have a network configuration
problem. Setting back to 1500 mtu seems to have solved it for now

On Thu, May 27, 2021 at 2:26 PM Jayme <jay...@gmail.com> wrote:

> I've gotten a bit further. I have a separate 10Gbe network for GlusterFS
> traffic which was also set as the migration network. I disabled migration
> on GlusterFS network and enabled on default management network and now
> migration seems to be working. I'm not sure why at this point, it used to
> work fine on GlusterFS migration network in the past.
>
> On Thu, May 27, 2021 at 2:11 PM Jayme <jay...@gmail.com> wrote:
>
>> I have a three node oVirt 4.4.5 cluster running oVirt node hosts. Storage
>> is mix of GlusterFS and NFS. Everything has been running smoothly, but the
>> other day I noticed many VMs had invalid snapshots. I run a script to
>> export OVA for VMs for backup purposes, exports seemed to have been fine
>> but snapshots failed to delete at the end. I was able to manually delete
>> the snapshots through oVirt admin GUI without any errors/warnings and the
>> VMs have been running fine and can restart them without problems.
>>
>> I thought this problem may be due to snapshot bug which is supposedly
>> fixed in oVirt 4.4.6. I decided to start upgrading cluster to 4.4.6 and am
>> now having a problem with VMs not being able to migrate.
>>
>> When I migrate any VM (doesn't seem to matter which host to and from) the
>> process starts but stops at 0-1%. Eventually after 15-30 minutes or more
>> the tasks are all completed by the VM is not migrated.
>>
>> I am unable to migrate any VMs and as such I cannot place any host in
>> maintenance mode.
>>
>> I've attaching some VDSM logs from source and destination hosts, these
>> were after initiating a migration of a single VM
>>
>> I'm seeing some errors in the logs regarding the migration stalling, but
>> not able to determine why its stalling.
>>
>> 2021-05-27 17:10:22,167+0000 INFO  (jsonrpc/4) [api.host] FINISH
>> getAllVmIoTunePolicies return={'status': {'code': 0, 'message': 'Done'},
>> 'io_tune_policies_dict': {'f8f4e4a1-b565-4663-8962-c8804dbb86fb':
>> {'policy': [], 'current_values': [{'name': 'vda', 'path':
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme1n1/bce04425-1d25-4489-bdab-2834a1a57db8/images/38b27cce-c744-4a12-85a3-3af07d386da2/93c1e793-f8cb-42c9-86a6-0e9ce4a6023a',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '2b87204f-f695-474a-9f08-47b85fcac366': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/f2e0c9f3-ab0d-441a-85a6-07a42e78b5a8/848f353e-6787-4e20-ab7b-0541ebd852c6',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '26332421-54a3-4afc-90e7-551a7e314c80': {'policy': [], 'current_values':
>> [{'name': 'vda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/b7a785f9-307b-42af-9bbe-23cac884fe97/ed1d027e-a36a-4e6b-9207-119915044e06',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '60edbd80-dad7-4bf8-8fd1-e138413cf9f6': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/535fcb2e-ece9-4d50-86fe-bf6264d11ae1/6c01a036-8a14-46ba-a4b4-fe4f66a586a3',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdb', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/1f467fb5-5ea7-42ba-bace-f175c86791b2/cbe8327f-9b7f-442f-a650-6888bb11a674',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdd', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/c93956d5-c88d-41f9-8c38-9f5f62cc90dd/3920b46c-5fab-4b63-b47f-2fa5c6714c36',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> 'beeefe06-78a0-4e14-a932-cc8d734d542d': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path':
>> '/rhev/data-center/mnt/glusterSD/gluster0.grove.silverorange.com:_data__sdb/30fd0a2f-ab42-4a8a-8f0b-67242dc2d15d/images/310d8b3e-d578-418d-9802-dc0ebcea06d6/aa758c51-8478-4273-aeef-d4b374b8d6b4',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdb', 'path':
>> '/rhev/data-center/mnt/glusterSD/gluster0.grove.silverorange.com:_data__sdb/30fd0a2f-ab42-4a8a-8f0b-67242dc2d15d/images/4072fda1-ec82-45c9-b353-91fceb13bf08/891f5982-dead-48b4-8907-caa1e309fa82',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '7e5156de-649d-4904-9092-21a699242a37': {'policy': [], 'current_values':
>> [{'name': 'vda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/ca0c1208-a7aa-4ef6-a450-4a40bd4455f3/a2335199-ddd4-429b-b55d-f4d527081fd3',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]}}}
>> from=::1,35012 (api:54)
>> 2021-05-27 17:10:31,118+0000 WARN  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration stalling: remaining
>> (32863MiB) > lowmark (32863MiB). (migration:801)
>> 2021-05-27 17:10:31,118+0000 INFO  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration Progress: 190.035
>> seconds elapsed, 1% of data processed, total data: 32864MB, processed data:
>> 0MB, remaining data: 32863MB, transfer speed 0Mbps, zero pages: 160MB,
>> compressed: 0MB, dirty rate: 0, memory iteration: 1 (migration:814)
>> 2021-05-27 17:10:33,827+0000 INFO  (jsonrpc/5) [throttled] Current
>> getAllVmStats: {'f8f4e4a1-b565-4663-8962-c8804dbb86fb': 'Up',
>> '2b87204f-f695-474a-9f08-47b85fcac366': 'Up',
>> '26332421-54a3-4afc-90e7-551a7e314c80': 'Up',
>> '60edbd80-dad7-4bf8-8fd1-e138413cf9f6': 'Up',
>> 'beeefe06-78a0-4e14-a932-cc8d734d542d': 'Up',
>> '7e5156de-649d-4904-9092-21a699242a37': 'Migration Source'}
>> (throttledlog:104)
>> 2021-05-27 17:10:37,186+0000 INFO  (jsonrpc/5) [api.host] FINISH
>> getAllVmIoTunePolicies return={'status': {'code': 0, 'message': 'Done'},
>> 'io_tune_policies_dict': {'f8f4e4a1-b565-4663-8962-c8804dbb86fb':
>> {'policy': [], 'current_values': [{'name': 'vda', 'path':
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme1n1/bce04425-1d25-4489-bdab-2834a1a57db8/images/38b27cce-c744-4a12-85a3-3af07d386da2/93c1e793-f8cb-42c9-86a6-0e9ce4a6023a',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '2b87204f-f695-474a-9f08-47b85fcac366': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/f2e0c9f3-ab0d-441a-85a6-07a42e78b5a8/848f353e-6787-4e20-ab7b-0541ebd852c6',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '26332421-54a3-4afc-90e7-551a7e314c80': {'policy': [], 'current_values':
>> [{'name': 'vda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/b7a785f9-307b-42af-9bbe-23cac884fe97/ed1d027e-a36a-4e6b-9207-119915044e06',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '60edbd80-dad7-4bf8-8fd1-e138413cf9f6': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/535fcb2e-ece9-4d50-86fe-bf6264d11ae1/6c01a036-8a14-46ba-a4b4-fe4f66a586a3',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdb', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/1f467fb5-5ea7-42ba-bace-f175c86791b2/cbe8327f-9b7f-442f-a650-6888bb11a674',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdd', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/c93956d5-c88d-41f9-8c38-9f5f62cc90dd/3920b46c-5fab-4b63-b47f-2fa5c6714c36',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> 'beeefe06-78a0-4e14-a932-cc8d734d542d': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path':
>> '/rhev/data-center/mnt/glusterSD/gluster0.grove.silverorange.com:_data__sdb/30fd0a2f-ab42-4a8a-8f0b-67242dc2d15d/images/310d8b3e-d578-418d-9802-dc0ebcea06d6/aa758c51-8478-4273-aeef-d4b374b8d6b4',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdb', 'path':
>> '/rhev/data-center/mnt/glusterSD/gluster0.grove.silverorange.com:_data__sdb/30fd0a2f-ab42-4a8a-8f0b-67242dc2d15d/images/4072fda1-ec82-45c9-b353-91fceb13bf08/891f5982-dead-48b4-8907-caa1e309fa82',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '7e5156de-649d-4904-9092-21a699242a37': {'policy': [], 'current_values':
>> [{'name': 'vda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/ca0c1208-a7aa-4ef6-a450-4a40bd4455f3/a2335199-ddd4-429b-b55d-f4d527081fd3',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]}}}
>> from=::1,35012 (api:54)
>> 2021-05-27 17:10:41,120+0000 WARN  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration stalling: remaining
>> (32863MiB) > lowmark (32863MiB). (migration:801)
>> 2021-05-27 17:10:41,120+0000 INFO  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration Progress: 200.037
>> seconds elapsed, 1% of data processed, total data: 32864MB, processed data:
>> 0MB, remaining data: 32863MB, transfer speed 0Mbps, zero pages: 160MB,
>> compressed: 0MB, dirty rate: 0, memory iteration: 1 (migration:814)
>> 2021-05-27 17:10:51,121+0000 WARN  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration stalling: remaining
>> (32863MiB) > lowmark (32863MiB). (migration:801)
>> 2021-05-27 17:10:51,121+0000 INFO  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration Progress: 210.039
>> seconds elapsed, 1% of data processed, total data: 32864MB, processed data:
>> 0MB, remaining data: 32863MB, transfer speed 0Mbps, zero pages: 160MB,
>> compressed: 0MB, dirty rate: 0, memory iteration: 1 (migration:814)
>> 2021-05-27 17:10:52,211+0000 INFO  (jsonrpc/1) [api.host] FINISH
>> getAllVmIoTunePolicies return={'status': {'code': 0, 'message': 'Done'},
>> 'io_tune_policies_dict': {'f8f4e4a1-b565-4663-8962-c8804dbb86fb':
>> {'policy': [], 'current_values': [{'name': 'vda', 'path':
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme1n1/bce04425-1d25-4489-bdab-2834a1a57db8/images/38b27cce-c744-4a12-85a3-3af07d386da2/93c1e793-f8cb-42c9-86a6-0e9ce4a6023a',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '2b87204f-f695-474a-9f08-47b85fcac366': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/f2e0c9f3-ab0d-441a-85a6-07a42e78b5a8/848f353e-6787-4e20-ab7b-0541ebd852c6',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '26332421-54a3-4afc-90e7-551a7e314c80': {'policy': [], 'current_values':
>> [{'name': 'vda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/b7a785f9-307b-42af-9bbe-23cac884fe97/ed1d027e-a36a-4e6b-9207-119915044e06',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '60edbd80-dad7-4bf8-8fd1-e138413cf9f6': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/535fcb2e-ece9-4d50-86fe-bf6264d11ae1/6c01a036-8a14-46ba-a4b4-fe4f66a586a3',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdb', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/1f467fb5-5ea7-42ba-bace-f175c86791b2/cbe8327f-9b7f-442f-a650-6888bb11a674',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdd', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme2n1/a7efa448-201b-4453-9bc9-900559b891ca/images/c93956d5-c88d-41f9-8c38-9f5f62cc90dd/3920b46c-5fab-4b63-b47f-2fa5c6714c36',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> 'beeefe06-78a0-4e14-a932-cc8d734d542d': {'policy': [], 'current_values':
>> [{'name': 'sda', 'path':
>> '/rhev/data-center/mnt/glusterSD/gluster0.grove.silverorange.com:_data__sdb/30fd0a2f-ab42-4a8a-8f0b-67242dc2d15d/images/310d8b3e-d578-418d-9802-dc0ebcea06d6/aa758c51-8478-4273-aeef-d4b374b8d6b4',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}, {'name':
>> 'sdb', 'path':
>> '/rhev/data-center/mnt/glusterSD/gluster0.grove.silverorange.com:_data__sdb/30fd0a2f-ab42-4a8a-8f0b-67242dc2d15d/images/4072fda1-ec82-45c9-b353-91fceb13bf08/891f5982-dead-48b4-8907-caa1e309fa82',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]},
>> '7e5156de-649d-4904-9092-21a699242a37': {'policy': [], 'current_values':
>> [{'name': 'vda', 'path': 
>> '/rhev/data-center/mnt/10.11.0.9:_vmstorage_nvme0n1/a99cd663-f6d5-42d8-bd7a-ee0b5d068608/images/ca0c1208-a7aa-4ef6-a450-4a40bd4455f3/a2335199-ddd4-429b-b55d-f4d527081fd3',
>> 'ioTune': {'total_bytes_sec': 0, 'read_bytes_sec': 0, 'write_bytes_sec': 0,
>> 'total_iops_sec': 0, 'write_iops_sec': 0, 'read_iops_sec': 0}}]}}}
>> from=::1,35012 (api:54)
>> 2021-05-27 17:11:01,123+0000 WARN  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration stalling: remaining
>> (32863MiB) > lowmark (32863MiB). (migration:801)
>> 2021-05-27 17:11:01,123+0000 INFO  (migmon/7e5156de) [virt.vm]
>> (vmId='7e5156de-649d-4904-9092-21a699242a37') Migration Progress: 220.041
>> seconds elapsed, 1% of data processed, total data: 32864MB, processed data:
>> 0MB, remaining data: 32863MB, transfer speed 0Mbps, zero pages: 160MB,
>> compressed: 0MB, dirty rate: 0, memory iteration: 1 (migration:814)ats
>> return={'86245648-abd8-46e3-9c10-432e8788a074': {'code': 0, 'lastCheck':
>> '1.6', 'delay': '0.00353497', 'valid': True, 'version': 5, 'acquired':
>> True, 'actual': True}} from=::1,35010,
>> task_id=c4e65f55-1367-41d3-9bf6-f357a382df4a (api:54)
>> 2021-05-27 17:09:33,156+0000 INFO  (jsonrpc/2) [api.host] START
>> getStats() from=::ffff:10.11.0.219,54952 (api:48)
>>
>
_______________________________________________
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/WMQP7DPUQGZHJHZTBQ3FO3F3ZF3NG2F5/

Reply via email to