Hi Matthew, You might need to sync back the master version and domain between the engine and vdsm. To verify those parameters on vdsm, run this command on the SPM host: vdsm-client StoragePool getInfo storagepoolID="f72ec125-69a1-4c1b-a5e1-313fcb70b6ff"
The result should be something like: "info": { "domains": "1234:Active,5678:Active,91011:Active", "isoprefix": "", "lver": 6, * "master_uuid": "123", "master_ver": 14,* "name": "No Description", "pool_status": "connected", "spm_id": 1, "type": "NFS", "version": "5" } Then, compare the master version value with the engine: engine=> select * from storage_pool where id = 'f72ec125-69a1-4c1b-a5e1-313fcb70b6ff'; And the master domain: engine=> select * from storage_domains where storage_pool_id='f72ec125-69a1-4c1b-a5e1-313fcb70b6ff' and storage_domain_type='0'; (0 means master, for reference, see https://github.com/oVirt/ovirt-engine/blob/a65cf0eae8858ab2278c3f537dc427e3ff20eba7/backend/manager/modules/common/src/main/java/org/ovirt/engine/core/common/businessentities/StorageDomainType.java ) Then we can get the bigger picture (and update the engine data to match the vdsm) *Regards,* *Shani Leviim* On Thu, Jul 29, 2021 at 8:40 PM Matthew Benstead <matth...@uvic.ca> wrote: > Thanks Shani - yes we plan to upgrade to 4.4 in the future, but we're on > 4.3 right now due to only running CentOS 7 at the moment. > > I was able to clear the job from the SPM: > > [root@daccs01 ~]# vdsm-client Host getAllTasksStatuses > { > "5fa9edf0-56c3-40e4-9327-47bf7764d28d": { > "message": "1 jobs completed successfully", > "code": 0, > "taskID": "5fa9edf0-56c3-40e4-9327-47bf7764d28d", > "taskResult": "success", > "taskState": "finished" > } > } > [root@daccs01 ~]# vdsm-client Task clear > taskID=5fa9edf0-56c3-40e4-9327-47bf7764d28d > true > [root@daccs01 ~]# vdsm-client Host getAllTasksStatuses > {} > > And confirm there were no async_tasks: > > engine=# select * from async_tasks; > task_id | action_type | status | result | step_id | command_id | > started_at | storage_pool_id | task_type | vdsm_task_id | root_command_id | > user_id > > ---------+-------------+--------+--------+---------+------------+------------+-----------------+-----------+--------------+-----------------+--------- > (0 rows) > > > However, when putting the vm-storage-ssd domain into maintenance mode, it > failed again: > > > > > > > > > > Here are some the logs entries - anything else I can look at? > > > 2021-07-29 10:30:37,848-07 ERROR > [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] EVENT_ID: > VDS_BROKER_COMMAND_FAILURE(10,802), > VDSM compute7.pcic.uvic.ca command ConnectStoragePoolVDS failed: Wrong > Master domain or its version: u'SD=a5a83df1-47e2-4927-9add-079199ca7ef8, > pool=f72ec125-69a1-4c1b-a5e1-313fcb70b6ff' > 2021-07-29 10:30:37,848-07 INFO > [org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] Command > 'org.ovirt.engine.core.vdsbroker.vd > sbroker.ConnectStoragePoolVDSCommand' return value 'StatusOnlyReturn > [status=Status [code=324, message=Wrong Master domain or its version: > u'SD=a5a83df1-47e2-4927-9add-079199ca7ef8, pool=f72ec125-69a1-4c1b-a5e1- > 313fcb70b6ff']]' > ... > 2021-07-29 10:30:37,848-07 INFO > [org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] HostName = > compute7.pcic.uvic.ca > 2021-07-29 10:30:37,849-07 ERROR > [org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] Command > 'ConnectStoragePoolVDSCommand(HostN > ame = compute7.pcic.uvic.ca, > ConnectStoragePoolVDSCommandParameters:{hostId='51769733-0cf6-4270-8288-ec96474b7609', > vdsId='51769733-0cf6-4270-8288-ec96474b7609', > storagePoolId='f72ec125-69a1-4c1b-a5e1-313fcb70b6 > ff', masterVersion='288'})' execution failed: IRSGenericException: > IRSErrorException: IRSNoMasterDomainException: Wrong Master domain or its > version: u'SD=a5a83df1-47e2-4927-9add-079199ca7ef8, pool=f72ec125-69a1 > -4c1b-a5e1-313fcb70b6ff' > ... > 2021-07-29 10:30:37,849-07 ERROR > [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] > IrsBroker::Failed::DeactivateStorageDomainVDS: IRSGener > icException: IRSErrorException: IRSNoMasterDomainException: Wrong Master > domain or its version: u'SD=a5a83df1-47e2-4927-9add-079199ca7ef8, > pool=f72ec125-69a1-4c1b-a5e1-313fcb70b6ff' > 2021-07-29 10:30:37,855-07 INFO > [org.ovirt.engine.core.vdsbroker.irsbroker.DeactivateStorageDomainVDSCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] FINISH, > DeactivateStorageDomainVDSComm > and, return: , log id: 1c215ca4 > 2021-07-29 10:30:37,855-07 ERROR > [org.ovirt.engine.core.bll.storage.domain.DeactivateStorageDomainCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] > DeactivateStorageDomainVDS failed 'a5a83df > 1-47e2-4927-9add-079199ca7ef8': > org.ovirt.engine.core.common.errors.EngineException: EngineException: > org.ovirt.engine.core.vdsbroker.irsbroker.IRSNoMasterDomainException: > IRSGenericException: IRSErrorException: > IRSNoMasterDomainException: Wrong Master domain or its version: > u'SD=a5a83df1-47e2-4927-9add-079199ca7ef8, > pool=f72ec125-69a1-4c1b-a5e1-313fcb70b6ff' (Failed with error > StoragePoolWrongMaster and code 324) > at > org.ovirt.engine.core.bll.VdsHandler.handleVdsResult(VdsHandler.java:118) > [bll.jar:] > at > org.ovirt.engine.core.bll.VDSBrokerFrontendImpl.runVdsCommand(VDSBrokerFrontendImpl.java:33) > [bll.jar:] > at > org.ovirt.engine.core.bll.CommandBase.runVdsCommand(CommandBase.java:2112) > [bll.jar:] > at > org.ovirt.engine.core.bll.storage.domain.DeactivateStorageDomainCommand.dectivateStorageDomain(DeactivateStorageDomainCommand.java:340) > [bll.jar:] > ... > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > [rt.jar:1.8.0_292] > at java.lang.Thread.run(Thread.java:748) [rt.jar:1.8.0_292] > at > org.glassfish.enterprise.concurrent.ManagedThreadFactoryImpl$ManagedThread.run(ManagedThreadFactoryImpl.java:250) > [javax.enterprise.concurrent-1.0.jar:] > Caused by: > org.ovirt.engine.core.vdsbroker.irsbroker.IRSNoMasterDomainException: > IRSGenericException: IRSErrorException: IRSNoMasterDomainException: Wrong > Master domain or its version: u'SD=a5a83df1-47e2-4927-9add-079199ca7ef8, > pool=f72ec125-69a1-4c1b-a5e1-313fcb70b6ff' > at > org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase.proceedProxyReturnValue(BrokerCommandBase.java:50) > [vdsbroker.jar:] > at > org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand.proceedConnectProxyReturnValue(ConnectStoragePoolVDSCommand.java:48) > [vdsbroker.jar:] > at > org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand.proceedProxyReturnValue(ConnectStoragePoolVDSCommand.java:36) > [vdsbroker.jar:] > ... > at > org.jboss.weld.bean.proxy.CombinedInterceptorAndDecoratorStackMethodHandler.invoke(CombinedInterceptorAndDecoratorStackMethodHandler.java:79) > [weld-core-impl-3.1.1.Final.jar:3.1.1.Final] > at > org.jboss.weld.bean.proxy.CombinedInterceptorAndDecoratorStackMethodHandler.invoke(CombinedInterceptorAndDecoratorStackMethodHandler.java:68) > [weld-core-impl-3.1.1.Final.jar:3.1.1.Final] > at > org.ovirt.engine.core.vdsbroker.ResourceManager$Proxy$_$$_WeldSubclass.runVdsCommand(Unknown > Source) [vdsbroker.jar:] > ... 94 more > > 2021-07-29 10:30:37,861-07 ERROR > [org.ovirt.engine.core.bll.storage.domain.DeactivateStorageDomainCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] Failed to > deactivate storage domain 'a5a83df1-47e2-4927-9add-079199ca7ef8' > 2021-07-29 10:30:37,868-07 INFO > [org.ovirt.engine.core.bll.CommandCompensator] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] Command > [id=c63199f8-a720-4053-8e5c-92c8d21e0ce2]: Compensating CHANGED_STATUS_ONLY > of org.ovirt.engine.core.common.businessentities.StoragePoolIsoMap; > snapshot: > EntityStatusSnapshot:{id='StoragePoolIsoMapId:{storagePoolId='f72ec125-69a1-4c1b-a5e1-313fcb70b6ff', > storageId='a5a83df1-47e2-4927-9add-079199ca7ef8'}', status='Unknown'}. > 2021-07-29 10:30:37,882-07 ERROR > [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] > (EE-ManagedThreadFactory-engineScheduled-Thread-25) [35c5b47] EVENT_ID: > USER_DEACTIVATE_STORAGE_DOMAIN_FAILED(969), Failed to deactivate Storage > Domain vm-storage-ssd (Data Center EDC2). > 2021-07-29 10:30:37,884-07 WARN > [org.ovirt.engine.core.bll.storage.pool.ReconstructMasterDomainCommand] > (EE-ManagedThreadFactory-engine-Thread-25) [60d33d] Validation of action > 'ReconstructMasterDomain' failed for user SYSTEM. Reasons: > VAR__ACTION__RECONSTRUCT_MASTER,VAR__TYPE__STORAGE__DOMAIN,ACTION_TYPE_FAILED_STORAGE_DOMAIN_STATUS_ILLEGAL2,$status > Locked > 2021-07-29 10:30:37,888-07 INFO > [org.ovirt.engine.core.bll.eventqueue.EventQueueMonitor] > (EE-ManagedThreadFactory-engine-Thread-48) [35c5b47] Finished reconstruct > for pool 'f72ec125-69a1-4c1b-a5e1-313fcb70b6ff'. Clearing event queue > 2021-07-29 10:30:37,899-07 INFO > [org.ovirt.engine.core.vdsbroker.vdsbroker.HSMGetAllTasksStatusesVDSCommand] > (EE-ManagedThreadFactory-engineScheduled-Thread-50) [] Command > 'org.ovirt.engine.core.vdsbroker.vdsbroker.HSMGetAllTasksStatusesVDSCommand' > return value ' > TaskStatusListReturn:{status='Status [code=654, message=Not SPM]'} > > > Thanks, > -Matthew > > On 7/29/21 2:52 AM, Shani Leviim wrote: > > Notice: This message was sent from outside the University of Victoria > email system. Please be cautious with links and sensitive information. > > Hi Matthew, > Actually, your description is related to 2 features available for ovirt > 4.4.5 <https://www.ovirt.org/release/4.4.5/> > 1. The ability to switch the master storage domain while domains are up > and running [1] > 2. Clearing the finished tasks from REST API [2] and UI [3]. > > We recommend you upgrade your engine to enjoy those features. > > In the meanwhile, as you've described, moving the Master role from one > storage to the other is available using putting the domain into maintenance. > In order to clear the finished tasks from SPM: > vdsm-client Host getAllTasksStatuses > > It should be something like that: > { > "1dc4d885-577a-4b6a-b01f-e682602a907c": { > "code": 0, > "message": "1 jobs completed successfully", > "taskID": "1dc4d885-577a-4b6a-b01f-e682602a907c", > "taskResult": "success", > "taskState": "finished" > } > } > > Then clear that tasks: > vdsm-client Task clear taskID=12345 > Once it gets cleared, the reconstruction can be finished. > > To verify there are no more finished async tasks, you can run this SQL > query on the engine: > engine=# select * from async_tasks WHERE storage_pool_id = '123'; > > [1] https://bugzilla.redhat.com/show_bug.cgi?id=1910022 > [2] https://bugzilla.redhat.com/show_bug.cgi?id=1627997 > [3] https://bugzilla.redhat.com/show_bug.cgi?id=1910302 > > > *Regards, * > > *Shani Leviim * > > > On Thu, Jul 29, 2021 at 8:33 AM Matthew Benstead <matth...@uvic.ca> wrote: > >> Hello, >> >> I'm trying to decommission the old master storage domain in ovirt, and >> replace it with a new one. All of the VMs have been migrated off of the old >> master, and everything has been running on the new storage domain for a >> couple months. But when I try to put the old domain into maintenance mode I >> get an error. >> >> Old Master: vm-storage-ssd >> New Domain: vm-storage-ssd2 >> >> The error is: >> >> Failed to Reconstruct Master Domain for Data Center EDC2 >> >> As well as: >> >> Sync Error on Master Domain between Host daccs01 and oVirt Engine. >> Domain: vm-storage-ssd is marked as Master in oVirt Engine database but not >> on the Storage side. Please consult with Support on how to fix this issue. >> >> 2021-07-28 11:41:34,870-07 WARN >> [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxy] >> (EE-ManagedThreadFactory-engine-Thread-23) [] Master domain version is not >> in sync between DB and VDSM. Domain vm-storage-ssd >> marked as master, but the version in DB: 283 and in VDSM: 280 >> >> And: >> >> Not stopping SPM on vds daccs01, pool id >> f72ec125-69a1-4c1b-a5e1-313fcb70b6ff as there are uncleared tasks Task >> '5fa9edf0-56c3-40e4-9327-47bf7764d28d', status 'finished' >> >> >> After a couple minutes all the domains are marked as active again and >> things continue, but vm-storage-ssd is still listed as the master domain. >> Any thoughts? >> >> This is on 4.3.10.4-1.el7 on CentOS 7. >> >> engine=# SELECT storage_name, storage_pool_id, storage, status FROM >> storage_pool_with_storage_domain ORDER BY storage_name; >> storage_name | storage_pool_id >> | storage | status >> >> -----------------------+--------------------------------------+----------------------------------------+-------- >> compute1-iscsi-ssd | f72ec125-69a1-4c1b-a5e1-313fcb70b6ff | >> yvUESE-yWUv-VIWL-qX90-aAq7-gK0I-EqppRL | 1 >> compute7-iscsi-ssd | f72ec125-69a1-4c1b-a5e1-313fcb70b6ff | >> 8ekHdv-u0RJ-B0FO-LUUK-wDWs-iaxb-sh3W3J | 1 >> export-domain-storage | f72ec125-69a1-4c1b-a5e1-313fcb70b6ff | >> d3932528-6844-481a-bfed-542872ace9e5 | 1 >> iso-storage | f72ec125-69a1-4c1b-a5e1-313fcb70b6ff | >> f800b7a6-6a0c-4560-8476-2f294412d87d | 1 >> vm-storage-7200rpm | f72ec125-69a1-4c1b-a5e1-313fcb70b6ff | >> a0bff472-1348-4302-a5c7-f1177efa45a9 | 1 >> vm-storage-ssd | f72ec125-69a1-4c1b-a5e1-313fcb70b6ff | >> 95acd9a4-a6fb-4208-80dd-1c53d6aacad0 | 1 >> vm-storage-ssd2 | f72ec125-69a1-4c1b-a5e1-313fcb70b6ff | >> 829d0600-c3f7-4dae-a749-d7f05c6a6ca4 | 1 >> (7 rows) >> >> Thanks, >> -Matthew >> -- >> _______________________________________________ >> Users mailing list -- users@ovirt.org >> To unsubscribe send an email to users-le...@ovirt.org >> Privacy Statement: https://www.ovirt.org/privacy-policy.html >> oVirt Code of Conduct: >> https://www.ovirt.org/community/about/community-guidelines/ >> List Archives: >> https://lists.ovirt.org/archives/list/users@ovirt.org/message/OXOXW6B2NWXOUGZV3OKO4OMDXVDJSQLZ/ >> > >
_______________________________________________ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/CNPYCHVVXD4SGSF5WB7F7LI3PYMSGHSH/