On Fri, 2019-06-14 at 18:27 +0200, Lentes, Bernd wrote: > Hi, > > i had that problem already once but still it's not clear for me what > really happens. > I had this problem some days ago: > I have a 2-node cluster with several virtual domains as resources. I > put one node (ha-idg-2) into standby, and two running virtual domains > were migrated to the other node (ha-idg-1). The other virtual domains > were already running on ha-idg-1. > Since then the two virtual domains which migrated (vm_idcc_devel and > vm_severin) start or stop every 15 minutes on ha-idg-1. > ha-idg-2 resides in standby. > I know that the 15 minutes interval is related to the "cluster- > recheck-interval". > But why are these two domains started and stopped ? > I looked around much in the logs, checked the pe-input files, watched > some graphs created by crm_simulate with dotty ... > I always see that the domains are started and 15 minutes later > stopped and 15 minutes later started ... > but i don't see WHY. I would really like to know that. > And why are the domains not started from the monitor resource > operation ? It should recognize that the domain is stopped and starts > it again. My monitor interval is 30 seconds. > I had two errors pending concerning these domains, a failed migrate > from ha-idg-1 to ha-idg-2, form some time before. > Could that be the culprit ? > > I still have all the logs from that time, if you need information > just let me know.
Yes the logs and pe-input files would be helpful. It sounds like a bug in the scheduler. What version of pacemaker are you running? > > Thanks. > > > Bernd -- Ken Gaillot <kgail...@redhat.com> _______________________________________________ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/