Re: [Pacemaker] migration fix for ocf:heartbeat:Xen

Daugherity, Andrew W Tue, 23 Aug 2011 09:00:14 -0700

> Message: 7
> Date: Thu, 11 Aug 2011 21:07:00 +0000
> From: "Daugherity, Andrew W" <adaugher...@tamu.edu>
> To: "pacemaker@oss.clusterlabs.org" <pacemaker@oss.clusterlabs.org>
> Subject: [Pacemaker] migration fix for ocf:heartbeat:Xen
> Message-ID: <93b5e618-ad19-4993-8066-cb4f8e4ef...@tamu.edu>
> Content-Type: text/plain; charset="us-ascii"
> 
> I have discovered that sometimes when migrating a VM, the migration itself 
> will succeed, but the migrate_from call on the target node will fail, as 
> apparently the status hasn't settled down yet.  This is more likely to happen 
> when stopping pacemaker on a node, causing all its VMs to migrate away.  
> Migration succeeds, but then (sometimes) the status call in migrate_from 
> fails, and the VM is unnecessarily stopped and started.  Note that it is NOT 
> a timeout problem, as the migrate_from operation (which only checks status) 
> takes less than a second.
> 
> I noticed the VirtualDomain RA does a loop rather than just checking the 
> status once as the Xen RA does, so I patched a similar thing into the Xen RA, 
> and that solved my problem.
(patch/logs snipped)
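
Since the patch itself is snipped, here is a rough sketch of the general idea 
only: retry the status check a few times before declaring failure, instead of 
checking once.  This is NOT the actual patch.  The function name 
wait_for_status and its arguments are made up for illustration, and a real RA 
would pass its own status check as the command.

```shell
#!/bin/sh
# Hypothetical sketch, not the actual Xen RA patch.
# wait_for_status MAX_TRIES DELAY_SECONDS CMD [ARGS...]
# Runs CMD up to MAX_TRIES times, sleeping DELAY_SECONDS between attempts.
# Returns 0 as soon as CMD succeeds, 1 if every attempt fails.
wait_for_status() {
    max_tries=$1
    delay=$2
    shift 2
    try=1
    while [ "$try" -le "$max_tries" ]; do
        if "$@"; then
            return 0
        fi
        try=$((try + 1))
        # Only sleep if another attempt remains.
        if [ "$try" -le "$max_tries" ]; then
            sleep "$delay"
        fi
    done
    return 1
}
```

In migrate_from, the single status call would then become something like 
"wait_for_status 10 1 <status check>", with the try count and delay chosen to 
stay well inside the operation timeout.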



No comments?  What does it take to get this patch accepted?  I'd much rather 
use the mainline version than reapply my patch after every HAE update.  
I guess I could open an SR with Novell, but this is ultimately an upstream issue.



Andrew Daugherity
Systems Analyst
Division of Research, Texas A&M University
adaugher...@tamu.edu


_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
