So just further to this I found a Red Hat bug about this exact issue: https://bugzilla.redhat.com/show_bug.cgi?id=570373
And for me it works perfectly if the dom0 is fenced using fence_node on the command line. However, if the host becomes unavailable then it is not fenced, and from reading the fenced man page it seems this is because there isn't a shared resource like clvm or gfs, so therefore the cluster doesn't see a need to fence the host. This means subsequent fence_xvm commands fail. I guess I need to find a way to force fenced to operate without clvm and fence dom0s? Joel On Fri, Oct 1, 2010 at 3:12 PM, Joel Heenan <[email protected]> wrote: > I'm conducting some HA testing and I found that if I destroy a dom0, the > other guest nodes (running on separate dom0s) all try to fence the guest > that has disappeared but they fail because they can't reach fence_xvmd (the > dom0 is down). They sit there trying to fence forever and never make any > progress until the dom0 has rebooted and the guest is back up. This is a lot > of downtime. > > What I tried was to make a secondary fence_manual after the fence_xen but > the problem is that it then requires me to manuall acknowledge fencing. > > What I would like is "try fencing, if it fails don't worry just continue as > if it succeeded"? Is there an option for this? > > Thanks > > Joel >
-- Linux-cluster mailing list [email protected] https://www.redhat.com/mailman/listinfo/linux-cluster
