So just further to this I found a Red Hat bug about this exact issue:

https://bugzilla.redhat.com/show_bug.cgi?id=570373

And for me it works perfectly if the dom0 is fenced using fence_node on the
command line. However, if the host becomes unavailable then it is not
fenced, and from reading the fenced man page it seems this is because there
isn't a shared resource like clvm or gfs, so therefore the cluster doesn't
see a need to fence the host. This means subsequent fence_xvm commands fail.

I guess I need to find a way to force fenced to operate without clvm and
fence dom0s?

Joel

On Fri, Oct 1, 2010 at 3:12 PM, Joel Heenan <[email protected]> wrote:

> I'm conducting some HA testing and I found that if I destroy a dom0, the
> other guest nodes (running on separate dom0s) all try to fence the guest
> that has disappeared but they fail because they can't reach fence_xvmd (the
> dom0 is down). They sit there trying to fence forever and never make any
> progress until the dom0 has rebooted and the guest is back up. This is a lot
> of downtime.
>
> What I tried was to make a secondary fence_manual after the fence_xen but
> the problem is that it then requires me to manuall acknowledge fencing.
>
> What I would like is "try fencing, if it fails don't worry just continue as
> if it succeeded"? Is there an option for this?
>
> Thanks
>
> Joel
>
--
Linux-cluster mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/linux-cluster

Reply via email to