Re: [Linux-cluster] rgmanager gets stuck on shutdown, if no services are running on its node.

2010-12-08 Thread Jankowski, Chris
Lon, I think that I got to the bottom of the problem: If there are *no* services running on a node and you issue "shutdown -h now" on the node, then when it comes to shutting down rgmanger, it executes the following sequence: 1. Outputs "Shutting down" message to /var/adm/messages 2. Waits for

Re: [Linux-cluster] How do I implement an unmount only filesystem resource agent

2010-12-08 Thread Jankowski, Chris
Lon, Thank you for your suggestion. In the meantime, I developed a script to do the unmount of a snapshot on stop and configured it as an additional resource agent of the type script. This works very well. Regards, Chris Jankowski -Original Message- From: linux-cluster-boun...@redhat

Re: [Linux-cluster] rgmanager gets stuck on shutdown, if no services are running on its node.

2010-12-08 Thread Jankowski, Chris
Lon, The problem is reproducible at will. I do have access to the system after the "shutdown -h now" command is issued and rgmanager blocks. I have gdb installed, but I do not know how to obtain rgmanager-debuginfo. The system is on an isolated network and I pointed you to an on-disk repository

Re: [Linux-cluster] Heuristics for quorum disk used as a tiebreaker in a two node cluster.

2010-12-08 Thread Jankowski, Chris
Lon, Thank you for your suggestions. 1. I like very much your idea of having additional fencing agent (called as the first one in the chain) with delay dependent on the presence of the service on the node. I understand the code. What I do not know is what are the steps in adding my own fenc

Re: [Linux-cluster] Heuristics for quorum disk used as a tiebreaker in a two node cluster.

2010-12-08 Thread Lon Hohberger
On Fri, 2010-12-03 at 10:10 +, Jankowski, Chris wrote: > This is exactly what I would like to achieve. I know which node > should stay alive - the one running my service, and it is trivial for > me to find this out directly, as I can query for its status locally on > a node. I do not have use

Re: [Linux-cluster] How do I implement an unmount only filesystem resource agent

2010-12-08 Thread Lon Hohberger
On Mon, 2010-12-06 at 12:27 +, Jankowski, Chris wrote: > > To have a foolproof HA service I need to: > > * Check, if the snapshot filesystem is mounted > * If it is, all processes running in it need to be killed > * Then the snapshot filesystem needs to be unmounted. >

Re: [Linux-cluster] rgmanager gets stuck on shutdown, if no services are running on its node.

2010-12-08 Thread Lon Hohberger
On Wed, 2010-12-08 at 03:11 +, Jankowski, Chris wrote: > Hi, > > I configured a cluster of 2 RHEL6 nodes. > The cluster has only one HA service defined. > > I have a problem with rgmanager getting stuck on shutdown when certain > set of conditions are met. The details follow. > > 1. >