[Linux-HA] OCF Script

2010-03-17 Thread Umakant Goyal
Hi All, Can someone give me an idea of how to write an OCF (Open Cluster Framework) script that monitors a running process on the system and reports its status to Heartbeat? Thanks in advance.
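As a rough illustration of the kind of script being asked about, here is a minimal sketch of the monitor action of an OCF-style resource agent in portable shell. The numeric exit codes (0 for running, 7 for not running, 1 for a generic error, 3 for an unimplemented action) are the ones defined by the OCF resource agent API. Everything else is an assumption made for the example: the pidfile path /var/run/mydaemon.pid, the function names monitor_process and ocf_main, and the choice to detect the process through a pidfile. A complete agent would also need start, stop and meta-data actions, which are left out here.

```sh
#!/bin/sh
# Hypothetical sketch of an OCF-style monitor action (not a complete agent).

# Exit codes defined by the OCF resource agent API.
OCF_SUCCESS=0
OCF_ERR_GENERIC=1
OCF_ERR_UNIMPLEMENTED=3
OCF_NOT_RUNNING=7

# Assumed pidfile location; a real agent would take this from a parameter.
PIDFILE="${PIDFILE:-/var/run/mydaemon.pid}"

# Report whether the process recorded in the given pidfile is alive.
monitor_process() {
    pidfile="$1"

    # No pidfile at all: treat the resource as cleanly stopped.
    [ -f "$pidfile" ] || return "$OCF_NOT_RUNNING"

    pid=$(cat "$pidfile" 2>/dev/null)

    # An empty or non-numeric pidfile is reported as a generic error.
    case "$pid" in
        ''|*[!0-9]*) return "$OCF_ERR_GENERIC" ;;
    esac

    # kill -0 delivers no signal; it only tests that the PID can be signalled.
    if kill -0 "$pid" 2>/dev/null; then
        return "$OCF_SUCCESS"
    fi
    return "$OCF_NOT_RUNNING"
}

# Dispatch on the action name that the cluster manager passes as $1.
ocf_main() {
    case "$1" in
        monitor|status) monitor_process "$PIDFILE" ;;
        *)              return "$OCF_ERR_UNIMPLEMENTED" ;;
    esac
}
```

In a real agent file the last lines would be something like `ocf_main "$@"` followed by `exit $?`, so that the exit status of the script is what the cluster manager sees. Note that kill -0 also fails when the caller lacks permission to signal the process, so this check assumes the agent runs as root, as resource agents normally do.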

Re: [Linux-HA] Resource colocation with a clone.

2010-03-17 Thread Andrew Beekhof
On Wed, Mar 17, 2010 at 6:44 PM, Michele Codutti wrote: > Hi all, Is it possible to constrain a resource to run only on nodes where an > instance of a particular clone is also running? > Example: I have a database-like application that I've set up as a clone to run > one instance on every node. Only one

Re: [Linux-HA] node2 wont stay up

2010-03-17 Thread Cameron Smith
A-HA! I see on node2: # netstat -lnup Active Internet connections (only servers) Proto Recv-Q Send-Q Local Address Foreign Address State PID/Program name udp 0 0 0.0.0.0:694 0.0.0.0:* 2214/rpc.statd What is statd and why is it running us

Re: [Linux-HA] node2 wont stay up

2010-03-17 Thread Cameron Smith
Thanks Andrew! Yes, I see that, but the two servers are identically configured, with the only differences being IP and hostname, so why am I not getting that error on node1? Which address is the error referring to? The main shared IP? I don't understand how to troubleshoot from that error. Cameron On W

Re: [Linux-HA] node2 wont stay up

2010-03-17 Thread Andrew Beekhof
I wonder if this might be related: Mar 17 21:46:50 node2 heartbeat: [5289]: ERROR: glib: Error binding socket (Address already in use). Retrying. On Wed, Mar 17, 2010 at 9:44 PM, Cameron Smith wrote: > Here is more info: > > In checking /var/log/messages: > > Mar 17 21:46:50 node2 heartbeat: [5

Re: [Linux-HA] node2 wont stay up

2010-03-17 Thread Cameron Smith
Here is more info: In checking /var/log/messages: Mar 17 21:46:50 node2 heartbeat: [5288]: info: Version 2 support: false Mar 17 21:46:50 node2 heartbeat: [5288]: WARN: Logging daemon is disabled --enabling logging daemon is recommended Mar 17 21:46:50 node2 heartbeat: [5288]: info: *

[Linux-HA] node2 wont stay up

2010-03-17 Thread Cameron Smith
I'm a brand new heartbeat user running my first test, and I have run into a problem. I installed heartbeat on node1 and node2. It works fine on node1 and shows a webpage on the eth0:0 IP, but heartbeat won't stay running on node2. When I start heartbeat I get: # service heartbeat start logd is already runn

[Linux-HA] Virtual Name and Samba clarification.

2010-03-17 Thread Bruce Leggett
Hi, I recently set up Heartbeat on two SLES 10.2 boxes. Between yast and hb_gui it was simple to get the cluster up and running with a virtual IP in my resource group. I have a couple of questions. I've been reading different resources and it's tough to get a grasp on all the parts to better underst

Re: [Linux-HA] IP Power 9258 stonith plugin

2010-03-17 Thread Helmut Weymann
Hi, everything is implemented and tested. README and script are updated. On Tuesday, 9 March 2010, Dejan Muhamedagic wrote: > Hi, > > On Mon, Mar 08, 2010 at 10:56:01PM +0100, Helmut Weymann wrote: > > Hi, > > > > On Monday, 8 March 2010, Dejan Muhamedagic wrote: > > > Hi, > > > > > > Sorry for t

[Linux-HA] Resource colocation with a clone.

2010-03-17 Thread Michele Codutti
Hi all, Is it possible to constrain a resource to run only on nodes where an instance of a particular clone is also running? Example: I have a database-like application that I've set up as a clone to run one instance on every node. Only one instance of this application can be queried; the other ones are

[Linux-HA] permission denied error starting dopd w/ heartbeat

2010-03-17 Thread Alex Dean
I've built heartbeat+pacemaker from source on RHEL5. glue-1.0.3.tar.bz2 agents-1.0.2.tar.bz2 STABLE-3.0.2.tar.bz2 (heartbeat) Pacemaker-1.0.5.tar.bz2 I've added this to /etc/ha.d/ha.cf: respawn hacluster /usr/lib64/heartbeat/dopd apiauth dopd gid=haclient uid=hacluster When I reload heartb

[Linux-HA] File-system resources still running on unplugged fibre channel

2010-03-17 Thread Tony Gan
Hi, I'm using heartbeat-3.0.0-33.2 and pacemaker-1.0.5-4.6 to create a two-node cluster. Both nodes are connected to a shared storage device over Fibre Channel through an FC switch. I am going to use the shared storage as my file system resource in the cluster. I can mount the file system succesf

[Linux-HA] Usage of Cluster-Testsuite

2010-03-17 Thread Andreas Mock
Hi all, I now have pacemaker/corosync running on two machines with openSuSE 11.2, and stonith agents are configured. So the base for a cluster is up and running. There were some postings about this combination having problems. Is there a way to run the cluster tests on this cluster? Probably someo

Re: [Linux-HA] OCF-RA and shell functions

2010-03-17 Thread Marian Marinov
On Wednesday 17 March 2010 12:16:58 Andreas Mock wrote: > Hi all, > > here are some questions regarding programming ocf-ra: > > a) Am I right that programming an RA as a portable shell script (no bashisms) > is preferred? Or are other (scripting) languages also welcome? I'm confident that no bashisms a

[Linux-HA] OCF-RA and shell functions

2010-03-17 Thread Andreas Mock
Hi all, here are some questions regarding programming ocf-ra: a) Am I right that programming an RA as a portable shell script (no bashisms) is preferred? Or are other (scripting) languages also welcome? b) I found RA scripts for daemons where the 'status' call checks for the existence of the daemon p

Re: [Linux-HA] Changing CIB Group Properties from the Command Line

2010-03-17 Thread Dejan Muhamedagic
Hi, On Tue, Mar 16, 2010 at 12:08:29PM -0700, Robinson, Eric wrote: > >> When you change or reorder group memberships like that (using either > >> crm configure or cibadmin) does it interrupt service? > >> > > > If you added a stopped resource before ones that were already started, > yes. > >