Re: [Linux-HA] (Possible) bug somewhere in Linux-HA

2007-04-05 Thread Michael Schwartzkopff
Am Donnerstag, 5. April 2007 06:22 schrieb Alan Robertson: > Michael Schwartzkopff wrote: > > Hi, > > > > I am writing an extension to IPaddr2 and this script is getting > > sometimes a wrong OCF_RESKEY_clusterip_hash. > > > > How to repeat: > > > > 1) Define a clone IPaddr2 ocf ressource. > > 2) T

Re: [Linux-HA] How to make a colocation rule between a Master/Slave resource's Master and another resource?

2007-04-05 Thread Andrew Beekhof
On 4/5/07, Alan Robertson <[EMAIL PROTECTED]> wrote: Andrew Beekhof wrote: > > On Mar 20, 2007 at 4:37 PM Alan Robertson <[EMAIL PROTECTED]> wrote: > >> >> Andrew Beekhof wrote: >>> On 3/18/07, Alan Robertson <[EMAIL PROTECTED]> wrote: Lars Marowsky-Bree wrote: > On 2007-03-16T10:38:25,

Re: [Linux-HA] Heartbeat compatibility question

2007-04-05 Thread Patrick Begou
Just to give some final info on this thread: - Heartbeat version 2.0.8 do not works with heartbeat version 1.2.x. i had to install two identical versions on the two nodes (downgrading FC6 official version) - version 1.2.x works without major problems between a node Fedora Core 6 X86_64 and a no

Re: [Linux-HA] Heartbeat compatibility question

2007-04-05 Thread Andrew Beekhof
On 4/5/07, Patrick Begou <[EMAIL PROTECTED]> wrote: Just to give some final info on this thread: - Heartbeat version 2.0.8 do not works with heartbeat version 1.2.x. i had to install two identical versions on the two nodes (downgrading FC6 official version) in what way does it not work? - ver

Re: [Linux-HA] Heartbeat compatibility question

2007-04-05 Thread Dejan Muhamedagic
On Thu, Apr 05, 2007 at 10:25:28AM +0200, Patrick Begou wrote: > Just to give some final info on this thread: > - Heartbeat version 2.0.8 do not works with heartbeat version 1.2.x. i > had to install two identical versions on the two nodes (downgrading FC6 > official version) Interesting. I wond

Re: [Linux-HA] cib.xml races on initialization

2007-04-05 Thread Andrew Beekhof
btw. you do know that you could have replaced the current configuration _without_ stopping the cluster at all right? editing or otherwise modifying the cib.xml file is _highly_ discouraged On 4/4/07, Andrew Beekhof <[EMAIL PROTECTED]> wrote: On 4/4/07, Bernd Schubert <[EMAIL PROTECTED]> wrote:

[Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Michael Schwartzkopff
Hi, Can a RA script know if the clone resource has set ordered=true or interleave=true? Is this information somewhere set in a variable, like the OCF_RESKEY_CRM_meta_clone_max for the information about maximum number of clones in a resource? Thanks. -- Dr. Michael Schwartzkopff MultiNET Servi

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Andrew Beekhof
On 4/5/07, Michael Schwartzkopff <[EMAIL PROTECTED]> wrote: Hi, Can a RA script know if the clone resource has set ordered=true or interleave=true? Is this information somewhere set in a variable, like the OCF_RESKEY_CRM_meta_clone_max for the information about maximum number of clones in a reso

Re: [Linux-HA] STONITH in response to stop failures (suicide or ssh)

2007-04-05 Thread Christophe Zwecker
Hi Dave, its this: grep mw-test /etc/ha.d/ha.cf nodemw-test-n1.i-dis.net nodemw-test-n2.i-dis.net [EMAIL PROTECTED] ~]# uname -n mw-test-n2.i-dis.net Dave Blaschke wrote: Christophe Zwecker wrote: after trying it with fence I got the following, looks like stonith wanted to reset no

Re: [Linux-HA] STONITH in response to stop failures (suicide or ssh)

2007-04-05 Thread Dave Blaschke
Christophe Zwecker wrote: Hi Dave, its this: grep mw-test /etc/ha.d/ha.cf nodemw-test-n1.i-dis.net nodemw-test-n2.i-dis.net [EMAIL PROTECTED] ~]# uname -n mw-test-n2.i-dis.net And your cib.xml? Dave Blaschke wrote: Christophe Zwecker wrote: after trying it with fence I got the fo

Re: [Linux-HA] Heartbeat compatibility question

2007-04-05 Thread David Lee
On Thu, 5 Apr 2007, Dejan Muhamedagic wrote: > On Thu, Apr 05, 2007 at 10:25:28AM +0200, Patrick Begou wrote: > > > - version 1.2.x works without major problems between a node Fedora Core > > 6 X86_64 and a node Debian Sarge AMD64. > > Good. heartbeat should run in a mix of platforms. I think I ca

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Michael Schwartzkopff
Am Donnerstag, 5. April 2007 13:52 schrieb Andrew Beekhof: > On 4/5/07, Michael Schwartzkopff <[EMAIL PROTECTED]> wrote: > > Hi, > > > > Can a RA script know if the clone resource has set ordered=true or > > interleave=true? Is this information somewhere set in a variable, like > > the OCF_RESKEY_C

Re: [Linux-HA] Heartbeat compatibility question

2007-04-05 Thread Patrick Begou
Dejan Muhamedagic wrote: On Thu, Apr 05, 2007 at 10:25:28AM +0200, Patrick Begou wrote: Just to give some final info on this thread: - Heartbeat version 2.0.8 do not works with heartbeat version 1.2.x. i had to install two identical versions on the two nodes (downgrading FC6 official version)

Re: [Linux-HA] cib.xml races on initialization

2007-04-05 Thread Bernd Schubert
On Thursday 05 April 2007 13:49:08 Andrew Beekhof wrote: > btw. you do know that you could have replaced the current > configuration _without_ stopping the cluster at all right? Yeah I know, we just have a reset-cluster script, which is presently stopping the cluster to load a new cib.xml. I thin

Re: [Linux-HA] Masters take long time to get back the ip from slave

2007-04-05 Thread Austin Rock
Thanks Alan. Pls don`t say forgive me. The fact is fact. Now i will try to manage myself my linux prob. only for ha related prob. i will post. Thanks a lot for your time and help.. Thanks a lot alan. I will read the link send by you and reply you. Thanks once again. On 4/5/07, Alan Rober

Re: [Linux-HA] How to make a colocation rule between a Master/Slave resource's Master and another resource?

2007-04-05 Thread Alan Robertson
Andrew Beekhof wrote: > On 4/5/07, Alan Robertson <[EMAIL PROTECTED]> wrote: >> Andrew Beekhof wrote: >> > >> > On Mar 20, 2007 at 4:37 PM Alan Robertson <[EMAIL PROTECTED]> wrote: >> > >> >> >> >> Andrew Beekhof wrote: >> >>> On 3/18/07, Alan Robertson <[EMAIL PROTECTED]> wrote: >> Lars Marow

Re: [Linux-HA] Heartbeat compatibility question

2007-04-05 Thread Alan Robertson
Patrick Begou wrote: > Just to give some final info on this thread: > - Heartbeat version 2.0.8 do not works with heartbeat version 1.2.x. i > had to install two identical versions on the two nodes (downgrading FC6 > official version) I've tested this in the past. Without logs, I can't comment on

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Alan Robertson
Michael Schwartzkopff wrote: > Am Donnerstag, 5. April 2007 13:52 schrieb Andrew Beekhof: >> On 4/5/07, Michael Schwartzkopff <[EMAIL PROTECTED]> wrote: >>> Hi, >>> >>> Can a RA script know if the clone resource has set ordered=true or >>> interleave=true? Is this information somewhere set in a var

Re: [Linux-HA] STONITH in response to stop failures (suicide or ssh)

2007-04-05 Thread Christophe Zwecker
Dave Blaschke wrote: Christophe Zwecker wrote: Hi Dave, its this: grep mw-test /etc/ha.d/ha.cf nodemw-test-n1.i-dis.net nodemw-test-n2.i-dis.net [EMAIL PROTECTED] ~]# uname -n mw-test-n2.i-dis.net And your cib.xml? grep mw-test /var/lib/heartbeat/crm/cib.xml id="5b1a3c52

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Michael Schwartzkopff
Am Donnerstag, 5. April 2007 15:35 schrieb Alan Robertson: > Michael Schwartzkopff wrote: > > Am Donnerstag, 5. April 2007 13:52 schrieb Andrew Beekhof: > >> On 4/5/07, Michael Schwartzkopff <[EMAIL PROTECTED]> wrote: > >>> Hi, > >>> > >>> Can a RA script know if the clone resource has set ordered=

Re: [Linux-HA] STONITH in response to stop failures (suicide or ssh)

2007-04-05 Thread Alan Robertson
Christophe Zwecker wrote: > Dave Blaschke wrote: >> Christophe Zwecker wrote: >>> Hi Dave, >>> >>> its this: >>> >>> grep mw-test /etc/ha.d/ha.cf >>> nodemw-test-n1.i-dis.net >>> nodemw-test-n2.i-dis.net >>> >>> [EMAIL PROTECTED] ~]# uname -n >>> mw-test-n2.i-dis.net >>> >> And your cib.xml

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Carson Gaspar
Michael Schwartzkopff wrote: Am Donnerstag, 5. April 2007 15:35 schrieb Alan Robertson: So, at the moment, you really just want to make sure you're configured correctly. Is that right? Yes. My only comment on this is that if having two copies of your resource agent running at once causes

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Alan Robertson
Carson Gaspar wrote: > Michael Schwartzkopff wrote: >> Am Donnerstag, 5. April 2007 15:35 schrieb Alan Robertson: > >>> So, at the moment, you really just want to make sure you're configured >>> correctly. Is that right? >> >> Yes. > > My only comment on this is that if having two copies of your

Re: [Linux-HA] STONITH in response to stop failures (suicide or ssh)

2007-04-05 Thread Dave Blaschke
Christophe Zwecker wrote: Dave Blaschke wrote: Christophe Zwecker wrote: Hi Dave, its this: grep mw-test /etc/ha.d/ha.cf nodemw-test-n1.i-dis.net nodemw-test-n2.i-dis.net [EMAIL PROTECTED] ~]# uname -n mw-test-n2.i-dis.net And your cib.xml? grep mw-test /var/lib/heartbeat/crm/

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Carson Gaspar
Alan Robertson wrote: Carson Gaspar wrote: My only comment on this is that if having two copies of your resource agent running at once causes serious problems, you need to _strongly_ consider re-writing you agent to have sufficient locking / atomicity. Or it will come back to bite you some day

Re: [Linux-HA] OCF_RESKEY_interval

2007-04-05 Thread Bernd Schubert
On Wednesday 04 April 2007 23:23:52 Alan Robertson wrote: > Bernd Schubert wrote: > > Hi, > > > > after upgrading from heartbeat-2.0.5 to heartbeat-2.0.8 > > OCF_RESKEY_interval interval is not set anymore, which makes our > > monitoring actions to always return ${OCF_NOT_RUNNING}. > > > > As given

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Michael Schwartzkopff
Carson Gaspar schrieb: > My only comment on this is that if having two copies of your resource > agent running at once causes serious problems, you need to _strongly_ > consider re-writing you agent to have sufficient locking / atomicity. > Or it will come back to bite you some day... > Hi, have

Re: [Linux-HA] OCF_RESKEY_interval

2007-04-05 Thread Alan Robertson
Bernd Schubert wrote: > On Wednesday 04 April 2007 23:23:52 Alan Robertson wrote: >> Bernd Schubert wrote: >>> Hi, >>> >>> after upgrading from heartbeat-2.0.5 to heartbeat-2.0.8 >>> OCF_RESKEY_interval interval is not set anymore, which makes our >>> monitoring actions to always return ${OCF_NOT_R

Re: [Linux-HA] Can a RA know if a clone resource is ordered or interleave is true?

2007-04-05 Thread Alan Robertson
Carson Gaspar wrote: > Alan Robertson wrote: >> Carson Gaspar wrote: > >>> My only comment on this is that if having two copies of your resource >>> agent running at once causes serious problems, you need to _strongly_ >>> consider re-writing you agent to have sufficient locking / atomicity. Or >>