Re: [Pacemaker] crm command line tool problem

2009-05-22 Thread Karl Katzke
Joe, The way I'd fix this problem would be to change the stonith action to poweroff, then I would bring each machine up in single user mode and make sure that you won't automatically join the cluster when the machine comes up (chkconfig openais off in my system, ymmv.) Then you can bring it up

Re: [Pacemaker] cibadmin -E only temporarily erasing config

2009-05-22 Thread Joe Armstrong
If you shutdown hearbeat/openais then you can clear out the /var/lib/hearbeat/crm directory to *really* remove your cib. I'm not sure what would happen if you cleared out the directory without shutting down... When you restart heartbeat/openais you will have only cluster nodes. If you want to

[Pacemaker] cibadmin -E only temporarily erasing config

2009-05-22 Thread Jason Woodward
Hello, I am trying to get my config working, but have gone down the wrong path. Unfortunately, I can't erase my config and start over. I try cibadmin -E --force, which seems to work for about 10 seconds. cibadmin -Q shows a minimal config with just my nodes, but then it jumps right back to

Re: [Pacemaker] OpenAIS vs. Corosync

2009-05-22 Thread Andrew Beekhof
On Fri, May 22, 2009 at 7:03 PM, Steven Dake wrote: > On Fri, 2009-05-22 at 11:42 -0500, Karl Katzke wrote: >> > Karl, >> > >> > we are closing in on the final days of openais and corosync versions >> > 1.0. >> > >> > Regards >> > -steve >> >> So? When will we see some stable, well-documented conf

Re: [Pacemaker] cib still leaks in pacemaker-1.0.3

2009-05-22 Thread Andrew Beekhof
And you'll also want this patch for the crmd diff -r 4619c842d58c crmd/callbacks.c --- a/crmd/callbacks.c Fri May 22 16:52:14 2009 +0200 +++ b/crmd/callbacks.c Fri May 22 21:34:12 2009 +0200 @@ -179,7 +179,6 @@ crmd_ha_msg_callback(HA_Message *hamsg, } else { crmd_ha_msg_fil

Re: [Pacemaker] OpenAIS vs. Corosync

2009-05-22 Thread Steven Dake
On Fri, 2009-05-22 at 11:42 -0500, Karl Katzke wrote: > > Karl, > > > > we are closing in on the final days of openais and corosync versions > > 1.0. > > > > Regards > > -steve > > So? When will we see some stable, well-documented configurations that aren't > supported and tested by o

Re: [Pacemaker] OpenAIS vs. Corosync

2009-05-22 Thread Karl Katzke
> Karl, > > we are closing in on the final days of openais and corosync versions > 1.0. > > Regards > -steve So? When will we see some stable, well-documented configurations that aren't supported and tested by only one specific vendor? -K --- Karl Katzke Systems Analyst II TAMU -

Re: [Pacemaker] OpenAIS vs. Corosync

2009-05-22 Thread Steven Dake
On Fri, 2009-05-22 at 09:27 -0500, Karl Katzke wrote: > > > Well, I'm still not sure what the state of OpenAIS/CoroSync is, but seem > > > to > > > encounter only more problems the farther I dig in to it. > > > > This is quite unsurprising. > > The CoroSync API has changed almost daily for t

[Pacemaker] crm command line tool problem

2009-05-22 Thread Joe Armstrong
Hi All, I am playing around with the crm command line tool to create an HA config for pacemaker and am bumping into a problem. If I have a configuration running already, 3-node with ip & httpd (pretty simple) and I want to create a new configuration according to the "CRM CLI" document I should

Re: [Pacemaker] OpenAIS vs. Corosync

2009-05-22 Thread Karl Katzke
> > Well, I'm still not sure what the state of OpenAIS/CoroSync is, but seem to > > encounter only more problems the farther I dig in to it. > > This is quite unsurprising. > The CoroSync API has changed almost daily for the last few months. > > If you really want to use CoroSync, I'd sugg

Re: [Pacemaker] DRBD primary/primary

2009-05-22 Thread Nicholas Dronen
Hi, Florian: I don't think that suggestion appeals to me. :-) Any idea how much work it is to change the OCF RA to reliably support master_max=2? Regards, Nick On Fri, May 22, 2009 at 2:48 AM, Florian Haas wrote: > On 05/21/2009 11:12 PM, Nicholas Dronen wrote: > > Hi: > > > > The wiki says

Re: [Pacemaker] trigger STONITH for testing purposes

2009-05-22 Thread Andrew Beekhof
On Wed, May 20, 2009 at 6:39 PM, Bob Haxo wrote: > Hi Andrew, > > I'd say you removed no-quorum-policy=ignore > > Actually, the pair of no_quorum_policy and no-quorum-policy are set to > "ignore", and expected-quorum-votes is set to "2": > >   >     >   ... >   name="expected-quorum-vot

Re: [Pacemaker] globally-unique clone question

2009-05-22 Thread Andrew Beekhof
On Thu, May 21, 2009 at 5:04 PM, Joe Armstrong wrote: > Hi All, > > I am a little confused about globally-unique clones, since there can be no > instance attributes for a clone how do you tell each clone that it is unique ? The globally-unique=true option. Then, inside your resource agent, you n

Re: [Pacemaker] globally-unique clone question

2009-05-22 Thread Andrew Beekhof
The idea behind unique vs. non-unique is best illustrated by example. Take a CLUSTERIP resource... Based on some criteria (usually the source address), it allocates all requests into a bucket from 0..(N-1), where N ::= clone-max So when we ask "is the resource running here", we're really asking:

Re: [Pacemaker] OpenAIS vs. Corosync

2009-05-22 Thread Andrew Beekhof
On Fri, May 22, 2009 at 12:39 AM, Ryan Steele wrote: > Well, I'm still not sure what the state of OpenAIS/CoroSync is, but seem to > encounter only more problems the farther I dig in to it.  Here are the > versions of the software packages I'm using: > > corosync            0.92-0ubuntu3 > libcoro

Re: [Pacemaker] New fix for System Health

2009-05-22 Thread Andrew Beekhof
On Thu, May 21, 2009 at 9:22 PM, Mark Hamzy wrote: > I caught a bug in the code (first diff in the patch). Nod. I found that one too :) > I added some testcases. At least I hoped that I added them in the appriate > way. Yep. Thanks for those, I'll add them today. > I noticed that the source

Re: [Pacemaker] DRBD primary/primary

2009-05-22 Thread Florian Haas
On 05/21/2009 11:12 PM, Nicholas Dronen wrote: > Hi: > > The wiki says that DRBD primary/primary isn't supported yet: > > http://clusterlabs.org/wiki/DRBD_HowTo_1.0 > > Is there a workaround? lmb and Dominik are probably going to hit me over the head with a 2x4 for suggesting this, but unti