Re: [Linux-HA] Setup Q

2007-12-06 Thread Andrew Beekhof
On Dec 6, 2007, at 1:49 AM, Rafe Slattery wrote: thanks for the suggestion... It helped ... a bit... but I'm still not quite there... Can I get some clarificaton on what location constraints I should use. I want an active/passive combo. 1 web to be running web services , 1 passive. 1 db to ru

Re: [Linux-HA] about unmanaged resource

2007-12-06 Thread Andrew Beekhof
On Dec 3, 2007, at 1:08 PM, Dejan Muhamedagic wrote: Hi, On Fri, Nov 30, 2007 at 03:03:39PM -0800, En Zhu wrote: Hello! I'm very new with heartbeat. I just tried to create a resource with ipaddr2 as the instruction from: http://linux-ha.org/Education/Newbie/IPaddrScreencast However, the re

Re: [Linux-HA] Pingd

2007-12-06 Thread Andrew Beekhof
On Dec 5, 2007, at 6:38 PM, China wrote: Hi, I've configured two machine with Linux-ha in Active/passive mode. With v1 conf all is ok, instead with v2 conf there are the following behaviours: PC_A up with resource and PC_B up without resource | V shu

Re: [Linux-HA] Please Any Help!? STONITH suicide device

2007-12-06 Thread Andrew Beekhof
On Dec 5, 2007, at 10:19 PM, Rois Cannon wrote: Michael / all, Any chance you figured out how to keep ha from going back to node1 without human intervention? Rois This is my dilemma: I'm building a simple 2 node cluster, node1 and node2. HA works fine so far. If I issue "network stop" o

Re: [Linux-HA] Double node configuration with mixed ressource

2007-12-06 Thread Andrew Beekhof
On Dec 5, 2007, at 1:44 PM, Dejan Muhamedagic wrote: Hi, On Wed, Dec 05, 2007 at 11:51:20AM +0100, Franck Ganachaud wrote: Well I don't want hearbeat to stop or start mysql. You should be better off if you do. Otherwise, you'll probably end up with an unmaintainable and complex configuratio

Re: [Linux-HA] Newbie Questions on Heartbeat Startup

2007-12-06 Thread Andrew Beekhof
On Nov 30, 2007, at 7:39 PM, Art Age Software wrote: Hi, I'm setting up my first heartbeat cluster. (I have managed one in the past, but never set one up from scratch before.) It is going well, but I have a few questions: 1) In the log, the following sometimes appears during initial heartbeat

Re: [Linux-HA] drbd cluster?????

2007-12-06 Thread Andrew Beekhof
On Nov 30, 2007, at 5:59 PM, Remigiusz Stachura wrote: Hi, I have created a simple two-node cluster with 2 multi-state drbd resources on each node. I use DRBD 7.x and the most fresh HA 2.1.2-24 working on SLES10 SP1. The only rules for my cluster I need are: - resources must be promoted to mast

[Linux-HA] suicide behavior for each process

2007-12-06 Thread Junko IKEDA
Hi, I found something rule like this; When the following process was killed, the system would reboot. * ccm * cib * lrmd * crmd * pengine * tengine These processes would be restarted when they are killed. * FIFO * media (ex. write/read bcast) * stonithd * attrd * mgmtd * respawn (ex. pingd) If m

[Linux-HA] handling clone instances

2007-12-06 Thread Junko IKEDA
Hi, Clone resource have some instances like, clone:0, or :1. Is it possible to handle them as one resource about a fail count on the same node? I mean, if clone:0 fails on node_a, I want to up a fail count for clone:1/node_a at the same time. or, is there any good idea to work out the above behavi

Re: [Linux-HA] handling clone instances

2007-12-06 Thread Andrew Beekhof
On Dec 6, 2007, at 9:54 AM, Junko IKEDA wrote: Hi, Clone resource have some instances like, clone:0, or :1. Is it possible to handle them as one resource about a fail count on the same node? I mean, if clone:0 fails on node_a, I want to up a fail count for clone:1/node_a at the same time. o

Re: [Linux-HA] Pingd

2007-12-06 Thread China
Ok, now it works, but when the PC_A returns up the resource doesn't remains on PC_B and failback to PC_A. How can I configure to switch the first time to PC_B on PC_A failover, but not return back if PC_A returns UP? Thanks On Dec 6, 2007 9:14 AM, Andrew Beekhof <[EMAIL PROTECTED]> wrote: > > On

Re: [Linux-HA] Pingd

2007-12-06 Thread Dominik Klein
China wrote: Ok, now it works, but when the PC_A returns up the resource doesn't remains on PC_B and failback to PC_A. How can I configure to switch the first time to PC_B on PC_A failover, but not return back if PC_A returns UP? Set resource stickiness to a reasonable value. Here's roughly ho

Re: [Linux-HA] Pingd

2007-12-06 Thread China
Ok, I've set resource_stickiness to 150, a score of 100 to the default node PC_A and a score_attribute for pingd. Now the resource when fail doesn't start on PC_B. Why? On Dec 6, 2007 11:17 AM, Dominik Klein <[EMAIL PROTECTED]> wrote: > China wrote: > > Ok, now it works, but when the PC_A returns

Re: [Linux-HA] Pingd

2007-12-06 Thread Dominik Klein
China wrote: Ok, I've set resource_stickiness to 150, a score of 100 to the default node PC_A and a score_attribute for pingd. Now the resource when fail doesn't start on PC_B. Why? The way I understand you, and please correct me or post your current cib.xml, is: pingd multiplier: 200 (as su

RE: [Linux-HA] Setup Q

2007-12-06 Thread Rafe Slattery
Got it. Thank you very much. Rafe -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Andrew Beekhof Sent: 06 December 2007 08:12 To: General Linux-HA mailing list Subject: Re: [Linux-HA] Setup Q On Dec 6, 2007,

Re: [Linux-HA] Newbie Questions on Heartbeat Startup

2007-12-06 Thread Dejan Muhamedagic
Hi, I'm sure that I replied to this one, but... On Thu, Dec 06, 2007 at 09:32:20AM +0100, Andrew Beekhof wrote: > > On Nov 30, 2007, at 7:39 PM, Art Age Software wrote: > >> Hi, >> >> I'm setting up my first heartbeat cluster. (I have managed one in the >> past, but never set one up from scratch

Re: [Linux-HA] heartbeat 1.2.5 on CentOS 5?

2007-12-06 Thread Dejan Muhamedagic
Hi, On Thu, Dec 06, 2007 at 05:10:28PM +1100, Amos Shapira wrote: > On 05/12/2007, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote: > > Hi, > > > > On Wed, Dec 05, 2007 at 04:16:16AM +, Amos Shapira wrote: > > > Hello, > > > > > > Has anyone got Heartbeat 1.2.5 to compile and run on CentOS 5? > >

Re: [Linux-HA] Pingd

2007-12-06 Thread China
This is my new cib.xml:

Re: [Linux-HA] Setup Q

2007-12-06 Thread Dejan Muhamedagic
Hi, On Thu, Dec 06, 2007 at 12:49:17AM -, Rafe Slattery wrote: > thanks for the suggestion... > It helped ... a bit... but I'm still not quite there... > Can I get some clarificaton on what location constraints I should use. > I want an active/passive combo. 1 web to be running web services

[Linux-HA] stonith

2007-12-06 Thread Papp Tamas
hi All, There is a problem again. The cluster has two nodes, and heartbeat's version is 2.1.2, CentOS 5. This is the resource (this is the same for teszt2, both of them do not work): Constraints: I start the reso

[Linux-HA] Possible bug in Score calculation?

2007-12-06 Thread Dominik Klein
Hi sorry I have to bother again about score calculation but I came across something I don't understand and that might be a bug. I have a master-slave drbd resource called ms-drbd (primitive is called drbd2) and a group named testdb (4 primitives, "mount" being the first primitive). pingd mu

Re: [Linux-HA] Pingd

2007-12-06 Thread Dominik Klein
With this configuration the resources doesn't failover to test, but remains on test-ppc. Why? I can't say. The configuration looks good to me. But again: What are you doing to force the failure? Do you really have just one connection between the nodes and unplug that connection to force the

[Linux-HA] Re: [Linux-ha-dev] ANNOUNCE: Heartbeat Cluster Resource Manager Ported to OpenAIS

2007-12-06 Thread Lars Marowsky-Bree
On 2007-12-05T21:06:38, Andrew Beekhof <[EMAIL PROTECTED]> wrote: > Over the last few months, Red Hat and SUSE engineers have been working > together to port Heartbeat's powerful Cluster Resource Manager (CRM) to run > natively on top of OpenAIS. Credit where credit is due: this means you, Andr

Re: [Linux-HA] Pingd

2007-12-06 Thread China
Sorry, I forgot it! I've two connection for the PCs: one with crossover cable, where heartbeat send packets directly to other PC one through network, where the services listen and where pingd test connectivity When I force the failure I disconnect the network cable that give services from PC_A.

Re: [Linux-HA] Pingd

2007-12-06 Thread Dominik Klein
China wrote: Sorry, I forgot it! I've two connection for the PCs: one with crossover cable, where heartbeat send packets directly to other PC one through network, where the services listen and where pingd test connectivity When I force the failure I disconnect the network cable that give servi

Re: [Linux-HA] Pingd

2007-12-06 Thread Dominik Klein
But, It's good to use a interface both for heartbeat and for services? It's pretty common I think. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] Double node configuration with mixed ressource

2007-12-06 Thread Franck Ganachaud
Thanks for the tips both of you. I'm going to work on that the next few days. Andrew Beekhof a écrit : On Dec 5, 2007, at 1:44 PM, Dejan Muhamedagic wrote: Hi, On Wed, Dec 05, 2007 at 11:51:20AM +0100, Franck Ganachaud wrote: Well I don't want hearbeat to stop or start mysql. You should

Re: [Linux-HA] Call for testers: 2.1.3

2007-12-06 Thread Dejan Muhamedagic
Hi, On Thu, Dec 06, 2007 at 10:54:36AM +1100, Amos Shapira wrote: > On 06/12/2007, Alan Robertson <[EMAIL PROTECTED]> wrote: > > We are in the final weeks of testing for release 2.1.3 - which has been > > delayed to the week of Dec 19. > > Trying to do "make rpm" on CentOS 5 I get the following e

Re: [Linux-HA] Pingd

2007-12-06 Thread China
But, It's good to use a interface both for heartbeat and for services? And yes, If I pull the heartbeat's cable I'm in a split brain situation. But it's not my problem now :D On Dec 6, 2007 2:23 PM, Dominik Klein <[EMAIL PROTECTED]> wrote: > China wrote: > > Sorry, I forgot it! > > > > I've two

Re: [Linux-HA] Pingd

2007-12-06 Thread China
Last question: how can I see what is the node's score during cluster execution? On Dec 6, 2007 2:59 PM, Dominik Klein <[EMAIL PROTECTED]> wrote: > > But, It's good to use a interface both for heartbeat and for services? > > It's pretty common I think. > ___

Re: [Linux-HA] stonith

2007-12-06 Thread Dejan Muhamedagic
Hi, On Thu, Dec 06, 2007 at 01:11:37PM +0100, Papp Tamas wrote: > hi All, > > > There is a problem again. The cluster has two nodes, and heartbeat's > version is 2.1.2, CentOS 5. > > This is the resource (this is the same for teszt2, both of them do not > work): > > id="stonith_teszt1"> >

Re: [Linux-HA] suicide behavior for each process

2007-12-06 Thread Dejan Muhamedagic
Hi, On Thu, Dec 06, 2007 at 05:42:10PM +0900, Junko IKEDA wrote: > Hi, > > I found something rule like this; > When the following process was killed, the system would reboot. > * ccm > * cib > * lrmd > * crmd > * pengine > * tengine > > These processes would be restarted when they are killed. >

Re: [Linux-HA] Pingd

2007-12-06 Thread Dominik Klein
China wrote: Last question: how can I see what is the node's score during cluster execution? You can grep it out of the "ptest" output. Or use my script: http://lists.community.tummy.com/pipermail/linux-ha/2007-September/027488.html which has been updated by Robert Lindgren: http://lists.comm

Re: [Linux-HA] Call for testers: 2.1.3

2007-12-06 Thread Robert Wipfel
Hi, We've been using OpenWbem on SUSE, Heartbeat-cim will run with either, Pegasus is preferred. There is a new release that runs on SUSE - http://www.openpegasus.org/ Iirc the configure script tries to figure out which devel package is installed (pegasus-devel or openwbem-devel) Hth, Robert >

Re: [Linux-HA] Call for testers: 2.1.3

2007-12-06 Thread Dejan Muhamedagic
Hi, On Thu, Dec 06, 2007 at 07:43:44AM -0700, Robert Wipfel wrote: > Hi, > > We've been using OpenWbem on SUSE, > Heartbeat-cim will run with either, Pegasus is > preferred. There is a new release that runs > on SUSE - http://www.openpegasus.org/ > Iirc the configure script tries to figure out w

Re: [Linux-HA] Pingd

2007-12-06 Thread China
Thank you. I've found the right score with the script: pingd: +1000 PC_A: +100 resource_stickiness: +100 (I've 3 resources so make 300, not 100) Now the problem is that i didn't understood why these score is ok for failover and don't failback. And why with pingd score 500 is good for failback to

Re: [Linux-HA] suicide behavior for each process

2007-12-06 Thread Andrew Beekhof
On Dec 6, 2007, at 3:25 PM, Dejan Muhamedagic wrote: Hi, On Thu, Dec 06, 2007 at 05:42:10PM +0900, Junko IKEDA wrote: Hi, I found something rule like this; When the following process was killed, the system would reboot. * ccm * cib * lrmd * crmd * pengine * tengine These processes would be

Re: [Linux-HA] stonith

2007-12-06 Thread Dejan Muhamedagic
Hi, On Thu, Dec 06, 2007 at 07:05:49PM +0100, Papp Tamas wrote: > Dejan Muhamedagic wrote: >> The error you encountered is probably from the monitor operation >> (there's a monitor implied in the start). Did you try with >> stonith (the program): >> # stonith -t apcsmart ttydev=/dev/ttyS0 hostlist

Re: [Linux-HA] stonith

2007-12-06 Thread Papp Tamas
Dejan Muhamedagic wrote: The error you encountered is probably from the monitor operation (there's a monitor implied in the start). Did you try with stonith (the program): # stonith -t apcsmart ttydev=/dev/ttyS0 hostlist=teszt1 -S # stonith -t apcsmart ttydev=/dev/ttyS0 hostlist=teszt1 -l (per

[Linux-HA] HA rookie needs help

2007-12-06 Thread Radu Handorean
Hi, I'm new to Linux HA and I am already a bit puzzled by a few things. I have two AMD64 Opteron machines (same hardware config) running SuSE 10.2 and HA. I managed to configure HA using the GUI and all the wizards I managed to bump into. It seems to work. The thing is I look at it like at a bl

Re: [Linux-HA] stonith

2007-12-06 Thread Dejan Muhamedagic
On Thu, Dec 06, 2007 at 07:43:20PM +0100, Dejan Muhamedagic wrote: > Hi, > > On Thu, Dec 06, 2007 at 07:05:49PM +0100, Papp Tamas wrote: > > Dejan Muhamedagic wrote: > >> The error you encountered is probably from the monitor operation > >> (there's a monitor implied in the start). Did you try wit

Re: [Linux-HA] Pingd

2007-12-06 Thread Andrew Beekhof
On Dec 6, 2007, at 5:15 PM, China wrote: Thank you. I've found the right score with the script: pingd: +1000 PC_A: +100 resource_stickiness: +100 (I've 3 resources so make 300, not 100) Now the problem is that i didn't understood why these score is ok for failover and don't failback. becaus

Re: [Linux-HA] Possible bug in Score calculation?

2007-12-06 Thread Andrew Beekhof
On Dec 6, 2007, at 1:39 PM, Dominik Klein wrote: Hi sorry I have to bother again about score calculation but I came across something I don't understand and that might be a bug. I have a master-slave drbd resource called ms-drbd (primitive is called drbd2) and a group named testdb (4 prim

Re: [Linux-HA] heartbeat 1.2.5 on CentOS 5?

2007-12-06 Thread Amos Shapira
On 06/12/2007, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote: > Hi, > > On Thu, Dec 06, 2007 at 05:10:28PM +1100, Amos Shapira wrote: > > Would you be interested in the tiny diff's I had to make to the .spec > > file? Are there maintenance released for v1? > > Yes. I think that it was Horms maintaini

Re: [Linux-HA] Other node not found

2007-12-06 Thread Amos Shapira
On 05/12/2007, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote: > On Fri, Nov 30, 2007 at 05:16:38PM +1100, Amos Shapira wrote: > > On 30/11/2007, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote: > > > > > > Hi, > > > > > > On Thu, Nov 29, 2007 at 05:23:33PM +, Amos Shapira wrote: > > > > On 29/11/2007

Re: [Linux-HA] stonith

2007-12-06 Thread Papp Tamas
On Thu, Dec 06, 2007 at 08:07:03PM +0100, Dejan Muhamedagic wrote: > > This is also bad. Looks like it just checks the connection to the > > APC, but not if the given host is reachable. > > Er, very silly of me to say that, because such a device (UPS) > can't do that. Sorry for the confusion. What

Re: [Linux-HA] HA rookie needs help

2007-12-06 Thread Andrew Beekhof
On Dec 6, 2007, at 7:30 PM, Radu Handorean wrote: Hi, I'm new to Linux HA and I am already a bit puzzled by a few things. I have two AMD64 Opteron machines (same hardware config) running SuSE 10.2 and HA. I managed to configure HA using the GUI and all the wizards I managed to bump into.

Re: [Linux-HA] suicide behavior for each process

2007-12-06 Thread Alan Robertson
Junko IKEDA wrote: Hi, I found something rule like this; When the following process was killed, the system would reboot. * ccm * cib * lrmd * crmd * pengine * tengine These processes would be restarted when they are killed. * FIFO * media (ex. write/read bcast) * stonithd * attrd * mgmtd * resp

[Linux-HA] groups and colocation (lsb script and ip)

2007-12-06 Thread Jeff Humes
I have created a simple heartbeat cluster: 2 Centos 4.5 nodes HB version: heartbeat-pils-2.1.2-3.el4.centos heartbeat-stonith-2.1.2-3.el4.centos heartbeat-gui-2.1.2-3.el4.centos heartbeat-2.1.2-3.el4.centos Here is the issue I see, and I dont know what I am doing wrong. I have a simple group

Re: [Linux-HA] Possible bug in Score calculation?

2007-12-06 Thread Dominik Klein
Good morning Andrew sorry I have to bother again about score calculation but I came across something I don't understand and that might be a bug. I have a master-slave drbd resource called ms-drbd (primitive is called drbd2) and a group named testdb (4 primitives, "mount" being the first prim

Re: [Linux-HA] Possible bug in Score calculation?

2007-12-06 Thread Andrew Beekhof
On Dec 7, 2007, at 7:47 AM, Dominik Klein wrote: Good morning Andrew sorry I have to bother again about score calculation but I came across something I don't understand and that might be a bug. I have a master-slave drbd resource called ms-drbd (primitive is called drbd2) and a group nam

Re: [Linux-HA] Possible bug in Score calculation?

2007-12-06 Thread Dominik Klein
Just curious: I suppose its my first constraint that does this job? second - the colocation one Okay thanks - so sure I even got the 50/50 wrong :p Then I must ask another question: Why does this not apply to colocated primitives? I just tested with a single primitive (testdb) colocated to