Re: [Linux-HA] A Node cannot rejoin the cluster after a rebuild

2009-05-11 Thread Andrew Beekhof
On Sat, May 9, 2009 at 12:56 AM, Jerome Yanga jya...@esri.com wrote:
Here is the scenario.
01) There are two nodes in the Active-Passive cluster--Nomen and Rubric.
02) Nomen had a hardware and software failure.
03) Rubric took over the resources as expected.
04) Due to the failures,

Re: [Linux-HA] [Pacemaker] new doc about stonith/fencing

2009-05-11 Thread Andrew Beekhof
On Mon, May 11, 2009 at 11:04 AM, Peter Kruse p...@q-leap.com wrote: Hi Andrew, Andrew Beekhof wrote: On Wed, May 6, 2009 at 10:13 AM, Peter Kruse p...@q-leap.com wrote: You are saying that it is okay that a single failure can bring the cluster into an unsolvable situation?  I thought SPoF

Re: [Linux-HA] Fw: Arp issues

2009-05-11 Thread Siakoulis Yiannis
Ryan, I'm doing the NAT out of the box on the ASA5505 firewall, so I think that rule is not necessary. David, the ASA is doing the NAT only for the VIP. The real IPs of the boxes don't have access to the outside world. In the ARP table of the ASA I can see both IPs (real/virtual) with the same mac

Re: [Linux-HA] [Pacemaker] new doc about stonith/fencing

2009-05-11 Thread Peter Kruse
Hi Andrew, Andrew Beekhof wrote: Any switch that shares power with the host(s) it controls clearly has a SPoF. You don't need me to tell you that. But that does not have to be a SPoF for the entire system! The problem here is that a single failure (power loss) causes not only one node to go

Re: [Linux-HA] [Pacemaker] new doc about stonith/fencing

2009-05-11 Thread Andrew Beekhof
On Mon, May 11, 2009 at 11:52 AM, Peter Kruse p...@q-leap.com wrote: Hi Andrew, Andrew Beekhof wrote: Any switch that shares power with the host(s) it controls clearly has a SPoF. You don't need me to tell you that. But that does not have to be a SPoF for the entire system! By default, yes

[Linux-HA] How to make LSB script Heartbeat compliant

2009-05-11 Thread Tobias Appel
Hi, I know there was a document on linux-ha.org but I just can't find it anymore. It showed what the exit codes should be and how to run scripts from the shell and print out their exit status to make them heartbeat compliant. Does anyone have a working link for this site please? I really

Re: [Linux-HA] How to make LSB script Heartbeat compliant

2009-05-11 Thread Michael Schwartzkopff
On Monday, 11 May 2009 14:01:38, Tobias Appel wrote: Hi, I know there was a document on linux-ha.org but I just can't find it anymore. It showed what the exit codes should be and how to run scripts from the shell and print out their exit status to make them heartbeat compliant. Does

Re: [Linux-HA] How to make LSB script Heartbeat compliant

2009-05-11 Thread Tobias Appel
Thanks - that was exactly the website I was looking for. Going to save it now right away :) Michael Schwartzkopff wrote: http://www.linux-ha.org/LSBResourceAgent ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org
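
The kind of check asked about in this thread (run the script's actions and compare the exit statuses) can be sketched roughly as follows. This is only a minimal sketch, not the procedure from the linked page: the names `LSB_SEQUENCE`, `run_init_script` and `check_lsb` are hypothetical, and the expected codes assume the usual LSB init script conventions (`status` returns 0 when running and 3 when stopped; `start` and `stop` succeed even if the service is already in the requested state).

```python
import subprocess
from typing import Callable, List, Tuple

# Assumed LSB expectations for a service that is stopped at the beginning.
# 0 = success / running, 3 = "program is not running".
LSB_SEQUENCE: List[Tuple[str, int]] = [
    ("status", 3),  # a stopped service must report 3
    ("start", 0),
    ("status", 0),  # a running service must report 0
    ("start", 0),   # starting an already running service still succeeds
    ("stop", 0),
    ("status", 3),
    ("stop", 0),    # stopping an already stopped service still succeeds
]


def run_init_script(path: str, action: str) -> int:
    """Run `<path> <action>` and return its exit status."""
    return subprocess.run([path, action]).returncode


def check_lsb(run: Callable[[str], int]) -> List[str]:
    """Walk through LSB_SEQUENCE and collect every unexpected exit status."""
    problems = []
    for step, (action, expected) in enumerate(LSB_SEQUENCE, 1):
        actual = run(action)
        if actual != expected:
            problems.append(
                f"step {step}: '{action}' returned {actual}, expected {expected}"
            )
    return problems
```

Against a real script it could be called as `check_lsb(lambda action: run_init_script("/etc/init.d/myservice", action))`, where the path is a placeholder; an empty list means no deviations were seen.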

Re: [Linux-HA] [Pacemaker] new doc about stonith/fencing

2009-05-11 Thread Karl Katzke
But that does not have to be a SPoF for the entire system! The problem here is that a single failure (power loss) causes not only one node to go down (and the pdu itself, yes), but the whole system stops working properly. Now you have to say that one has to equip the pdus with

[Linux-HA] Linux HA

2009-05-11 Thread Femi Soyemi
I am trying to install Oracle 11g on Linux HA, but there is no document to guide us through Linux cluster configuration. If you can give me a link to one, I would appreciate it. Thanks Femi

[Linux-HA] Problems With SLES11 + DRBD

2009-05-11 Thread Andre Eckstein
Hi, I have the same setup, SLES 11 with the HASI extension. I'm also not able to get DRBD running correctly with Pacemaker. I followed the DRBD Howto on the Pacemaker homepage step by step; the resources start correctly, and when I deactivate one node within the hb_gui all resources migrate

[Linux-HA] Doubts about OCF RA

2009-05-11 Thread Rodrigo Costa
Dear Linux-HA list, I would like to clear up some doubts I have while preparing a possible openldap OCF RA. I prepared the OCF RA using the existing mysql resource agent in the Linux-HA scripts as a reference. I already had some scripts to ldapsearch the database for integrity purposes

Re: [Linux-HA] crm CLI

2009-05-11 Thread Dejan Muhamedagic
Ciao, On Mon, May 04, 2009 at 02:36:48PM +0200, Cristina Bulfon wrote: Ciao, attached is my cib.xml, where I already have a group and a location constraint with score=100. If I understood correctly, the score is related to all resources so if I don't have one of them the score is less

Re: [Linux-HA] Problems With SLES11 + DRBD

2009-05-11 Thread Darren.Mansell
Woo-hoo I can finally give an answer on this list! :) You need to set the no-quorum-policy to ignore:
# crm configure property no-quorum-policy=ignore
Mine is now running very nicely indeed. The DRBD-pacemaker1.0 howto is superb and if you follow it and understand it then you should be fine.

Re: [Linux-HA] drbd network connection failure

2009-05-11 Thread Darren.Mansell
I'm getting the following in my logs:
[49294.905016] drbd1: PingAck did not arrive in time.
[49294.905027] drbd1: peer( Secondary -> Unknown ) conn( Connected -> NetworkFailure ) pdsk( UpToDate -> DUnknown )
[49294.905041] drbd1: asender terminated
[49294.905044] drbd1: Terminating asender
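
For sifting through many such messages, the state changes in a line like the one quoted here can be pulled out with a small regular expression. This is a rough sketch only: the helper name `parse_drbd_transitions` is hypothetical, and the pattern assumes the `name( Old -> New )` layout seen in this message (it also tolerates a bare `-` in case an archive has dropped the `>`).

```python
import re
from typing import Dict, Tuple

# Matches "name( Old -> New )"; the ">" is optional on purpose.
TRANSITION = re.compile(r"(\w+)\(\s*(\w+)\s*->?\s*(\w+)\s*\)")


def parse_drbd_transitions(line: str) -> Dict[str, Tuple[str, str]]:
    """Map each field in the line (peer, conn, pdsk, ...) to (old, new)."""
    return {field: (old, new) for field, old, new in TRANSITION.findall(line)}
```

Lines without any state change, such as the PingAck message, simply yield an empty dictionary.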

[Linux-HA] Testing STONITH

2009-05-11 Thread Darren.Mansell
I've seen a few mentions of testing STONITH but I can't find any reference at the moment. I want to simulate a situation where one of my two nodes will want to fence the other. Is there any way to do this? I'm using the SSH STONITH plugin for testing and will likely use RSA / ILO for production