Re: [Linux-ha-dev] quorumd - is anyone using it?

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 06:03:03PM +0100, Lars Marowsky-Bree wrote: Hi all, is anyone actually using the quorum daemon? That I wouldn't know, but there were quite a few posts about it recently. Though there was not much discussion. So, I'd say that there's interest and probably not many

[Linux-HA] Solving a strange split-brain with drbd and ha

2008-02-21 Thread Balabam
Hello, I've two nodes working in a split brain configuration and I'm not able to solve this problem. My config is: [EMAIL PROTECTED] ~]# crm_mon Defaulting to one-shot mode You need to have curses available at compile time to enable console mode Last updated: Fri Feb 15

Re: [Linux-HA] Solving a strange split-brain with drbd and ha

2008-02-21 Thread Nikita Michalko
Hello Balabam, Am Donnerstag, 21. Februar 2008 09:25 schrieb Balabam: Hello, I've two nodes working in a split brain configuration and I'm not able to solve this problem. My config is: [EMAIL PROTECTED] ~]# crm_mon Defaulting to one-shot mode You need to have curses available at

Re: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Michael Schwartzkopff
Am Donnerstag, 21. Februar 2008 11:04 schrieb Guy: Make a location constraint that prefers the DRBD ressource in Master state on your first node. But why do you want to do this anyway? Only if the other node is considerably slower or has some other deficits. Beware that your

Re: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Guy
Make a location constraint that prefers the DRBD ressource in Master state on your first node. But why do you want to do this anyway? Only if the other node is considerably slower or has some other deficits. Beware that your resource_stickiness is not higher than the location perference.

Re: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Guy
On 21/02/2008, Michael Schwartzkopff [EMAIL PROTECTED] wrote: No. Perhaps marginally. Do your self a favor and use v2. I'll give it a go with v2 and see how that goes. Thanks for the help. Guy -- Don't just do something...sit there! ___ Linux-HA

Re: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Michael Schwartzkopff
Am Donnerstag, 21. Februar 2008 11:04 schrieb Guy: Make a location constraint that prefers the DRBD ressource in Master state on your first node. But why do you want to do this anyway? Only if the other node is considerably slower or has some other deficits. Beware that your

Re: [Linux-HA] How to avoid split-brain with single network connection between 2 Nodes using stonith?

2008-02-21 Thread Dejan Muhamedagic
Hi, On Wed, Feb 20, 2008 at 03:30:44PM -0500, Blechman, Ronald I, Jr (Ron) wrote: I want to solicit the group's advice on the following proposed application of heartbeat. Hopefully this explanation will be clearer than my last posting on this! We are considering using Heartbeat 1.x to

Re: [Linux-HA] Solving a strange split-brain with drbd and ha

2008-02-21 Thread Balabam
Yep, sorry. Attached. Stefano Nikita Michalko [EMAIL PROTECTED] ha scritto: - REALLY ?? I don't see any attachment ! Check again ! Bye ! Nikita Michalko - - L'email della prossima generazione? Puoi

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: LS, Running a 2 node cluster, heartbeat-2.1.3-3 centos rpms, RH AS 4.6 While testing a maintenance scenario for the cluster I set all resources to is_managed is false, Feb 20 21:09:41 sierpinski pengine: [15725]: notice:

Re: [Linux-HA] some questions about openais

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 08:30:16AM +0800, ?? wrote: hello~~~ I have some questions about openais. (0.80.3) There are mailing lists for openais. This one is about heartbeat. Thanks, Dejan 1. If I only want to use ckpt service to sync app states between 2 PCs. Do I need to

Re: [Linux-HA] Proper BASH operators for MySQL RA

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 04:11:20AM +0100, Christian Rish?j wrote: Hi, I was seeing Feb 21 04:09:09 jazz lrmd: [5079]: info: RA output: (mysql:monitor:stderr) [: 75: Feb 21 04:09:09 jazz lrmd: [5079]: info: RA output: (mysql:monitor:stderr) ==: unexpected operator Feb 21 04:09:09

Re: [Linux-HA] A little STONITH help please

2008-02-21 Thread Dejan Muhamedagic
Hi, On Wed, Feb 20, 2008 at 03:33:08PM -0500, Doug Lochart wrote: On Feb 20, 2008 11:53 AM, Dave Blaschke [EMAIL PROTECTED] wrote: Doug Lochart wrote: I finished flashing and configuring my IPMI devices on my two nodes so that I can implement STONITH resources. I have used the vendor

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Johan Hoeke
Dejan Muhamedagic wrote: Hi, On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: LS, Running a 2 node cluster, heartbeat-2.1.3-3 centos rpms, RH AS 4.6 While testing a maintenance scenario for the cluster I set all resources to is_managed is false, and proceeded to shut oracle

[Linux-HA] Postgres + DRBD + Heartbeat

2008-02-21 Thread Carlos Alexandre de Souza da Silva
Greetings, I am a newbie in linux HA and need to get a postgresql database clustered (active/passive) 2 nodes in my work production environment. I've read some Slony-I docs but I found it very complex for someone who only have basic knlowdge on database administration so I decided to give

Re: [Linux-HA] about a problem in the mysql RA

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 05:36:47PM +0900, [EMAIL PROTECTED] wrote: Hi Dejan and All, Thanks for your comments about apach RA. And I think that there is a problem in the mysql RA. My computer environment is as follows. # mysql --version mysql Ver 14.12 Distrib 5.0.54a, for

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Andreas Kurz
On Thu, Feb 21, 2008 at 12:22 PM, Dejan Muhamedagic [EMAIL PROTECTED] wrote: Hi, On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: LS, Running a 2 node cluster, heartbeat-2.1.3-3 centos rpms, RH AS 4.6 While testing a maintenance scenario for the cluster I set all

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 01:26:12PM +0100, Johan Hoeke wrote: Dejan Muhamedagic wrote: Hi, On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: LS, Running a 2 node cluster, heartbeat-2.1.3-3 centos rpms, RH AS 4.6 While testing a maintenance scenario for the cluster

Re: [Linux-HA] Solving a strange split-brain with drbd and ha

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 12:07:11PM +0100, Balabam wrote: Yep, sorry. Attached. You're missing constraints to run the group on the drbd master node. See http://www.linux-ha.org/DRBD/HowTov2 Thanks, Dejan Stefano Nikita Michalko [EMAIL PROTECTED] ha scritto: - REALLY ??

Re: [Linux-HA] Solving a strange split-brain with drbd and ha

2008-02-21 Thread [EMAIL PROTECTED]
Nikita Michalko ha scritto: Hello Balabam, A Ok, I've solved the drbd split brain, but is not clear: [EMAIL PROTECTED] ~]# crm_mon -1 Defaulting to one-shot mode You need to have curses available at compile time to enable console mode Last updated: Thu Feb 21 14:35:08 2008

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 02:05:32PM +0100, Andreas Kurz wrote: On Thu, Feb 21, 2008 at 12:22 PM, Dejan Muhamedagic [EMAIL PROTECTED] wrote: Hi, On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: LS, Running a 2 node cluster, heartbeat-2.1.3-3 centos rpms, RH AS

Re: [Linux-HA] Apache running on multiple nodes

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 09:44:57AM -0500, Jason Erickson wrote: I added the path for httpd to the apache file and every instance that had to do with that path but it still does not work. Is there another spot I need to change? I really wouldn't know. Just look for ERROR: strings in the

AW: [Linux-HA] Heartbeat 2.1.3 error

2008-02-21 Thread Schmidt, Florian
On 2008-02-14T14:17:05, Nikita Michalko [EMAIL PROTECTED] wrote: heartbeat[5612]: 2008/02/14_09:47:59 WARN: Managed /usr/lib/heartbeat/cib process 5630 exited with return code 1. heartbeat[5612]: 2008/02/14_09:47:59 EMERG: Rebooting system. Reason: /usr/lib/heartbeat/cib Someone

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Johan Hoeke
Dejan Muhamedagic wrote: Hi, On Thu, Feb 21, 2008 at 01:26:12PM +0100, Johan Hoeke wrote: Dejan Muhamedagic wrote: On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: Oops. So there's an on_fail=fence for this monitor operation. Is that necessary? We want the cluster to failover

Re: [Linux-HA] Solving a strange split-brain with drbd and ha

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 02:37:33PM +0100, [EMAIL PROTECTED] wrote: Nikita Michalko ha scritto: Hello Balabam, A Ok, I've solved the drbd split brain, but is not clear: [EMAIL PROTECTED] ~]# crm_mon -1 Defaulting to one-shot mode You need to have curses available at compile time to

[Linux-HA] Scoring system question

2008-02-21 Thread Zoltan Boszormenyi
Hi, we have a problem with automatic IPaddr failback on a system. There are two nodes, IPaddr is preferred running on the master node. Static score for that is 20. Resource stickiness for IPaddr is 40. Pingd is set up the same way the documentation mentions, ha.cf has this: respawn root

Re: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Guy
Thanks for the tip. Look forward to that doc, they're hard to find when it comes to V2 if you're new like me. Guy On 21/02/2008, Fajar Priyanto [EMAIL PROTECTED] wrote: On Wednesday 20 February 2008 20:11:35 Guy wrote: Hi, I'm busy trying out DRBD for the first time. I've got it all

[Linux-HA] quorumd - is anyone using it?

2008-02-21 Thread Lars Marowsky-Bree
Hi all, is anyone actually using the quorum daemon? My assessment seems to suggest that it is not workable in any scenario; but maybe I have missed something? If not and I'm right, I am afraid that users might actually deploy it, thinking it solves something and then be very upset when it fails

Re: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Fajar Priyanto
On Wednesday 20 February 2008 20:11:35 Guy wrote: Hi, I'm busy trying out DRBD for the first time. I've got it all set up and it runs nicely, but when the Primary fails (stopping heartbeat), the Secondary becomes Primary and stays that way once the Primary comes back up again. I've been

Re: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Guy
Hi Florian, I've tried with the config you've described with no success. Hence trying V2 now. Guy On 21/02/2008, Schmidt, Florian [EMAIL PROTECTED] wrote: On 21/02/2008, Michael Schwartzkopff [EMAIL PROTECTED] wrote: No. Perhaps marginally. Do your self a favor and use v2. I'll give it a

AW: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Schmidt, Florian
On 21/02/2008, Michael Schwartzkopff [EMAIL PROTECTED] wrote: No. Perhaps marginally. Do your self a favor and use v2. I'll give it a go with v2 and see how that goes. Thanks for the help. Guy [Florian] Anyway this should be able with V1-style config, too. auto_failback on and drbd started by

AW: [Linux-HA] Primary/Secondary switchover

2008-02-21 Thread Schmidt, Florian
Could you please do the following: (drbd started on init) post a cat /proc/drbd on both nodes and also your haresources-file Tanks :) Hi Florian, I've tried with the config you've described with no success. Hence trying V2 now. Guy On 21/02/2008, Schmidt, Florian [EMAIL PROTECTED] wrote:

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 04:09:19PM +0100, Johan Hoeke wrote: Dejan Muhamedagic wrote: Hi, On Thu, Feb 21, 2008 at 01:26:12PM +0100, Johan Hoeke wrote: Dejan Muhamedagic wrote: On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: Oops. So there's an on_fail=fence for this

Re: [Linux-HA] Scoring system question

2008-02-21 Thread Zoltan Boszormenyi
Zoltan Boszormenyi írta: Hi, we have a problem with automatic IPaddr failback on a system. There are two nodes, IPaddr is preferred running on the master node. Static score for that is 20. Resource stickiness for IPaddr is 40. Pingd is set up the same way the documentation mentions, ha.cf has

Re: [Linux-HA] quorumd - is anyone using it?

2008-02-21 Thread Michael Schwartzkopff
Lars Marowsky-Bree schrieb: Hi all, is anyone actually using the quorum daemon? My assessment seems to suggest that it is not workable in any scenario; but maybe I have missed something? If not and I'm right, I am afraid that users might actually deploy it, thinking it solves something

Re: [Linux-HA] Scoring system question

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 06:40:57PM +0100, Zoltan Boszormenyi wrote: Zoltan Boszormenyi ?rta: Hi, we have a problem with automatic IPaddr failback on a system. There are two nodes, IPaddr is preferred running on the master node. Static score for that is 20. Resource stickiness for IPaddr

[Linux-HA] Master/Slave problems

2008-02-21 Thread Adrian Chapela
Hello again, A few days ago I sent to you a MySQL Master/Slave OCF Script. After a hard test I find some errors on the script (Now, some are partially solved but I didn't upload the script for now...) and in my config: This is the config of Master/Slave: master_slave id=MySQL_Server

Re: [Linux-HA] Salut from far away

2008-02-21 Thread Dejan Muhamedagic
Hi, On Thu, Feb 21, 2008 at 11:35:51PM +0700, Fajar Priyanto wrote: Hello all, I've been a lurker in the list for quite sometime, and been using linux-ha v1. Recently I encourage myself to venture into v2. After reading the wonderful documentation in linux-ha's website, google, and Alan's

Re: [Linux-HA] Heartbeat 2.1.3 error

2008-02-21 Thread maike
then install a missing library libgnutls13-2.0.1-20.i586.rpm and heartbeat ok =D thanks 2008/2/21, Schmidt, Florian [EMAIL PROTECTED]: On 2008-02-14T14:17:05, Nikita Michalko [EMAIL PROTECTED] wrote: heartbeat[5612]: 2008/02/14_09:47:59 WARN: Managed /usr/lib/heartbeat/cib process 5630

[Linux-HA] Some problems with monitoring

2008-02-21 Thread Adrian Chapela
Hello, I am having troubles with resource monitoring. It only runs well some seconds, then monitoring stops and the log says: tengine[23994]: 2008/02/21_18:22:10 info: match_graph_event: Action mysqld-child:0_monitor_2 (2) confirmed on debian_master (rc=0) tengine[23994]:

Re: [Linux-HA] Scoring system question

2008-02-21 Thread Zoltan Boszormenyi
Dejan Muhamedagic írta: Hi, On Thu, Feb 21, 2008 at 06:40:57PM +0100, Zoltan Boszormenyi wrote: Zoltan Boszormenyi ?rta: Hi, we have a problem with automatic IPaddr failback on a system. There are two nodes, IPaddr is preferred running on the master node. Static score for that is 20.

Re: [Linux-HA] quorumd - is anyone using it?

2008-02-21 Thread Lars Marowsky-Bree
On 2008-02-21T18:28:45, Michael Schwartzkopff [EMAIL PROTECTED] wrote: I would like to give it a try if somebody could explain me how it works. That's the problem, I don't see how you can use it to build a working and reliable configuration ;-) ___

[Linux-HA] Heartbeat and DRBD with harmonious configuration

2008-02-21 Thread Doug Lochart
I have been focusing my efforts the past week to learn about stonith and fencing. First I needed to setup and configure my ipmi devices that stonith will use. That works and stonith is now fat and happy. Thanks to those that told me about the undocumented -d option for stonith that unveiled the

Re: [Linux-HA] Some problems with monitoring

2008-02-21 Thread Lars Marowsky-Bree
On 2008-02-21T18:28:00, Adrian Chapela [EMAIL PROTECTED] wrote: Hello, I am having troubles with resource monitoring. It only runs well some seconds, then monitoring stops and the log says: tengine[23994]: 2008/02/21_18:22:10 info: match_graph_event: Action mysqld-child:0_monitor_2

Re: [Linux-HA] Heartbeat and DRBD with harmonious configuration

2008-02-21 Thread Doug Lochart
4) # The node is currently primary, but should become sync target after the negotiating phase. Alert someone about this incident. pri-lost echo pri-lost. Have a look at the log files. | mail -s 'DRBD Alert' [EMAIL PROTECTED]; This just tells me that this node was primary and

Re: [Linux-HA] Heartbeat and DRBD with harmonious configuration

2008-02-21 Thread Lars Marowsky-Bree
On 2008-02-21T14:35:30, Doug Lochart [EMAIL PROTECTED] wrote: So now I am walking through my ha.cf with crm off (yes I want to get this working in version 1 then convert my haresources to cib format afterwards). I don't think this approach is going to make you very happy. v2 is quite

Re: [Linux-HA] monitor operations still running after setting a resource to is_managed=false?

2008-02-21 Thread Johan Hoeke
Dejan Muhamedagic wrote: Hi, On Thu, Feb 21, 2008 at 04:09:19PM +0100, Johan Hoeke wrote: Dejan Muhamedagic wrote: Hi, On Thu, Feb 21, 2008 at 01:26:12PM +0100, Johan Hoeke wrote: Dejan Muhamedagic wrote: On Thu, Feb 21, 2008 at 12:19:55AM +0100, Johan Hoeke wrote: Oops. So there's an

Re: [Linux-HA] Heartbeat and DRBD with harmonious configuration

2008-02-21 Thread Lars Marowsky-Bree
On 2008-02-21T15:08:52, Doug Lochart [EMAIL PROTECTED] wrote: after the negotiating phase. Alert someone about this incident. pri-lost echo pri-lost. Have a look at the log files. | mail -s 'DRBD Alert' [EMAIL PROTECTED]; This just tells me that this node was primary and

[Linux-HA] Apache LSB example

2008-02-21 Thread Jason Erickson
What are the steps to write a LSB script for apache. I am using version 2.1.3 with crm. I found an example once but I can not find it anymore. Does anyone have a good example? Jason ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org

Re: [Linux-HA] Apache LSB example

2008-02-21 Thread Serge Dubrouski
If you are using 2.1.3 with CRM why you don't use included Apache OCF script? On Thu, Feb 21, 2008 at 2:17 PM, Jason Erickson [EMAIL PROTECTED] wrote: What are the steps to write a LSB script for apache. I am using version 2.1.3 with crm. I found an example once but I can not find it anymore.

[Linux-HA] How to determine director state?

2008-02-21 Thread Nicholas Guarracino
Hi everyone, Is there any way to ask heartbeat what mode it is operating in, i.e. active or standby director? I'm sure I could determine this through some indirect means (seeing if one of the processes heartbeat oversees is running, write something to a file from a resource.d script, etc.) but I

Re: [Linux-HA] Apache LSB example

2008-02-21 Thread Jason Erickson
We compiled apache from source so for some reason it will not start up with the ocf file. I went into the file /usr/lib/ocf/resource.d/heartbeat/apache and changed the path for httpd.conf wherever applicable. This still did not work. So I was looking into lsb resources instead to see if that

Re: [Linux-HA] Apache LSB example

2008-02-21 Thread Serge Dubrouski
Attached is my cib.xml file. maybe this will help. Jason Serge Dubrouski wrote: Most probably you incorrectly configured Apache resource in your cib.xml. That OCF script is pretty flexible and shouldn't depend on how you compiled your Apache. As a matter of fact I myself support a cluster

Re: [Linux-HA] Apache LSB example

2008-02-21 Thread Serge Dubrouski
Apache OCF RA needs a statusurl parameter set. Something like that: nvpair id=Apache:statusurl name=statusurl value=http://YOUR_VIRTUAL_IP/server-status/ Without it monitoring function won't work properly. On Thu, Feb 21, 2008 at 2:46 PM, Jason Erickson [EMAIL PROTECTED] wrote: Attached is

[Linux-HA] ha.cf stonith command question

2008-02-21 Thread Doug Lochart
I have tested stonith from the command line and was able to reset the target PC. On the command I used the following: stonith -t external/ipmi -T reset -p capestor2 10.43.120.134 ADMIN mypassword capestor2 This worked marvelously! So then I move the stuff into the ha.cf. Not having much in

[Linux-HA] clarification on operation and operation timers

2008-02-21 Thread Damon Estep
Can someone clarify the following on an lsb resource status operation? Interval - simple enough, how frequently a check is done. Timeout - is this the time allowed for each status query to respond, or the elapsed time required before the specified action is taken? Start Delay - is this

Re: [Linux-HA] How to determine director state?

2008-02-21 Thread Nicholas Guarracino
On Thu, 2008-02-21 at 14:38 -0700, Serge Dubrouski wrote: crm_mon ? I'm guessing from the name that crm_mon only works with crm-style clusters? I am using heartbeat v2 but with a v1-style cluster. DISCLAIMER: Important Notice * This e-mail may

[Linux-HA] Re: Re: Re: RE: Re: About pgsql RA.

2008-02-21 Thread HIDEO YAMAUCHI
Hi Serge, I confirmed it by two patterns. As for the first pattern, PostgreSQL is one instance in Active/Standby. As for the seconds pattern, PostgreSQL is two instance in Active/Active. I confirmed the case which PID was left each and the case which collided, but there was not the problem.