Re: [Pacemaker] Could not connect to the CIB service: connection failed

2010-08-18 Thread Dejan Muhamedagic
sys { > subsys: AMF > debug: off > } > } > > amf { > mode: disabled > } > > --END-- > > /etc/corosync/service.d/pcmk: > service { > # Load the Pacemaker Cluster Resource Manager > name: pacemaker >

Re: [Pacemaker] Could not connect to the CIB service: connection failed

2010-08-17 Thread Dejan Muhamedagic
Hi, On Tue, Aug 17, 2010 at 10:28:01AM +0100, Brett Delle Grazie wrote: > Hi, > > Are you using backports or the madkiss repository? > For lenny you should be using backports. > > I had a similar problem; remove the 'use_logd' and 'use_mgmtd' lines from > your > service entry for pacemaker. Tr

Re: [Pacemaker] migration-threshold causing unnecessary restart of underlying resources

2010-08-17 Thread Dejan Muhamedagic
On Tue, Aug 17, 2010 at 04:14:17AM +0200, Cnut Jansen wrote: > On 16.08.2010 13:29, Dejan Muhamedagic wrote: > >On Sat, Aug 14, 2010 at 06:26:58AM +0200, Cnut Jansen wrote: > >>On 12.08.2010 18:46, Dejan Muhamedagic wrote: > >>>The migration-threshold shouldn't

Re: [Pacemaker] new in list - corosync failover problem

2010-08-17 Thread Dejan Muhamedagic
Hi, On Tue, Aug 17, 2010 at 08:25:24AM +0100, Brett Delle Grazie wrote: > Hello, > > On Mon, 2010-08-16 at 17:27 +0200, Andreas Kurz wrote: > > hello, > > > > On 2010-08-16 04:30, Emanuel dos Reis Rodrigues wrote: > > > hello all, > > > > > > I have installed a pacemaker with corosync clust

Re: [Pacemaker] new in list - corosync failover problem

2010-08-16 Thread Dejan Muhamedagic
Hi, On Sun, Aug 15, 2010 at 10:30:13PM -0400, Emanuel dos Reis Rodrigues wrote: > hello all, > > I have installed a pacemaker with corosync cluster with 2 > nodes, and one resource for IP failover, following the instructions from: > http://clusterlabs.org/wiki/Debian_Lenny_HowTo > > > When I put

Re: [Pacemaker] crm and primitive meta id - 1.0.8 vs 1.0.9

2010-08-16 Thread Dejan Muhamedagic
Hi, On Fri, Aug 13, 2010 at 04:02:33PM -0700, Bob Schatz wrote: > On 1.0.6 and 1.0.8 I used to do this to create a primitive: > > crm configure primitive SS1 ocf:omneon:ss params ss_resource="SS1" \ > ssconf="${CONFIG_FILE}" op monitor interval="3s" role="Master" \ > timeout="7

Re: [Pacemaker] Pacemaker 1.0.8 and -INFINITY master score

2010-08-16 Thread Dejan Muhamedagic
Hi, On Fri, Aug 13, 2010 at 09:25:08AM -0700, Bob Schatz wrote: > Dejan, > > Thanks for the quick response! > > Comments below with [BS] > > - Original Message > From: Dejan Muhamedagic > To: The Pacemaker cluster resource manager > Sent: Fri, August

Re: [Pacemaker] Pacemaker 1.0.8 and -INFINITY master score

2010-08-16 Thread Dejan Muhamedagic
0) > > > Thanks, > > Bob > > > > - Original Message > From: Bob Schatz > To: The Pacemaker cluster resource manager > Sent: Fri, August 13, 2010 9:25:08 AM > Subject: Re: [Pacemaker] Pacemaker 1.0.8 and -INFINITY master score > > Dejan, >

Re: [Pacemaker] migration-threshold causing unnecessary restart of underlying resources

2010-08-16 Thread Dejan Muhamedagic
Hi, On Sat, Aug 14, 2010 at 06:26:58AM +0200, Cnut Jansen wrote: > Hi, > > and first of all thanks for answering so far. > > > On 12.08.2010 18:46, Dejan Muhamedagic wrote: > > > >The migration-threshold shouldn't in any way influence resources > >

Re: [Pacemaker] Best way to specify colocation and ordering

2010-08-13 Thread Dejan Muhamedagic
Hi, On Fri, Aug 13, 2010 at 02:55:30PM +, Chris Picton wrote: > On Fri, 13 Aug 2010 14:37:18 +, Chris Picton wrote: > > >>> On Fri, Aug 13, 2010 at 01:44:28PM +, Chris Picton wrote: I have a > >>> drbd backed mysql server which has the following resources: > >>> > >>> drbd0 -> lvm_da

Re: [Pacemaker] Best way to specify colocation and ordering

2010-08-13 Thread Dejan Muhamedagic
On Fri, Aug 13, 2010 at 01:44:28PM +, Chris Picton wrote: > Hi all > > I have a drbd backed mysql server which has the following resources: > > drbd0 -> lvm_data -> mount_data > drbd1 -> lvm_logs -> mount_logs > mysqld > floatingip > > I would like the drbd based filesystems to start up in

Re: [Pacemaker] Occasional error running ocf scripts

2010-08-13 Thread Dejan Muhamedagic
Hi, On Fri, Aug 13, 2010 at 10:29:43AM +, Chris Picton wrote: > On Fri, 13 Aug 2010 12:06:27 +0200, Dejan Muhamedagic wrote: > > > Hi, > > > > On Fri, Aug 13, 2010 at 11:20:38AM +0200, Chris Picton wrote: > >> Hi all > >> > >> I have see

Re: [Pacemaker] Occasional error running ocf scripts

2010-08-13 Thread Dejan Muhamedagic
Hi, On Fri, Aug 13, 2010 at 11:20:38AM +0200, Chris Picton wrote: > Hi all > > I have seen the following behaviour on a few occasions in the past few > months. It seems as if the resource script gets called, but without the > correct OCF_RESOURCE parameters. > > Here is an example: > >

Re: [Pacemaker] Pacemaker 1.0.8 and -INFINITY master score

2010-08-13 Thread Dejan Muhamedagic
Hi, On Thu, Aug 12, 2010 at 12:54:10PM -0700, Bob Schatz wrote: > I upgraded to Pacemaker 1.0.8 since my application consists of Master/Slave > resources and I wanted to pick up the fix for setting negative master scores. Why not to 1.0.9.1? > I am now able to set negative master scores when a

Re: [Pacemaker] IPv6addr resource error messages on startup

2010-08-13 Thread Dejan Muhamedagic
uld match the parameters. Are you sure that your IPv6addr config is OK? Can you show us your interfaces. Thanks, Dejan > > > > > --- On Thu, 8/12/10, Dejan Muhamedagic wrote: > > > From: Dejan Muhamedagic > > Subject: Re: [Pacemaker] IPv6addr resource error m

Re: [Pacemaker] IPv6addr resource error messages on startup

2010-08-12 Thread Dejan Muhamedagic
Hi, On Thu, Aug 12, 2010 at 11:17:50AM -0700, Dusty Mabe wrote: > Hi, > > I am having a problem trying to get ocf::heartbeat:IPv6addr to work with > pacemaker. I keep getting the following message > > WARN: unpack_rsc_op: Processing failed op myIPv6addr_start_0 on hasync-1a: > unknown error (

Re: [Pacemaker] migration-threshold causing unnecessary restart of underlying resources

2010-08-12 Thread Dejan Muhamedagic
Hi, On Thu, Aug 12, 2010 at 04:12:02AM +0200, Cnut Jansen wrote: > Hi, > > I'm once again experiencing (imho) strange behaviour respectively > decision-making by Pacemaker, and I hope that someone can either > enlighten me a little about this, its intention and/or a possible > misconfiguration o

Re: [Pacemaker] lrmd WARN on high IO load

2010-08-11 Thread Dejan Muhamedagic
Hi, On Wed, Aug 11, 2010 at 05:17:03PM -0300, Diego Woitasen wrote: > Hi > > 2010/8/2 Dejan Muhamedagic : > > Hi, > > > > On Mon, Jul 19, 2010 at 07:09:11PM -0300, Diego Woitasen wrote: > >> 2010/7/16 Diego Woitasen : > >> > Hi, > >>

Re: [Pacemaker] Temporarely suspending monitoring

2010-08-11 Thread Dejan Muhamedagic
Hi, On Thu, Aug 12, 2010 at 12:14:59AM +0200, Bart Coninckx wrote: > On Wednesday 11 August 2010 23:55:42 Bart Coninckx wrote: > > On Wednesday 11 August 2010 23:01:22 Vince Gabriel wrote: > > > > -Original Message- > > > > From: Bart Coninckx [mailto:bart.conin...@telenet.be] > > > > Sent

Re: [Pacemaker] Need help using OCFS2 with openais/pacemaker

2010-08-11 Thread Dejan Muhamedagic
On Wed, Aug 11, 2010 at 11:11:45PM +0300, Vladislav Bogdanov wrote: > 11.08.2010 22:09, patrick.ouel...@promutuel.ca wrote: > > First of all, wow guys great software I love it so far. > > > > Second, I hope I'm posting this at the right place or I'll get flamed. > > > > I have followed the great

Re: [Pacemaker] Antwort: Re: stonith sbd problem

2010-08-11 Thread Dejan Muhamedagic
Hi, On Wed, Aug 11, 2010 at 11:48:17AM +0200, philipp.achmuel...@arz.at wrote: > i removed the clone, set the global cluster property for stonith-timeout. > > the nodes need about 3-5 minutes to startup after they get "shot" > > i did some more tests and found out that if the node, which runs re

Re: [Pacemaker] An issue about failcount of resources when start action failed

2010-08-10 Thread Dejan Muhamedagic
Hi, On Tue, Aug 10, 2010 at 06:29:45PM +0800, Jingcheng zhang wrote: > Dear Beekhof, > I configured a two node cluster with clone resource A. When the > resource A start failed, I saw the failed actions (start operation) in > crm_mon but the failcount displayed by "crm resource" is 0. Is

Re: [Pacemaker] stonith sbd problem

2010-08-10 Thread Dejan Muhamedagic
Hi, On Tue, Aug 10, 2010 at 10:16:05AM +0200, philipp.achmuel...@arz.at wrote: > hi, > > following configuration: > > node lnx0047a > node lnx0047b > primitive lnx0101a ocf:heartbeat:KVM \ > params name="lnx0101a" \ > meta allow-migrate="1" target-role="Started" \ > op mi

Re: [Pacemaker] Preventing resource from becoming inactive

2010-08-10 Thread Dejan Muhamedagic
Hi, On Tue, Aug 10, 2010 at 07:56:47AM +0200, Torsten Bronger wrote: > Hello! > > Sometimes Pacemaker just switches off my Lighttpd. How do you mean "just switches off"? > It becomes > inactive and is never reanimated. Only restarting the Heartbeat > service helps. How can I tell Pacemaker

Re: [Pacemaker] rsc_order plus resource_set problem

2010-08-10 Thread Dejan Muhamedagic
Hi, On Tue, Aug 10, 2010 at 11:00:29AM +0800, Michael Fung wrote: > Hello Dejan, > > > > > That cannot be the case, because tabs are treated as space. The > > problem comes from specifying 'sequential="true"' which, because > > default, is not generated hence the original XML and the > > generat

Re: [Pacemaker] rsc_order plus resource_set problem

2010-08-09 Thread Dejan Muhamedagic
Hi, On Mon, Aug 09, 2010 at 09:57:18PM +0800, Michael Fung wrote: > > On 2010/8/9 07:53 PM, Dejan Muhamedagic

Re: [Pacemaker] Unexpected master/slave behavior

2010-08-09 Thread Dejan Muhamedagic
Hi, On Mon, Aug 09, 2010 at 01:34:16PM +0200, David Mohr wrote: > > Hi everyone, > we are trying to understand how pacemaker deals with master/slave > resources. After reading the documentation we believe we understand how > location and colocation work. Yet in practice we don't get the results t

Re: [Pacemaker] rsc_order plus resource_set problem

2010-08-09 Thread Dejan Muhamedagic
Hi, On Sun, Aug 08, 2010 at 09:00:37PM +0800, Michael Fung wrote: > Hello again, > > I have a long list of resources that must be started in order, so I > followed the "Pacemaker 1.0 Configuration Explained" doc, to write it > like that: > > (the constraints scope was originally blank as ) > >

Re: [Pacemaker] Processes are being run with realtime priority

2010-08-09 Thread Dejan Muhamedagic
Hi, On Fri, Aug 06, 2010 at 02:17:26PM -0400, Doug _ wrote: > I'm using Pacemaker and Corosync, and I notice that all my cluster resources > are started with realtime priority. How do I lower this to more normal > process priorities? It should be done automatically by lrmd. At one point this fun

[Pacemaker] testing resources (was: Crazy idea #1)

2010-08-03 Thread Dejan Muhamedagic
Hi, On Mon, Aug 02, 2010 at 11:07:46PM +0200, Lars Marowsky-Bree wrote: > On 2010-08-02T19:05:53, Dejan Muhamedagic wrote: > > > Testing/starting/etc resources is easy, but the shell doesn't > > know about dependencies. > > I think this might not even be needed f

Re: [Pacemaker] Two node lsb:nfs failing starting second node

2010-08-02 Thread Dejan Muhamedagic
Hi, On Wed, Jul 28, 2010 at 06:48:49AM -0400, Rick Day wrote: > I am setting up a two node cluster on RHEL 5.5 with Pacemaker > 1.0.9.1-1. I have a resource set up to start NFS with lsb. I > bring up my first node and everything is fine. All the > resources start up. When I bring up the second nod

Re: [Pacemaker] Question: Two Nodes - Mirrored SANs - SBD?

2010-08-02 Thread Dejan Muhamedagic
Hi, On Wed, Jul 21, 2010 at 08:08:30AM +, Rainer Lutz wrote: > Hello all, > > first the scenario: > > Two nodes connected to two SANs via fibre channel and > mirrored devices. Multipath partitions are used. > > So I have two disks ( one disk SAN A and one disk SAN B ) > combined to let's s

Re: [Pacemaker] Adding a ressource

2010-08-02 Thread Dejan Muhamedagic
Hi, On Fri, Jul 16, 2010 at 11:52:42AM +0200, pierre.casen...@almerys.com wrote: > Hello all, > I'm running an active/passive cluster with apache running on it. > I've declared 2 IPs and the apache process as resources. I've declared a > colocation between these IPs and apache, and an order so th

Re: [Pacemaker] lrmd WARN on high IO load

2010-08-02 Thread Dejan Muhamedagic
Hi, On Mon, Jul 19, 2010 at 07:09:11PM -0300, Diego Woitasen wrote: > 2010/7/16 Diego Woitasen : > > Hi, > >  I've installed Heartbeat+Pacemaker (3.0.3 and 1.0.9). I have a > > resource which executes an script to check the service: > > > > primitive kolab_imapd ocf:heartbeat:kolab-service \ > >  

Re: [Pacemaker] /.crm_help_index file (in system root aka /)

2010-07-15 Thread Dejan Muhamedagic
Hi, On Wed, Jul 14, 2010 at 04:16:24PM +0200, Raoul Bhatia [IPAX] wrote: > On 07/13/2010 09:47 PM, Maros Timko wrote: > > The python crm scripts use os.getenv("HOME") to decide where to look > > for or store the history file. Some of the environments (cronjob or > > sudo) do have HOME set to "/".

Re: [Pacemaker] [patch v2] low: remove various bashisms

2010-07-13 Thread Dejan Muhamedagic
Horman wrote: > # HG changeset patch > # User Simon Horman > # Date 1278925288 -32400 > # Node ID e78d5591acbc9b76e85d8e31b410dea5bfe8e574 > # Parent 110d056193472fa64ffabd3069d5ed20d32b01c2 > low: remove various bashisms > > --- > > v2 > Address concerns raised by

Re: [Pacemaker] crm configure update and properties

2010-07-12 Thread Dejan Muhamedagic
Hi, On Fri, Jul 09, 2010 at 09:53:04PM -0400, Vadym Chepkov wrote: > Hi, > > I am not sure if it's a bug or not, but certainly not a pleasant feature. > At first I configured the cluster with property > stonith-enabled="false", because I didn't have a stonith device handy. > Then, after I got an APC

Re: [Pacemaker] [PATCH 2 of 2] low: remove various bashisms

2010-07-12 Thread Dejan Muhamedagic
Hi, On Mon, Jul 12, 2010 at 09:54:01AM +0900, Simon Horman wrote: > On Fri, Jul 09, 2010 at 04:05:13PM +0200, Dejan Muhamedagic wrote: > > Hi, > > > > On Thu, Jul 08, 2010 at 03:16:03PM +0900, Simon Horman wrote: > > > # HG changeset patch > > > # Use

Re: [Pacemaker] [PATCH] suggested bashism fixes for HealthSMART OCF RA

2010-07-09 Thread Dejan Muhamedagic
Hi, On Tue, Jul 06, 2010 at 04:29:48PM +0200, Raoul Bhatia [IPAX] wrote: > # HG changeset patch > # User Raoul Bhatia [IPAX] > # Date 1278426578 -7200 > # Branch stable-1.0 > # Node ID 31401399d6334467296a60a13d0cea7641fc9358 > # Parent 338113649a70f80fe89ac0765035a79f70cb202f > suggested bashis

Re: [Pacemaker] [PATCH 2 of 2] low: remove various bashisms

2010-07-09 Thread Dejan Muhamedagic
Hi, On Thu, Jul 08, 2010 at 03:16:03PM +0900, Simon Horman wrote: > # HG changeset patch > # User Simon Horman > # Date 1278569313 -32400 > # Node ID 48a51108d0d181ecb21c3289d3bc86b46f77f622 > # Parent 110d056193472fa64ffabd3069d5ed20d32b01c2 > low: remove various bashisms > > As reported by De

Re: [Pacemaker] [PATCH 1 of 2] low: Use awk instead of bash to calculate memory and disk sizes

2010-07-09 Thread Dejan Muhamedagic
Hi Horms, Amazing work :) If nobody objects, I'll apply the patch. Cheers, Dejan On Thu, Jul 08, 2010 at 03:16:02PM +0900, Simon Horman wrote: > # HG changeset patch > # User Simon Horman > # Date 1278569160 -32400 > # Node ID 110d056193472fa64ffabd3069d5ed20d32b01c2 > # Parent e823bf55e0d875

Re: [Pacemaker] [PATCH] suggested bashism fixes for hb2openais.sh

2010-07-09 Thread Dejan Muhamedagic
Hi, On Wed, Jul 07, 2010 at 12:39:57PM +0200, Raoul Bhatia [IPAX] wrote: > On 07/07/2010 11:37 AM, Dejan Muhamedagic wrote: > > Yes, this is just an auxiliary script, I'll make it a bash > > script. > > why not apply the rather trivial changes? OK, we can do that inst

Re: [Pacemaker] Possible bug with ibmrsa stonith resource

2010-07-08 Thread Dejan Muhamedagic
Hi, On Wed, Jul 07, 2010 at 04:09:51PM -0400, claude.duroc...@mcccf.gouv.qc.ca wrote: > > - Notice: This message is confidential and intended only for its recipients. > If you receive it in error, please delete it and notify us. > - > I'm using the latest version of the external/ibmrsa

Re: [Pacemaker] Upgraded mysql from 5.0 to 5.1

2010-07-07 Thread Dejan Muhamedagic
Hi, On Tue, Jul 06, 2010 at 12:42:41PM -0400, Jake Bogie wrote: > So I took Raoul's advice and ditched the lsb:mysql check and went for > the ocf:heartbeat version however... > > I'm getting this now... > > What am I missing? I'm having a hard time finding a document on how to > setup this resou

Re: [Pacemaker] [PATCH] suggested bashism fixes for hb2openais.sh

2010-07-07 Thread Dejan Muhamedagic
Hi, On Wed, Jul 07, 2010 at 01:12:20AM +0200, Lars Ellenberg wrote: > On Tue, Jul 06, 2010 at 04:45:13PM +0200, Raoul Bhatia [IPAX] wrote: > > # HG changeset patch > > # User Raoul Bhatia [IPAX] > > # Date 1278427473 -7200 > > # Branch stable-1.0 > > # Node ID 6396b06964a167a53b57b80ab316c96c9de3

Re: [Pacemaker] crm resource cleanup ignored

2010-07-02 Thread Dejan Muhamedagic
Hi, On Fri, Jul 02, 2010 at 02:56:04PM +0200, Bernd Schubert wrote: > Hello all, > > after the update 1.0.9 on our test cluster, new weird stonith issues > come up. > > 1) It fails to start stonith resources on *some* nodes > === > > Jul 02

Re: [Pacemaker] Stop one instance in a clone

2010-07-02 Thread Dejan Muhamedagic
Hi, On Thu, Jul 01, 2010 at 10:23:41PM +0200, Andrew Beekhof wrote: > On Wed, Jun 30, 2010 at 7:07 PM, Dejan Muhamedagic > wrote: > > Hi, > > > > On Wed, Jun 30, 2010 at 10:57:21AM -0600, Serge Dubrouski wrote: > >> Hello - > >> > >> Is there

Re: [Pacemaker] /.crm_help_index file (in system root aka /)

2010-07-02 Thread Dejan Muhamedagic
On Thu, Jul 01, 2010 at 07:46:44PM +0200, Raoul Bhatia [IPAX] wrote: > On 07/01/2010 05:46 PM, Dejan Muhamedagic wrote: > > The help index is created in the user's home. Is that the home of > > the root user? Shouldn't it be /root? BTW, there are many other > > prog

Re: [Pacemaker] /.crm_help_index file (in system root aka /)

2010-07-01 Thread Dejan Muhamedagic
Hi, On Thu, Jul 01, 2010 at 03:23:10PM +0200, Raoul Bhatia [IPAX] wrote: > hi, > > sometimes, i see a /.crm_help_index file being created on my system(s). > i do not exactly know when this happens, but i get the feeling that this > is not the correct place for this file ;) The help index is crea

Re: [Pacemaker] starting resources: Interrupted system call

2010-07-01 Thread Dejan Muhamedagic
Hi, On Thu, Jul 01, 2010 at 03:37:57PM +0200, Bernd Schubert wrote: > Never mind, seems to be fixed in 1.0.9 I have no idea what was going on in there. The pacemaker bits shouldn't make a difference. Thanks, Dejan > Thanks, > Bernd > > On Thursday, July 01, 2010, Bernd Schubert wrote: > > Hi

Re: [Pacemaker] pacemaker fails to start drbd using ocf:linbit:drbd

2010-07-01 Thread Dejan Muhamedagic
Hi, On Thu, Jul 01, 2010 at 10:37:10AM +0200, martin.br...@icw.de wrote: > Hi Bart, > > my guess is that you did forget the start-delay attribute for the monitor > operations, that's why you see the time-out error message. > > Here is an example: > > > op monitor interval="20" role="

Re: [Pacemaker] Stop one instance in a clone

2010-06-30 Thread Dejan Muhamedagic
On Wed, Jun 30, 2010 at 11:16:13AM -0600, Serge Dubrouski wrote: > On Wed, Jun 30, 2010 at 11:07 AM, Dejan Muhamedagic > wrote: > > Hi, > > > > On Wed, Jun 30, 2010 at 10:57:21AM -0600, Serge Dubrouski wrote: > >> Hello - > >> > >> Is there an

Re: [Pacemaker] Stop one instance in a clone

2010-06-30 Thread Dejan Muhamedagic
Hi, On Wed, Jun 30, 2010 at 10:57:21AM -0600, Serge Dubrouski wrote: > Hello - > > Is there any way to stop an instance of a cloned resource on a > particular node using crm shell? How would you stop it with crm_resource? Perhaps with one -inf location constraint for the target node? Thanks,

Re: [Pacemaker] crm_node "-A" option does not work

2010-06-29 Thread Dejan Muhamedagic
Hi, On Tue, Jun 29, 2010 at 11:52:01AM +0200, Andrew Beekhof wrote: > 2010/6/28 NAKAHIRA Kazutomo : > > Hi, > > > > I noticed that crm_node "-A" option does not work now. > > And I wrote a small patch for this issue. > > # Please see attached patch. > > Applied. Thanks! > > > > > BTW, Is removin

Re: [Pacemaker] Trouble getting stonith with external/ipmi to work

2010-06-25 Thread Dejan Muhamedagic
Hi, On Fri, Jun 25, 2010 at 11:58:01AM -0500, Bart Willems wrote: > I am setting SLES11 SP1 HA on 2 nodes and would like to use external/ipmi > for stonith. I have setup a resource that successfully migrates an IP from > node1 to node2 when I turn off openais on node1, and migrates back when I > t

Re: [Pacemaker] Master/Slave not failing over

2010-06-24 Thread Dejan Muhamedagic
Hi, On Thu, Jun 24, 2010 at 12:12:34PM -0400, Eliot Gable wrote: > On another note, I cannot seem to get Pacemaker to monitor the master node. > It monitors the slave node just fine. These are the operations I have defined: > > op monitor interval="5" timeout="30s" \ > op monitor

Re: [Pacemaker] Active/active firewall using pacemaker ... and a helluva lot of IP addresses

2010-06-24 Thread Dejan Muhamedagic
Hi, On Wed, Jun 23, 2010 at 06:44:44PM +0200, Roberto Suarez Soto wrote: > Hi, > > we've configured several active/active two-node firewalls using > pacemaker and clusterip (an iptables extension; we use Linux), with good > results. We have several IP addresses on the firewall that we use f

Re: [Pacemaker] compile error with stable-1.0/15618

2010-06-24 Thread Dejan Muhamedagic
Hi, On Thu, Jun 24, 2010 at 02:31:24PM +0200, Oliver Heinz wrote: > > I'm trying to rebuild debian packages for the current stable-1.0 branch in > the > repository on debian squeeze. > > Rebuilding the debian packages (pacemaker-1.0.8+hg15494) works fine so build > dependencies should be sati

Re: [Pacemaker] ClusterMon failing: call=220, rc=1, status=complete): unknown error

2010-06-24 Thread Dejan Muhamedagic
On Thu, Jun 24, 2010 at 02:14:51PM +0200, Koch, Sebastian wrote: > Hi, > > I have a small issue with the ClusterMon agent. The monitor > actions for this agent seem to fail (if I look into syslog, > you'll find it below) and I am not able to troubleshoot it. I > tried to start the agent on the fail

Re: [Pacemaker] suse 11 sp1 ha use_mgmtd:1 issue

2010-06-23 Thread Dejan Muhamedagic
Hi, On Wed, Jun 23, 2010 at 05:01:58PM +0200, office xh9 wrote: > Hey all, > > i m having problem with suse 11 server sp1 ha extension. if i provide the > > use_mgmtd: 1 Should be use_mgmtd: yes, like this: service { # Load the Pacemaker Cluster Resource Manager ver: 0

Re: [Pacemaker] pacemaker-mgmt compile error

2010-06-22 Thread Dejan Muhamedagic
Hi, On Tue, Jun 22, 2010 at 08:57:23PM +0800, Michael Fung wrote: > Hi All, > > > I would like to try the Python GUI. According to a previous post by Yan Gao: > > > If you are using pacemaker 1.0 series, you could either retrieve > > pacemaker-mgmt-2.0.0 from: > > > http://hg.clusterlabs.org/pa

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-17 Thread Dejan Muhamedagic
On Wed, Jun 16, 2010 at 11:07:40AM +0200, Andrew Beekhof wrote: > On Wed, Jun 16, 2010 at 10:29 AM, Dejan Muhamedagic > wrote: > > Hi, > > > > On Wed, Jun 16, 2010 at 08:55:26AM +0200, Andrew Beekhof wrote: > >> On Tue, Jun 15, 2010 at 9:41 P

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-17 Thread Dejan Muhamedagic
Hi, On Wed, Jun 16, 2010 at 09:07:10AM -0400, Vadym Chepkov wrote: > > On Jun 16, 2010, at 2:55 AM, Andrew Beekhof wrote: > > > On Tue, Jun 15, 2010 at 9:41 PM, Dejan Muhamedagic > > wrote: > > > >> colocation not-together -inf: d1 d2 d3 > > >

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-17 Thread Dejan Muhamedagic
On Wed, Jun 16, 2010 at 08:54:37AM -0400, Vadym Chepkov wrote: > > On Jun 15, 2010, at 3:52 PM, Dejan Muhamedagic wrote: > > > On Tue, Jun 15, 2010 at 12:53:07PM -0400, Vadym Chepkov wrote: > >> > >> On Jun 15, 2010, at 9:26 AM, Vadym Chepkov wrote: > >

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-16 Thread Dejan Muhamedagic
Hi, On Wed, Jun 16, 2010 at 08:55:26AM +0200, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 9:41 PM, Dejan Muhamedagic > wrote: > > > colocation not-together -inf: d1 d2 d3 > > I think there is a problem with this syntax, particularly for +inf. > > Consider:

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
On Tue, Jun 15, 2010 at 04:44:31PM -0400, Vadym Chepkov wrote: > > On Jun 15, 2010, at 3:55 PM, Dejan Muhamedagic wrote: > > > On Tue, Jun 15, 2010 at 03:41:17PM -0400, Vadym Chepkov wrote: > >> > >> On Jun 15, 2010, at 3:36 PM, Dejan Muhamedagic wrote: > >

Re: [Pacemaker] abrupt power failure problem

2010-06-15 Thread Dejan Muhamedagic
Hi, On Tue, Jun 15, 2010 at 02:25:51PM -0600, Dan Urist wrote: > On Tue, 15 Jun 2010 22:08:37 +0200 > Dejan Muhamedagic wrote: > > > Hi, > > > > On Tue, Jun 15, 2010 at 01:15:08PM -0600, Dan Urist wrote: > > > I've recently had exactly the sam

Re: [Pacemaker] abrupt power failure problem

2010-06-15 Thread Dejan Muhamedagic
Hi, On Tue, Jun 15, 2010 at 01:15:08PM -0600, Dan Urist wrote: > I've recently had exactly the same thing happen. One (highly kludgey!) > solution I've considered is hacking a custom version of the stonith IPMI > agent that would check whether the node was at all reachable following a > stonith fa

Re: [Pacemaker] crm node delete

2010-06-15 Thread Dejan Muhamedagic
Hi, On Tue, Jun 15, 2010 at 05:09:14PM +0100, Maros Timko wrote: > > On Fri, Jun 11, 2010 at 03:45:19PM +0100, Maros Timko wrote: > >> Hi all, > >> > >> using heartbeat stack. I have a system with one node offline: > >> > >> Last updated: Fri Jun 11 13:52:40 2010 > >> Stack: Heartb

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
On Tue, Jun 15, 2010 at 03:41:17PM -0400, Vadym Chepkov wrote: > > On Jun 15, 2010, at 3:36 PM, Dejan Muhamedagic wrote: > > > Hi, > > > > On Tue, Jun 15, 2010 at 08:45:37AM -0400, Vadym Chepkov wrote: > >> > >> On Jun 15, 2010, at 6:1

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
On Tue, Jun 15, 2010 at 12:53:07PM -0400, Vadym Chepkov wrote: > > On Jun 15, 2010, at 9:26 AM, Vadym Chepkov wrote: > >>> > >>> what about this part? what do I need to do to prevent them from running > >>> on different nodes for sure? > >> > >> You can't have it both ways. > >> Either they hav

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
On Tue, Jun 15, 2010 at 01:50:06PM +0200, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 1:38 PM, Vadym Chepkov wrote: > > > > On Jun 15, 2010, at 4:57 AM, Andrew Beekhof wrote: > > > >> On Tue, Jun 15, 2010 at 10:23 AM, Andreas Kurz > >> wrote: > >>> On Tuesday 15 June 2010 08:40:58 Andrew Bee

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
Hi, On Tue, Jun 15, 2010 at 08:45:37AM -0400, Vadym Chepkov wrote: > > On Jun 15, 2010, at 6:14 AM, Dejan Muhamedagic wrote: > > > Hi, > > > > On Tue, Jun 15, 2010 at 10:57:47AM +0200, Andrew Beekhof wrote: > >> On Tue, Jun 15, 2010 at 10:23 AM, Andreas Ku

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
On Tue, Jun 15, 2010 at 12:56:11PM +0200, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 12:39 PM, Dejan Muhamedagic > wrote: > > On Tue, Jun 15, 2010 at 12:30:45PM +0200, Andrew Beekhof wrote: > >> On Tue, Jun 15, 2010 at 12:14 PM, Dejan Muhamedagic > >> wrot

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
On Tue, Jun 15, 2010 at 12:30:45PM +0200, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 12:14 PM, Dejan Muhamedagic > wrote: > > Hi, > > > > On Tue, Jun 15, 2010 at 10:57:47AM +0200, Andrew Beekhof wrote: > >> On Tue, Jun 15, 2010 at 10:23 AM, Andreas Kurz

Re: [Pacemaker] UPDATE...2 node cluster with clvm, configuration help needed...

2010-06-15 Thread Dejan Muhamedagic
Hi, On Tue, Jun 15, 2010 at 11:09:15AM +0200, patrik.rappo...@knapp.com wrote: > > hi guys, > > my colleague gave me a tip, that the stonith resource on node 1, when node > 2 is offline, won't work because of a false state (can't reach the asm module > of node 2) and so the other resources (vg, l

Re: [Pacemaker] Shouldn't colocation -inf: be mandatory?

2010-06-15 Thread Dejan Muhamedagic
Hi, On Tue, Jun 15, 2010 at 10:57:47AM +0200, Andrew Beekhof wrote: > On Tue, Jun 15, 2010 at 10:23 AM, Andreas Kurz > wrote: > > On Tuesday 15 June 2010 08:40:58 Andrew Beekhof wrote: > >> On Mon, Jun 14, 2010 at 4:22 PM, Vadym Chepkov wrote: > >> > On Jun 7, 2010, at 8:04 AM, Vadym Chepkov wr

Re: [Pacemaker] how do I avoid infinite reboot cycles by fencing just the offline node?

2010-06-14 Thread Dejan Muhamedagic
Hi, On Mon, Jun 14, 2010 at 06:29:59PM +0200, Oliver Heinz wrote: > On Monday, 14 June 2010, at 16:43:54, Dejan Muhamedagic wrote: > > Hi, > > > > On Mon, Jun 14, 2010 at 02:26:57PM +0200, Oliver Heinz wrote: > > > I configured a sbd fencing device on the

Re: [Pacemaker] how do I avoid infinite reboot cycles by fencing just the offline node?

2010-06-14 Thread Dejan Muhamedagic
Hi, On Mon, Jun 14, 2010 at 02:26:57PM +0200, Oliver Heinz wrote: > > I configured a sbd fencing device on the shared storage to prevent data > corruption. It works basically, but when I pull the network plugs on one node > to simulate a failure one of the nodes is fenced (not necessarily the o

Re: [Pacemaker] crm node delete

2010-06-13 Thread Dejan Muhamedagic
Hi, On Fri, Jun 11, 2010 at 03:45:19PM +0100, Maros Timko wrote: > Hi all, > > using heartbeat stack. I have a system with one node offline: > > Last updated: Fri Jun 11 13:52:40 2010 > Stack: Heartbeat > Current DC: vsp7.example.com (ba6d6332-71dd-465b-a030-227bcd31a25f) - > par

Re: [Pacemaker] How to replace an agent

2010-06-11 Thread Dejan Muhamedagic
Hi, On Fri, Jun 11, 2010 at 09:10:17AM -0400, Vadym Chepkov wrote: > > On Jun 10, 2010, at 9:03 AM, Dejan Muhamedagic wrote: > > > Hi, > > > > On Thu, Jun 10, 2010 at 08:46:22AM -0400, Vadym Chepkov wrote: > >> Hi, > >> > >> I stumb

Re: [Pacemaker] Active/Active issue

2010-06-11 Thread Dejan Muhamedagic
Hi, On Thu, Jun 10, 2010 at 10:47:07AM -0500, David wrote: > Got my first install of pacemaker/corosync running for a 2 node > apache cluster in active/active mode yesterday on CentOS 5.5 x86_64. > > Everything was working just fine until I tested disaster recovery by > failing both servers and r

Re: [Pacemaker] How to replace an agent

2010-06-10 Thread Dejan Muhamedagic
Hi, On Thu, Jun 10, 2010 at 08:46:22AM -0400, Vadym Chepkov wrote: > Hi, > > I stumbled upon interesting feature or a bug, not sure how to classify it. > > I needed to add a resource to a cluster and since it didn't have native RA, I > used 'anything' RA while I was working on a new script. Whe

Re: [Pacemaker] Service failback issue with SLES11 and HAE 11

2010-06-10 Thread Dejan Muhamedagic
B ? Don't know. Never heard of such a case. Some network equipment doesn't deal well with multicasting. > I'm curious about the way how suse cluster nodes communicate with each > other. Multicast UDP. > Can you point the way to shed me some light ? No, sorry.

Re: [Pacemaker] Issues with constraints - working for start/stop, being ignored on "failures"

2010-06-10 Thread Dejan Muhamedagic
Hi, On Thu, Jun 10, 2010 at 01:14:28AM +0200, Cnut Jansen wrote: > > > On 08.06.2010 19:32, Dejan Muhamedagic wrote: > >Hi, > > > >On Sun, Jun 06, 2010 at 07:07:32PM -0600, Tim Serong wrote: > >>Say what? The CRM shell shouldn't be canceling ops...

Re: [Pacemaker] 2 node cluster with clvm, configuration help needed...

2010-06-09 Thread Dejan Muhamedagic
Hi, On Wed, Jun 09, 2010 at 08:37:38AM +0200, Andrew Beekhof wrote: > On Fri, Jun 4, 2010 at 10:03 AM, Dejan Muhamedagic > wrote: > > On Thu, Jun 03, 2010 at 07:57:59AM +0200, Andrew Beekhof wrote: > >> On Wed, Jun 2, 2010 at 1:25 PM,   wrote: > >> > >

Re: [Pacemaker] Cluster frozen after "crm resource cleanup"

2010-06-09 Thread Dejan Muhamedagic
Hi, On Wed, Jun 09, 2010 at 07:16:28AM +0200, Stefan Foerster wrote: > * Dejan Muhamedagic : > > > http://www.incertum.net/~cite/messages.mudslide1 > > > http://www.incertum.net/~cite/messages.mudslide2 > [...] > > Please make a hb_report for this incident and open

Re: [Pacemaker] Dependent Resources

2010-06-09 Thread Dejan Muhamedagic
Hi, On Wed, Jun 09, 2010 at 08:19:16AM -0500, Schaefer, Diane E wrote: > >> Hi, > > >> > > >> I have a parent resource(A) with two others that depend on it (B, C). The > > >> resources of B and C will not run if A is not running. I would like to > > >> monitor B and C in addition to A for avai

Re: [Pacemaker] Service failback issue with SLES11 and HAE 11

2010-06-09 Thread Dejan Muhamedagic
Hi, On Wed, Jun 09, 2010 at 03:27:08PM +0800, ben180 wrote: > Dear Dejan, > > The test sequence is: > > 1.Service is running on ServerA(tibcodb) > 2.The Network Cable on ServerA is pulled out > 3.ServerB(tibcodb2) fenced ServerA,ServerA reboot > 4.ServerB take over the service > 5.ServerA restar

Re: [Pacemaker] Cluster split brain on vmware VSphere

2010-06-09 Thread Dejan Muhamedagic
> defaults. You should also raise the consensus value to 12000. corosync would even refuse to start in this case. Thanks, Dejan > > Thank you again. > Regards, > Roberto > > > > > -Original Message- > > From: Dejan Muhamedagic [mailto:deja...@fas

Re: [Pacemaker] Issues with constraints - working for start/stop, being ignored on "failures"

2010-06-08 Thread Dejan Muhamedagic
Hi, On Sun, Jun 06, 2010 at 07:07:32PM -0600, Tim Serong wrote: > On 6/2/2010 at 11:10 AM, Cnut Jansen wrote: > > Am 31.05.2010 05:47, schrieb Tim Serong: > > > On 5/31/2010 at 12:57 PM, Cnut Jansen wrote: > > > > > >> Current constraints: > > >> colocation TEST_colocO2cb inf: cloneO2cb clo

Re: [Pacemaker] Pacemaker resource management

2010-06-08 Thread Dejan Muhamedagic
Hi, On Sun, Jun 06, 2010 at 05:47:35PM +0300, Dan Frincu wrote: > Hello all, > > I have a couple of questions and I haven't found any relevant > documentation about it so I would appreciate any answers on the > matter. > > I'm using drbd 8.3.2-6 with pacemaker 1.0.5-4.2, openais 0.80.5-15.2 > an

Re: [Pacemaker] active-active setup with crm clone and load balancing

2010-06-08 Thread Dejan Muhamedagic
Hi, On Sun, Jun 06, 2010 at 03:11:10PM +0200, Tomas Kouba wrote: > Hello all, > > I am running a simple information system that does not need to have > a backend storage. > I would like to run it in two instances on two nodes and have them in > high available and load balancing setup. So the beha

Re: [Pacemaker] pingd problems

2010-06-08 Thread Dejan Muhamedagic
Hi, On Tue, Jun 08, 2010 at 06:43:11PM +0200, Dalibor Dukic wrote: > On Sat, 2010-06-05 at 15:36 +0200, Dalibor Dukic wrote: > > I have a problem with ping RA not correctly updating CIB with appropriate > > attributes when doing a fresh start. So afterwards IPaddr2 resources won't > > start. > > Have

Re: [Pacemaker] Cluster split brain on vmware VSphere

2010-06-08 Thread Dejan Muhamedagic
Hi, On Mon, Jun 07, 2010 at 02:57:57PM +0200, Torresani, Roberto wrote: > Sorry for having chosen the wrong ml... That's no problem. There's just a better chance of getting help on the other list. > Here the corosync.conf used by one cluster, the other one is > just the same provided by the epel r

Re: [Pacemaker] Service failback issue with SLES11 and HAE 11

2010-06-08 Thread Dejan Muhamedagic
Hi, On Tue, Jun 08, 2010 at 10:00:37AM +0800, ben180 wrote: > Dear all, > > There are two nodes in my customer's environment. We installed SuSE > Linux Enterprise Server 11 and HAE on the two node. The cluster is for > oracle database service HA purpose. > We have set clone resource for pingd, an

Re: [Pacemaker] Cluster frozen after "crm resource cleanup"

2010-06-08 Thread Dejan Muhamedagic
Hi, On Tue, Jun 08, 2010 at 05:28:20PM +0200, Stefan Foerster wrote: > This morning, I wanted to do a "cleanup" on a "ping" resource (which > at the time was in a "started" state but had a fail-count of 3. After > that operation, the cluster didn't do any monitor operations and > refused to do any

Re: [Pacemaker] Cluster split brain on vmware VSphere

2010-06-07 Thread Dejan Muhamedagic
Hi, On Mon, Jun 07, 2010 at 01:04:53PM +0200, Torresani, Roberto wrote: > Hi list, > sorry if this is not the right ml or if the question is already answered > somewhere...if this is the case, just point me to the solution please. > > I have two active/passive mysql cluster on vmware vsphere ru

Re: [Pacemaker] cluster got stuck on stopping resources

2010-06-07 Thread Dejan Muhamedagic
Hi, On Mon, Jun 07, 2010 at 12:13:41PM +0200, Andreas Kurz wrote: > Hi all, > > I observed a strange behaviour when trying to stop two resources with latest > pacemaker: > > I updated two resources (ping) and changed some constraints. One of the > changed resources is mentioned in the logs wit

Re: [Pacemaker] Both nodes become master

2010-06-07 Thread Dejan Muhamedagic
Hi, On Mon, Jun 07, 2010 at 11:17:42AM +0200, Jorge Santos Fonseca wrote: > Hi again, > > I have made some tests without bonding, and it appears that all > works perfect. You didn't show ha.cf. Did you try (with bonding) to use unicast? Thanks, Dejan > Regards, > > Jorge > > >
