Re: [Pacemaker] "stonith_admin -F node" results in a pair of reboots

2014-01-07 Thread Fabio M. Di Nitto
On 1/6/2014 6:24 PM, Bob Haxo wrote: > Hi Fabio, > >>> There is an example on how to configure gfs2 also in the rhel6.5 >>> pacemaker documentation, using pcs. > > Super! Please share the link to this documentation. I only discovered > the gfs2+pcs example with the rhel7 beta docs. You are rig

[Pacemaker] Howto check if the current node is active?

2014-01-07 Thread Bauer, Stefan (IZLBW Extern)
Hi Folks! How can i check if the current node i'm connected to is the active? It should be parseable because i want to use it in a script. We want to use this to trigger a cronjob on the active. Stefan ___ Pacemaker mailing list: Pacemaker@oss.clusterl

Re: [Pacemaker] Howto check if the current node is active?

2014-01-07 Thread Lars Marowsky-Bree
On 2014-01-07T12:33:01, "Bauer, Stefan (IZLBW Extern)" wrote: > Hi Folks! > > How can i check if the current node i'm connected to is the active? > It should be parseable because i want to use it in a script. What do you mean with "active"? And on what platform? "systemctl status pacemaker" w

[Pacemaker] starting resources with failed stonith resource

2014-01-07 Thread Frank Van Damme
Hi list, I recently had some trouble with a dual-node mysql cluster, which runs in master-slave mode with Percona resource manager. While analyzing what happened to the cluster, I found this in syslog (network trouble, the cluster lost disk/iscsi access on both nodes, this is a piece from the form

Re: [Pacemaker] "stonith_admin -F node" results in a pair of reboots

2014-01-07 Thread Bob Haxo
Hi Fabio, > the gfs2 example was not You are forgiven ... and you are light years ahead of me. I have folks after my docs, and have not had time to convert notes to docs. Top of my to-do list after this task. I have been fumbling getting a combination of "chkconfig blah on" and Pacemake

Re: [Pacemaker] Howto check if the current node is active?

2014-01-07 Thread Jacek Konieczny
On 2014-01-07 13:33, Bauer, Stefan (IZLBW Extern) wrote: > How can i check if the current node i’m connected to is the active? > > It should be parseable because i want to use it in a script. Pacemaker is not limited to Active-Passive setups, in fact it has no notion of 'Active' node – every node

Re: [Pacemaker] Howto check if the current node is active?

2014-01-07 Thread Jorge Armando Medina
We used this in a postgres cluster # Run drbdadm verify every saturday 3 am 00 03 * * 06 root [ $(/usr/sbin/crm_resource -Q -r g_postgres -W ) = $(hostname) ] && /sbin/drbdadm verify all I hope you can use it. Best regards. - Mensaje original - De: "Stefan Bauer (IZLBW Extern)"

Re: [Pacemaker] Howto check if the current node is active?

2014-01-07 Thread Michael Schwartzkopff
Am Dienstag, 7. Januar 2014, 12:33:01 schrieb Bauer, Stefan: > Hi Folks! > > How can i check if the current node i'm connected to is the active? > It should be parseable because i want to use it in a script. > > We want to use this to trigger a cronjob on the active. > > Stefan Use a monitoring

Re: [Pacemaker] CentOS 6.5 Pacemaker Oracle Active/Failover cluster setup on SAN

2014-01-07 Thread Andrew Beekhof
On 6 Jan 2014, at 4:15 pm, Pui Edylie wrote: > Good Day members, > > I am wondering if anyone has set this up successfully? > > I noticed that there is a lack of Oracle script to initiate this. > > I would willing to pay someone for this effort and hopefully we could create > a howto to bene

Re: [Pacemaker] Time to get ready for 1.1.11

2014-01-07 Thread David Vossel
- Original Message - > From: "Andrew Beekhof" > To: "The Pacemaker cluster resource manager" > Sent: Thursday, December 19, 2013 2:25:00 PM > Subject: Re: [Pacemaker] Time to get ready for 1.1.11 > > > On 20 Dec 2013, at 2:11 am, Andrew Martin wrote: > > > David/Andrew, > > > > Once

Re: [Pacemaker] Manual fence confirmation by stonith_admin doesn't work again.

2014-01-07 Thread Andrew Beekhof
On 19 Dec 2013, at 6:54 pm, Nikita Staroverov wrote: > >> Please see: >> >> https://access.redhat.com/site/articles/36302 >> >> If you don't have an account, the relevant part is: >> >> "Usage of fence_manual is not supported in any production cluster. You may >> use this fence agent for de

Re: [Pacemaker] lrmd segfault at pacemaker 1.1.11-rc1

2014-01-07 Thread Andrew Beekhof
On 18 Dec 2013, at 9:50 pm, Kazunori INOUE wrote: > Hi David, > > 2013/12/18 David Vossel : >> >> That's a really weird one... I don't see how it is possible for op->id to be >> NULL there. You might need to give valgrind a shot to detect whatever is >> really going on here. >> >> -- Voss

Re: [Pacemaker] reboot of non-vm host results in VM restart -- of chickens and eggs and VMs

2014-01-07 Thread Andrew Beekhof
On 20 Dec 2013, at 5:30 am, Bob Haxo wrote: > Hello, > > Earlier emails related to this topic: > [pacemaker] chicken-egg-problem with libvirtd and a VM within cluster > [pacemaker] VirtualDomain problem after reboot of one node > > > My configuration: > > RHEL6.5/CMAN/gfs2/Pacemaker/crmsh >

Re: [Pacemaker] Minor buffer overflow..

2014-01-07 Thread Andrew Beekhof
On 5 Dec 2013, at 3:20 pm, Rob Thomas wrote: > I was idly wondering why the SMTP and SNMP modules were disabled by > default on the RHEL builds, and was in the middle of writing a shell > script to duplicate them when I noticed there was a tiny buffer > overflow in crm_mon. > > This may be why

Re: [Pacemaker] Minor buffer overflow..

2014-01-07 Thread Rob Thomas
>> https://github.com/xrobau/pacemaker/commit/b1515e3f83fceeac951de8823d718bdf13e4a093 > > Can you make a pull request for that? Not only can I read your mind and make a pull request for it, the pull request has already been pulled! https://github.com/ClusterLabs/pacemaker/commit/9663a1b1d5fd1d35

Re: [Pacemaker] Starting Pacemaker Cluster Manager [FAILED]

2014-01-07 Thread Andrew Beekhof
On 21 Nov 2013, at 9:56 pm, Miha wrote: > HI, > > how can i delete/reset all config, so that I could do again: "pcs cluster destroy" on all nodes looks about right > > 'pcs cluster setup mycluster pcmk-1 pcmk-2' and begin again at the beginning? > tnx! > > p.s.: bellowe is a log > > Nov 2

Re: [Pacemaker] some questions about STONITH

2014-01-07 Thread Andrew Beekhof
On 26 Nov 2013, at 12:39 am, Andrey Groshev wrote: >> ...snip... >>> Make next test: >>> #stonith_admin --reboot=dev-cluster2-node2 >>> Node reboot, but resource don't start. >>> In crm_mon status - Node dev-cluster2-node2 (172793105): pending. >>> And it will be hung. >> >> That is *proba

Re: [Pacemaker] disable migration after the faicount

2014-01-07 Thread Andrew Beekhof
On 26 Nov 2013, at 11:55 pm, ESWAR RAO wrote: > Hi All, > > Even I tried with meta allow-migrate="false", but still the resource is > migrating to another node. I think you mean "move" - aka. "stopped here and started there". In which case you need to use a location constraint (with score=-IN

Re: [Pacemaker] Breaking dependency loop && stonith

2014-01-07 Thread Andrew Beekhof
On 27 Nov 2013, at 12:26 am, Andrey Groshev wrote: > Hi, ALL. > > I want to clarify two more questions. > After stonith reboot - this node hangs with status "pending". > The logs found string . > >info: rsc_merge_weights:pgsql:1: Breaking dependency loop at > msPostgresql >inf

Re: [Pacemaker] Weird behavior of PCS command while defining DRBD resources

2014-01-07 Thread Andrew Beekhof
On 27 Nov 2013, at 10:21 pm, Muhammad Kamran Azeem wrote: > Apologies for double post. In my initial post, I forgot to set the subject > properly. > > > Hello List, > > I am new here. I worked with Linux HA during 2006-2008, went in HPC > direction, and came back to HA a month ago. Realize

Re: [Pacemaker] hangs pending

2014-01-07 Thread Andrew Beekhof
On 29 Nov 2013, at 7:17 pm, Andrey Groshev wrote: > Hi, ALL. > > I'm still trying to cope with the fact that after the fence - node hangs in > "pending". Please define "pending". Where did you see this? > At this time, there are constant re-election. > Also, I noticed the difference when yo

Re: [Pacemaker] prevent starting resources on failed node

2014-01-07 Thread Andrew Beekhof
On 7 Dec 2013, at 2:17 am, Brian J. Murrell (brian) wrote: > [ Hopefully this doesn't cause a duplicate post but my first attempt > returned an error. ] > > Using pacemaker 1.1.10 (but I think this issue is more general than that > release), I want to enforce a policy that once a node fails, n

Re: [Pacemaker] error: send_cpg_message: Sending message via cpg FAILED: (rc=6) Try again

2014-01-07 Thread Andrew Beekhof
What version of pacemaker? There were some improvements to how we handle sending messages via CPG recently. On 10 Dec 2013, at 4:40 am, Brian J. Murrell wrote: > On Mon, 2013-12-09 at 09:28 +0100, Jan Friesse wrote: >> >> Error 6 error means "try again". This is happening ether if corosync is >

Re: [Pacemaker] Reg. trigger when node failure occurs

2014-01-07 Thread Andrew Beekhof
On 11 Dec 2013, at 3:45 pm, ESWAR RAO wrote: > Hi Micheal, > > I am configuring the ClusterMon as below on the 3 node setup: > I am following > http://floriancrouzat.net/2013/01/monitor-a-pacemaker-cluster-with-ocfpacemakerclustermon-andor-external-agent/ > > # crm configure primitive Cluster

Re: [Pacemaker] host came online but it was ignored

2014-01-07 Thread Andrew Beekhof
On 18 Dec 2013, at 4:23 pm, ESWAR RAO wrote: > Hi All, > > Can someone help me how to narrow down the problem?? I'd probably start with an upgrade. There were some membership issues around about the time of 1.1.7, but they may have been corosync specific (I don't really test pacemaker with he

Re: [Pacemaker] reboot of non-vm host results in VM restart -- of chickens and eggs and VMs

2014-01-07 Thread Bob Haxo
Hi Andrew, With configuration fumble, err, test, that brought about this "of chickens and eggs and VMs" request, the situation is that the reboot of the non-host server results in the restart of the VM running on the host server. >From earlier [Pacemaker] thread: > From: Tom Fernandes > Subje

Re: [Pacemaker] Question about node-action-limit and migration-limit

2014-01-07 Thread Andrew Beekhof
On 18 Dec 2013, at 9:51 pm, Kazunori INOUE wrote: > Hi, > > When I set only migration-limit without setting node-action-limit in > pacemaker-1.1, > the number of 'operation' other than migrate_to/from was limited to > the value of migration-limit. > (The node that I used has 8 cores.) > > [cib