Re: [Pacemaker] [Partially SOLVED] pacemaker/dlm problems

2011-12-08 Thread Andrew Beekhof
On Fri, Dec 9, 2011 at 3:16 PM, Vladislav Bogdanov wrote: > 09.12.2011 03:11, Andrew Beekhof wrote: >> On Fri, Dec 2, 2011 at 1:32 AM, Vladislav Bogdanov >> wrote: >>> Hi Andrew, >>> >>> I investigated on my test cluster what actually happens with dlm and >>> fencing. >>> >>> I added more debug

Re: [Pacemaker] Postgresql streaming replication failover - RA needed

2011-12-08 Thread Takatoshi MATSUO
Hi Attila 2011/12/8 Attila Megyeri : > Hi Takatoshi, > > One strange thing I noticed and could probably be improved. > When there is data inconsistency, I have the following node properties: > > * Node psql2: >+ default_ping_set : 100 >+ master-postgresql:1 :

Re: [Pacemaker] [Partially SOLVED] pacemaker/dlm problems

2011-12-08 Thread Vladislav Bogdanov
09.12.2011 03:11, Andrew Beekhof wrote: > On Fri, Dec 2, 2011 at 1:32 AM, Vladislav Bogdanov > wrote: >> Hi Andrew, >> >> I investigated on my test cluster what actually happens with dlm and >> fencing. >> >> I added more debug messages to dlm dump, and also did a re-kick of nodes >> after some t

Re: [Pacemaker] [Partially SOLVED] pacemaker/dlm problems

2011-12-08 Thread Vladislav Bogdanov
09.12.2011 03:15, Andrew Beekhof wrote: > On Thu, Nov 24, 2011 at 6:21 PM, Vladislav Bogdanov > wrote: >> 24.11.2011 08:49, Andrew Beekhof wrote: >>> On Thu, Nov 24, 2011 at 3:58 PM, Vladislav Bogdanov >>> wrote: 24.11.2011 07:33, Andrew Beekhof wrote: > On Tue, Nov 15, 2011 at 7:36 AM,

Re: [Pacemaker] [Partially SOLVED] pacemaker/dlm problems

2011-12-08 Thread Nick Khamis
It can't. Nothing will work at that point. Not even a simple ls. Reboot! Nick. On Thu, Dec 8, 2011 at 7:15 PM, Andrew Beekhof wrote: > On Thu, Nov 24, 2011 at 6:21 PM, Vladislav Bogdanov > wrote: >> 24.11.2011 08:49, Andrew Beekhof wrote: >>> On Thu, Nov 24, 2011 at 3:58 PM, Vladislav Bogdanov

Re: [Pacemaker] [Partially SOLVED] pacemaker/dlm problems

2011-12-08 Thread Andrew Beekhof
On Thu, Nov 24, 2011 at 6:21 PM, Vladislav Bogdanov wrote: > 24.11.2011 08:49, Andrew Beekhof wrote: >> On Thu, Nov 24, 2011 at 3:58 PM, Vladislav Bogdanov >> wrote: >>> 24.11.2011 07:33, Andrew Beekhof wrote: On Tue, Nov 15, 2011 at 7:36 AM, Vladislav Bogdanov wrote: > Hi Andrew,

Re: [Pacemaker] [Partially SOLVED] pacemaker/dlm problems

2011-12-08 Thread Andrew Beekhof
On Fri, Dec 2, 2011 at 1:32 AM, Vladislav Bogdanov wrote: > Hi Andrew, > > I investigated on my test cluster what actually happens with dlm and > fencing. > > I added more debug messages to dlm dump, and also did a re-kick of nodes > after some time. > > Results are that stonith history actually d

Re: [Pacemaker] CMAN - Pacemaker - Porftpd setup

2011-12-08 Thread Andrew Beekhof
On Wed, Dec 7, 2011 at 9:49 AM, Florian Haas wrote: > On Tue, Dec 6, 2011 at 3:47 PM, Bensch, Kobus > wrote: >> 2.) I pasted the outcome here http://pastebin.com/uPcHiM4p > > So, you should be seeing lines akin to the following in your logs: > > ERROR: clone_rsc_colocation_rh: Cannot interleave c

Re: [Pacemaker] Accessing GFS2 SAN drive, without Pacemaker?

2011-12-08 Thread Andrew Beekhof
On Fri, Dec 9, 2011 at 10:11 AM, Charles DeVoe wrote: > We have a three node cluster that we are going to run dedicated services > on each box. That is one will be used for analysis, one for data > collection, one for mysql. We need to be able to access data on a shared > SAN drive using iSCSI.

Re: [Pacemaker] faq / howto needed for cib troubleshooting

2011-12-08 Thread Andrew Beekhof
On Fri, Nov 25, 2011 at 8:44 AM, Attila Megyeri wrote: > Hi Gents, > > I see from time to time that you are asking for "cibadmin -Ql" type outputs > to help people troubleshoot their problems. > > Currenty I have an issue promoting a MS resource (the PSQL issue in the > previous mail) - and I wo

Re: [Pacemaker] Excessive migrate_from is run after migrate_to failed

2011-12-08 Thread Andrew Beekhof
On Thu, Dec 1, 2011 at 9:30 PM, Vladislav Bogdanov wrote: > Hi Andrew, all, > > I found that pacemaker runs migrate_from on a migration destination node > even if preceding migrate_to command failed (github master). > > Is it intentional? I think so, but I can see that its not a good idea in all

[Pacemaker] Accessing GFS2 SAN drive, without Pacemaker?

2011-12-08 Thread Charles DeVoe
We have a three node cluster that we are going to run dedicated services on each box.  That is one will be used for analysis, one for data collection, one for mysql.   We need to be able to access data on a shared SAN drive using iSCSI.  2 nodes are running Fedora 16 and 1 node on Fedora 14.  Th

Re: [Pacemaker] colocation issue with master-slave resources

2011-12-08 Thread Andrew Beekhof
On Tue, Nov 29, 2011 at 10:10 AM, Patrick H. wrote: > Upgraded to 1.1.6 and put in an ordering constraint, still no joy. Could you file a bug and include a crm_report for this please? > > # crm status > > Last updated: Mon Nov 28 23:09:37 2011 > Last change: Mon Nov 28 23:08:34 2011

Re: [Pacemaker] Make IP master

2011-12-08 Thread Andrew Beekhof
On Thu, Dec 8, 2011 at 6:34 AM, Charles DeVoe wrote: > We are attempting to st up the cluster such that a user will be logged > into the least busy node via ssh. The configuration and crm_mon results > are included here. Is it possible to set this up such that doing an ssh to > the cluster IP wi

Re: [Pacemaker] don't want to restart clone resource

2011-12-08 Thread Andrew Beekhof
Can you file a bug and attach a crm_report to it please? Unfortunately there's not enough information here to figure out the cause (although it does look like a bug) 2011/12/1 Sha Fanghao : > Hi, > > > > I have a cluster 3 nodes (CentOS 5.2) using pacemaker-1.0.11(also 1.0.12), > with heartbeat-3.

Re: [Pacemaker] are stopped resources monitored?

2011-12-08 Thread Andrew Beekhof
On Wed, Nov 30, 2011 at 1:26 PM, James Harper wrote: >> > >> > That thread goes around in circles and completely contradicts what > I'm >> > seeing. What I'm seeing is that unmanaged resources are never > monitored. >> >> would be strange and how do you verify this? A look at your config may > als

Re: [Pacemaker] (no subject)

2011-12-08 Thread Charles DeVoe
Oh good, the infamous system settingGreat,  I always love chasing these things down  Thanks for the help --- On Wed, 12/7/11, Andrew Beekhof wrote: From: Andrew Beekhof Subject: Re: [Pacemaker] (no subject) To: "The Pacemaker cluster resource manager" Date: Wednesday, December 7, 2011

Re: [Pacemaker] Postgresql streaming replication failover - RA needed

2011-12-08 Thread Attila Megyeri
Hi Takatoshi, One strange thing I noticed and could probably be improved. When there is data inconsistency, I have the following node properties: * Node psql2: + default_ping_set : 100 + master-postgresql:1 : -INFINITY + pgsql-data-status