Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread Pavlos Parissis
On 8 October 2010 08:29, Andrew Beekhof wrote: > On Thu, Oct 7, 2010 at 9:58 PM, Pavlos Parissis > wrote: >> >> >> On 7 October 2010 09:01, Andrew Beekhof wrote: >>> >>> On Sat, Oct 2, 2010 at 6:31 PM, Pavlos Parissis >>> wrote: >>> > Hi, >>> > >>> > I am having again the same issue, in a diffe

Re: [Pacemaker] pacemaker version

2010-10-07 Thread Pavlos Parissis
On 8 October 2010 07:47, Andrew Beekhof wrote: > On Thu, Oct 7, 2010 at 10:10 PM, Pavlos Parissis > wrote: >> On 7 October 2010 08:33, Andrew Beekhof wrote: >>> >>> On Wed, Oct 6, 2010 at 5:04 PM, Gianluca Cecchi >>> wrote: >>> > On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra >>> > wrote: >>>

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 9:58 PM, Pavlos Parissis wrote: > > > On 7 October 2010 09:01, Andrew Beekhof wrote: >> >> On Sat, Oct 2, 2010 at 6:31 PM, Pavlos Parissis >> wrote: >> > Hi, >> > >> > I am having again the same issue, in a different set of 3 nodes. When I >> > try >> > to failover manuall

Re: [Pacemaker] pacemaker version

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 10:10 PM, Pavlos Parissis wrote: > On 7 October 2010 08:33, Andrew Beekhof wrote: >> >> On Wed, Oct 6, 2010 at 5:04 PM, Gianluca Cecchi >> wrote: >> > On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra >> > wrote: >> >> That is what I heard too, that's the reason for this qu

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread Pavlos Parissis
On 8 October 2010 04:26, jiaju liu wrote: > Message: 2 > Date: Thu, 7 Oct 2010 21:58:29 +0200 > From: Pavlos Parissis > http://cn.mc157.mail.yahoo.com/mc/compose?to=pavlos.paris...@gmail.com> > > > To: The Pacemaker cluster resource manager > > http://cn.mc157.mail.yahoo.com/mc/compose?to=p

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread jiaju liu
lcount 1.1 or 1.2 branch? -- next part -- An HTML attachment was scrubbed... URL: <http://oss.clusterlabs.org/pipermail/pacemaker/attachments/20101007/ce6d0b4e/attachment-0001.htm> ___ Pacemaker mailing list: Pa

Re: [Pacemaker] [Problem]The monitor that start-delay is long does not stop.

2010-10-07 Thread renayama19661014
Hi Andrew, Thank you for comment. > Funnily enough I was just looking at that message and saw that the > code relevant to this one looked wrong too. > > I believe this should fix the issue: >http://hg.clusterlabs.org/pacemaker/1.1/rev/e06810256413 > > > > > I registered log and more with Bu

[Pacemaker] stonith pacemaker problem

2010-10-07 Thread Shravan Mishra
Hi, Description of my environment: corosync=1.2.8 pacemaker=1.1.3 Linux= 2.6.29.6-0.6.smp.gcc4.1.x86_64 #1 SMP We are having a problem with our pacemaker which is continuously canceling the monitoring operation of our stonith devices. We ran: stonith -d -t external/safe/ipmi hostname=

Re: [Pacemaker] pacemaker version

2010-10-07 Thread Pavlos Parissis
On 7 October 2010 08:33, Andrew Beekhof wrote: > > On Wed, Oct 6, 2010 at 5:04 PM, Gianluca Cecchi > wrote: > > On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra > > wrote: > >> That is what I heard too, that's the reason for this question. > >> > > > > On June, inside a complex thread regarding "

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread Pavlos Parissis
On 7 October 2010 09:01, Andrew Beekhof wrote: > On Sat, Oct 2, 2010 at 6:31 PM, Pavlos Parissis > wrote: > > Hi, > > > > I am having again the same issue, in a different set of 3 nodes. When I > try > > to failover manually the resource group on the standby node, the ms-drbd > > resource is not

Re: [Pacemaker] how to test network access and fail over accordingly?

2010-10-07 Thread Craig Hurley
Yesterday, the last few emails between Vadym and I were inadvertently not posted to this list. Here are those posts for anyone having similar issues. Regards, Craig. On 7 October 2010 15:20, Vadym Chepkov wrote: > no, default is 0 - it is not taken into consideration at all. > Resource stays in

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 6:06 PM, Ron Kerry wrote: > On 10/7/2010 8:00 AM, Andrew Beekhof wrote: >> >> On Thu, Oct 7, 2010 at 11:13 AM, Dejan Muhamedagic >> wrote: >>  > On Thu, Oct 07, 2010 at 09:49:05AM +0200, Andrew Beekhof wrote: >>  >> On Tue, Oct 5, 2010 at 1:50 PM, Dejan Muhamedagic >> wrot

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Ron Kerry
On 10/7/2010 8:00 AM, Andrew Beekhof wrote: On Thu, Oct 7, 2010 at 11:13 AM, Dejan Muhamedagic wrote: > On Thu, Oct 07, 2010 at 09:49:05AM +0200, Andrew Beekhof wrote: >> On Tue, Oct 5, 2010 at 1:50 PM, Dejan Muhamedagic wrote: >> > Hi, >> > >> > On Tue, Oct 05, 2010 at 11:18:37AM +0200,

Re: [Pacemaker] stonith resource issue

2010-10-07 Thread Dejan Muhamedagic
Hi, On Wed, Oct 06, 2010 at 01:32:06PM -0400, Shravan Mishra wrote: > Please fine hb_report. hb_report couldn't find the logs, probably because you have both syslog and to file logging. Anyway, it could be that stuff such as external/safe/ipmi cannot work, i.e. that you can't create subdirectorie

Re: [Pacemaker] starting a xen-domU depending on available hardware-resources using SysInfo-RA

2010-10-07 Thread Dejan Muhamedagic
Hi, On Thu, Sep 30, 2010 at 08:52:16AM -0400, Vadym Chepkov wrote: > > On Sep 30, 2010, at 2:35 AM, Sascha Reimann wrote: > > > Hi Dejan, > > > > it's working fine with the amount of free ram as the score and a bigger > > default-resource-stickiness: > > > > primitive v01 ocf:heartbeat:Xen \

Re: [Pacemaker] Monitor ops do not get cancelled

2010-10-07 Thread Dejan Muhamedagic
On Thu, Sep 30, 2010 at 08:09:38AM +0200, Andrew Beekhof wrote: > On Tue, Sep 28, 2010 at 2:55 PM, Phil Armstrong wrote: > >> From Andrew Beekof > >> 1.1.3 came out the other day. > >> which distro are you using? > > > > I'm not sure if this answers your question: > > > > novell/sles/updates/SLE11

Re: [Pacemaker] Problem with log level

2010-10-07 Thread Eberhard Kümmerle
I have solved the problem. There was an error in corosync.conf in a section before the logging section so that the logging section was'nt interpreted correctly. Thank you! -

Re: [Pacemaker] About behavior in "Action Lost".

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 11:48 AM, Keisuke MORI wrote: > Andrew, > > 2010/9/23 Andrew Beekhof : >> Pushed as: >>   http://hg.clusterlabs.org/pacemaker/1.1/rev/8433015faf18 >> >> Not sure about applying to 1.0 though, its a dramatic change in behavior. > > I would like to backport this to 1.0. > Woul

Re: [Pacemaker] About behavior in "Action Lost".

2010-10-07 Thread Keisuke MORI
Andrew, 2010/9/23 Andrew Beekhof : > Pushed as: >   http://hg.clusterlabs.org/pacemaker/1.1/rev/8433015faf18 > > Not sure about applying to 1.0 though, its a dramatic change in behavior. I would like to backport this to 1.0. Would you agree with this? Without this the failed node was not fenced

Re: [Pacemaker] Backports from 1.1 to 1.0

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 10:55 AM, Raoul Bhatia [IPAX] wrote: > hi all, > > do you have any further information, eta, repository, etc. in > regard of the backported patches from 1.1 to 1.0? I saw a bunch go into stable-1.0 the other day, so I think they're done. I just need to find some time to do

Re: [Pacemaker] Can somebody please explain pengine's urge to move all resources?

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 11:02 AM, Raoul Bhatia [IPAX] wrote: > On 10/06/2010 11:16 AM, Keisuke MORI wrote: >> This should have been fix with this: >> http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/5fe02f48c47b >> >> The patch has been already backported to the 1.0 repository and will >> be incl

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 11:13 AM, Dejan Muhamedagic wrote: > On Thu, Oct 07, 2010 at 09:49:05AM +0200, Andrew Beekhof wrote: >> On Tue, Oct 5, 2010 at 1:50 PM, Dejan Muhamedagic >> wrote: >> > Hi, >> > >> > On Tue, Oct 05, 2010 at 11:18:37AM +0200, Andrew Beekhof wrote: >> >> Dejan: looks like so

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Dejan Muhamedagic
On Thu, Oct 07, 2010 at 09:49:05AM +0200, Andrew Beekhof wrote: > On Tue, Oct 5, 2010 at 1:50 PM, Dejan Muhamedagic wrote: > > Hi, > > > > On Tue, Oct 05, 2010 at 11:18:37AM +0200, Andrew Beekhof wrote: > >> Dejan: looks like something in the lrm library. > >> Any idea why the message doesn't cont

Re: [Pacemaker] Can somebody please explain pengine's urge to move all resources?

2010-10-07 Thread Raoul Bhatia [IPAX]
On 10/06/2010 11:16 AM, Keisuke MORI wrote: > This should have been fix with this: > http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/5fe02f48c47b > > The patch has been already backported to the 1.0 repository and will > be included in 1.0.10. > Will you test with the tip of 1.0 repository if y

[Pacemaker] Backports from 1.1 to 1.0

2010-10-07 Thread Raoul Bhatia [IPAX]
hi all, do you have any further information, eta, repository, etc. in regard of the backported patches from 1.1 to 1.0? i would be very interested in tracking them :) thanks, raoul -- DI (FH) Raoul Bhatia M.Sc. email.

Re: [Pacemaker] "Election Timeout" and node became the "Pending" state.

2010-10-07 Thread Andrew Beekhof
On Tue, Oct 5, 2010 at 6:44 AM, wrote: > Hi, > > We tested complicated node trouble. > > An error of "Election Timeout" occurred then. > >  * Pacemaker:pacemaker-1.0.9.1 >  * heartbeat-3.0.3-2.3.el5 >  * cluster-glue:cluster-glue-1.0.6-1.6.el5 >  * resource-agents-1.0.3-1.0.dev.b7a3b1973ba7 > > W

Re: [Pacemaker] ActiveMQ on pacemaker

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 12:01 AM, Ivo Rodrigues wrote: > Hello guys, > > I'm trying to make activeMQ working on pacemaker (master/slave) with DRBD > for the kahaDB. This way, if a node goes down the second will step up. > > I created a symbolic link for activemq start script on etc/init.d/ and > de

Re: [Pacemaker] [Problem]The monitor that start-delay is long does not stop.

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 8:39 AM, wrote: > Hi, > > I operated the next to confirm the contribution of the mailing list. > >  * http://www.gossamer-threads.com/lists/linuxha/pacemaker/66939 > > > Step1) I prepare cib.xml having monitor which set start-delay than five > minutes.. > Step2) I start tw

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Andrew Beekhof
On Tue, Oct 5, 2010 at 1:50 PM, Dejan Muhamedagic wrote: > Hi, > > On Tue, Oct 05, 2010 at 11:18:37AM +0200, Andrew Beekhof wrote: >> Dejan: looks like something in the lrm library. >> Any idea why the message doesn't contain lrm_opstatus? > > Becase this monitor operation never run. Which seems t

Re: [Pacemaker] syslog-ng as resource / how to make sure it gets restarted

2010-10-07 Thread Andrew Beekhof
On Fri, Oct 1, 2010 at 9:41 AM, Koch, Sebastian wrote: > Hi Andrew, > > > > thanks for your answer. I still need syslog-ng to restart on all nodes after > the ClusterIp moved. I tried it like this: > > > > > > > > Resource: > > primitive res_SyslogNG lsb:syslog-ng \ > >     op monitor interval

Re: [Pacemaker] Problem with log level

2010-10-07 Thread Andrew Beekhof
Could you look for "CRM Hg Version:" in the logs please? Perhaps the logging macro was broken in that version. Strange. On Tue, Oct 5, 2010 at 1:51 PM, Eberhard Kuemmerle wrote: > Hi, > > I use pacemaker 1.1.2.1 + corosync 1.2.1 (on openSuse 11.3). > > Logging is configured in corosync.conf as f

Re: [Pacemaker] Patch for slow remote connections

2010-10-07 Thread Andrew Beekhof
Applied: http://hg.clusterlabs.org/pacemaker/1.1/rev/675c88f7546a Thanks to you both. 2010/10/4 Ante Karamatić : > Hi > > This patch solves slow responses from remote nodes. Author is Al Stone > (in CC); he's not on the list as far as I know. > > I've tested the patch and it does do the trick. Wit

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread Andrew Beekhof
On Sat, Oct 2, 2010 at 6:31 PM, Pavlos Parissis wrote: > Hi, > > I am having again the same issue, in a different set of 3 nodes. When I try > to failover manually the resource group on the standby node, the ms-drbd > resource is not moved as well and as a result the resource group is not > fully