Re: [Pacemaker] pacemaker/dlm problems

2011-09-26 Thread Andrew Beekhof
On Mon, Sep 26, 2011 at 6:41 PM, Vladislav Bogdanov wrote: > 26.09.2011 11:16, Andrew Beekhof wrote: > [snip] >>> Regardless, for 1.1.6 the dlm would be better off making a call like:           rc = st->cmds->fence(st, st_opts, target, "reboot", 120); from fencing/adm

Re: [Pacemaker] Query regarding component failover

2011-09-26 Thread Andrew Beekhof
On Tue, Sep 27, 2011 at 1:25 PM, Bhagwan Sahai wrote: > Hello, >       There are two copies of service 1 running in master slave mode > in the cluster. > > > Regards > Bhagwan Sahai > > On Mon, Sep 26, 2011 at 9:12 AM, Andrew Beekhof wrote: >> On Fri, Sep 9, 2011 at 3:08 PM, Bhagwan Sahai >> wr

Re: [Pacemaker] Query regarding component failover

2011-09-26 Thread Bhagwan Sahai
Hello, There are two copies of service 1 running in master slave mode in the cluster. Regards Bhagwan Sahai On Mon, Sep 26, 2011 at 9:12 AM, Andrew Beekhof wrote: > On Fri, Sep 9, 2011 at 3:08 PM, Bhagwan Sahai wrote: >> Hi, >>     I have a HA requirement wherein I need to have followin

Re: [Pacemaker] resource moving unnecessarily due to ping race condition

2011-09-26 Thread Andrew Beekhof
On Mon, Sep 26, 2011 at 10:57 PM, Brad Johnson wrote: > I agree that the patch assumes the use of "pingd" for the attribute name, > and there may be a better way of coding that. However, I don't see how > setting dampen=0 fixes our problem. The problem occurs when a ping node > becomes inaccessib

Re: [Pacemaker] Pacemaker 1.0 and compiler optimization

2011-09-26 Thread Andrew Beekhof
On Tue, Sep 27, 2011 at 1:47 AM, Rainer Weikusat wrote: > Is there a specific reason why compiler optimization is disabled (line > 1323 in configure.ac, '-ggdb3 -O0') when building pacemaker? Something we inherited from heartbeat I guess - possibly so that stack traces were sane. Not sure we shou

[Pacemaker] Pacemaker 1.0 and compiler optimization

2011-09-26 Thread Rainer Weikusat
Is there a specific reason why compiler optimization is disabled (line 1323 in configure.ac, '-ggdb3 -O0') when building pacemaker? ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project H

Re: [Pacemaker] resource moving unnecessarily due to ping race condition

2011-09-26 Thread Brad Johnson
I agree that the patch assumes the use of "pingd" for the attribute name, and there may be a better way of coding that. However, I don't see how setting dampen=0 fixes our problem. The problem occurs when a ping node becomes inaccessible to all nodes in the cluster (it is rebooted for example).

[Pacemaker] Master won't get promoted

2011-09-26 Thread Charles Richard
Hi, I'm making some headway finally with my pacemaker install but now that crm_mon doesn't return errors any more and crm_verify is clear, I'm having a problem where my master won't get promoted. Not sure what to do with this one, any suggestions? Here's the log snippet and config files: Sep 2

Re: [Pacemaker] Call cib_modify failed (-22): The object/attribute does not exist

2011-09-26 Thread Brian J. Murrell
On 11-09-25 09:21 PM, Andrew Beekhof wrote: > > As the error says, the resource R_10.10.10.101 doesn't exist yet. > Put it in a tag or use -C instead of -U Thanks much. I already replied to Tim, but the summary is that the manpage is incorrect in two places. One is specifying the attributes ta

Re: [Pacemaker] Call cib_modify failed (-22): The object/attribute does not exist

2011-09-26 Thread Brian J. Murrell
On 11-09-26 03:44 AM, Tim Serong wrote: > > Because: > > 1) You need to run "cibadmin -o resources -C -x test.xml" to create the >resource (-C creates, -U updates an existing resource). That's what I thought/wondered but the EXAMPLES section in the manpage is quite clear that it's asking one

[Pacemaker] FW: [Linux-HA] Install problems with ha resource

2011-09-26 Thread Andreas Mock
Hi all, can anybody help me with that? Who has the knowledge to correct the variable paths in the scripts? Best regards Andreas Mock > -Original Message- > From: linux-ha-boun...@lists.linux-ha.org > [mailto:linux-ha-boun...@lists.linux-ha.org] On Behalf Of Andreas Mock > Sent: Sunday

Re: [Pacemaker] When a resource starts all at once in environment using utilization, score may not work

2011-09-26 Thread Gao,Yan
On 09/26/11 11:51, Andrew Beekhof wrote: > 2011/8/18 Yuusuke IIDA : >> Hi, Yan >> Hi, Andrew >> >> I started service by the following setting all at once. >> However, the placement of the resource was not carried out as expected. >> >> * By a function of utilization, I confine the resource that can

Re: [Pacemaker] pacemaker/dlm problems

2011-09-26 Thread Vladislav Bogdanov
26.09.2011 11:16, Andrew Beekhof wrote: [snip] >> >>> >>> Regardless, for 1.1.6 the dlm would be better off making a call like: >>> >>> rc = st->cmds->fence(st, st_opts, target, "reboot", 120); >>> >>> from fencing/admin.c >>> >>> That would talk directly to the fencing daemon, bypassing

Re: [Pacemaker] [Problem]Time-out(action lost) of completed monitor occurs.

2011-09-26 Thread renayama19661014
Hi Andrew, Thank you for comment. > Which still appears to be down :-( > Do you have the tarball still? I may not be the completely same as the contents which I attached for Bugzilla. I send log and pe-file again. * 1655.tar.gz * https://skydrive.live.com/?cid=3a14d57622c66876&id=3A14D57622C

Re: [Pacemaker] pacemaker/dlm problems

2011-09-26 Thread Andrew Beekhof
On Mon, Sep 26, 2011 at 5:38 PM, Vladislav Bogdanov wrote: > Hi Andrew, > > 26.09.2011 10:10, Andrew Beekhof wrote: >> On Tue, Sep 6, 2011 at 5:27 PM, Vladislav Bogdanov >> wrote: >>> Hi Andrew, hi all, >>> >>> I'm further investigating dlm lockspace hangs I described in >>> https://www.redhat.c

Re: [Pacemaker] Call cib_modify failed (-22): The object/attribute does not exist

2011-09-26 Thread Tim Serong
On 25/09/11 01:16, Brian J. Murrell wrote: Using pacemaker-1.0.10-1.4.el5 I am trying to add the "R_10.10.10.101" IPaddr2 example resource: from the cibadmin manpage under EXAMPLES and getting: # cibadmin -o resources -U -x test.xml Call cib_modify failed (-22): The o

Re: [Pacemaker] pacemaker/dlm problems

2011-09-26 Thread Vladislav Bogdanov
Hi Andrew, 26.09.2011 10:10, Andrew Beekhof wrote: > On Tue, Sep 6, 2011 at 5:27 PM, Vladislav Bogdanov > wrote: >> Hi Andrew, hi all, >> >> I'm further investigating dlm lockspace hangs I described in >> https://www.redhat.com/archives/cluster-devel/2011-August/msg00133.html >> and in the threa

Re: [Pacemaker] Pacemaker on CentOS 5.6 & CentOS 6.0

2011-09-26 Thread Andrew Beekhof
Based on the versions, this should work. Could you provide logs from the 4 nodes? On Wed, Sep 7, 2011 at 10:40 PM, sdr(friedrich reichhart) wrote: > Hi Andrew! > > CentOs 5 > > > [root@inet-storage01 ~]# yum list pacemaker corosync cluster-glue > Loaded plugins: fastestmirror > Loading m

Re: [Pacemaker] [Problem]Time-out(action lost) of completed monitor occurs.

2011-09-26 Thread Andrew Beekhof
On Tue, Sep 6, 2011 at 12:53 PM, wrote: > Hi All, > > We came across a mysterious phenomenon on a test of the drbd environment. > > It is the following procedure. > > Step 1) Start two nodes. > > Step 2) Cause the hang of the kernel in an active node. > > Step 3) In a standby node, the cancellati

Re: [Pacemaker] When a disk becomes to the full, memory leak happens in pengine of the DC.

2011-09-26 Thread Andrew Beekhof
On Tue, Sep 20, 2011 at 12:31 PM, Yuusuke IIDA wrote: > Hi, Andrew > > When the disk utilization of the DC node became 100%, I found the phenomenon > that memory was used in large quantities by pengine. > > When pengine fails in the output of the pe-input file, this memory consumption > seems to h

Re: [Pacemaker] pacemaker/dlm problems

2011-09-26 Thread Andrew Beekhof
On Tue, Sep 6, 2011 at 5:27 PM, Vladislav Bogdanov wrote: > Hi Andrew, hi all, > > I'm further investigating dlm lockspace hangs I described in > https://www.redhat.com/archives/cluster-devel/2011-August/msg00133.html > and in the thread starting from > https://lists.linux-foundation.org/pipermail