Re: [Pacemaker] Resource capacity limit

2009-11-12 Thread Yan Gao
Hi Andrew, Lars, Andrew Beekhof wrote: > I'd like to see the while-block from native_color() be a function that > is called from native_assign_node(). It seems to be too late to filter out the nodes without enough capacity from native_assign_node(). I wrote a have_enough_capacity() function which

Re: [Pacemaker] Resource capacity limit

2009-11-12 Thread Steven Dake
On Thu, 2009-11-12 at 14:53 +0100, Andrew Beekhof wrote: > On Wed, Nov 11, 2009 at 1:36 PM, Lars Marowsky-Bree wrote: > > On 2009-11-05T14:45:36, Andrew Beekhof wrote: > > > >> Lastly, I would really like to defer this for 1.2 > >> I know I've bent the rules a bit for 1.0 in the past, but its rea

[Pacemaker] Multiple thread after rebooting server: the node doesn't go online

2009-11-12 Thread Giovanni Di Milia
I set up a cluster of two servers CentOS 5.4 x86_64 with pacemaker 1.06 and corosync 1.1.2 I only installed the x86_64 packages (yum install pacemaker try to install also the 32 bits one). I configured a shared cluster IP (it's a public ip) and a cluster website. Everything work fine if

Re: [Pacemaker] Resource capacity limit

2009-11-12 Thread Lars Marowsky-Bree
On 2009-11-12T14:53:24, Andrew Beekhof wrote: > At this point in time, I can't see us going back to the way heartbeat > releases were done. > If there was a single thing that I'd credit Pacemaker's current > reliability to, it would be our release strategy. Well, exactly, and that's what pacemak

Re: [Pacemaker] ocf:heartbeat:pingd receives wrong value in $__ACTION

2009-11-12 Thread hj lee
On Thu, Nov 12, 2009 at 3:58 AM, Lars Marowsky-Bree wrote: > On 2009-11-11T16:48:36, hj lee wrote: > > > Here rewriting again with correct name: > > > > I am using pacemaker-1.0.5 on CentOS 5.3 with openais cluster stack. I > set > > up ocf:hearbeat:pingd as a clone and enabled monitor operation

[Pacemaker] cLVM from SUSE(pacemaker) to CentOS 5.3( Redhat cluster) ?

2009-11-12 Thread Димитър Бойн
Hi! Lars, Are you aware of the right way to implement DLM/cLVM with Pacemaker but on CentOS? It seems clvmd provided by lvm2-cluster package on RedHat/CentOS expect cman configured cluster. + the Redhat packages miss the ocf:lvm2:clvmd script  Any advise on this is highly appreciated! Thanks

Re: [Pacemaker] resource stickyness

2009-11-12 Thread Bernd Schubert
On Thursday 12 November 2009, Andrew Beekhof wrote: > On Thu, Nov 12, 2009 at 11:54 AM, Bernd Schubert > > wrote: > > Hello, > > > > I try to prevent auto-migration back from mds2 to mds1, but somehow > > resource- stickiness doesn't seem to work. After a failure of mds1 and > > takeover on mds2,

Re: [Pacemaker] Remote Access not Working

2009-11-12 Thread Colin
On Thu, Nov 12, 2009 at 3:36 PM, Andrew Beekhof wrote: > I used it the other day. > > http://www.clusterlabs.org/doc/pacemaker-explained/ch-advanced-options.html#s-remote-connection > > Try setting CIB_encrypted to false. Thanks, that got me a step further... ...but there are still various issue

Re: [Pacemaker] Remote Access not Working

2009-11-12 Thread Andrew Beekhof
I used it the other day. http://www.clusterlabs.org/doc/pacemaker-explained/ch-advanced-options.html#s-remote-connection Try setting CIB_encrypted to false. On Tue, Nov 10, 2009 at 3:54 PM, Colin wrote: > Does anybody else successfully use this feature, or is it suffering > from bit-rot? > > Th

Re: [Pacemaker] resource stickyness

2009-11-12 Thread Andrew Beekhof
On Thu, Nov 12, 2009 at 11:54 AM, Bernd Schubert wrote: > Hello, > > I try to prevent auto-migration back from mds2 to mds1, but somehow resource- > stickiness doesn't seem to work. After a failure of mds1 and takeover on mds2, > it still migrates back to mds1 when this system comes back > > primi

Re: [Pacemaker] drbd83 and pacemaker/openais service stop

2009-11-12 Thread Andrew Beekhof
On Wed, Nov 11, 2009 at 3:17 PM, Testuser SST wrote: > Hi, > > I´m using 2 CentOS 5.4 node with openais/pacemaker. There is a drbd device in > master/slave modus running on it. When I stop the openais-service > (/etc/init.d/openais stop) on the master-node, the slave-node is not able to > beco

Re: [Pacemaker] Resource capacity limit

2009-11-12 Thread Andrew Beekhof
On Wed, Nov 11, 2009 at 1:36 PM, Lars Marowsky-Bree wrote: > On 2009-11-05T14:45:36, Andrew Beekhof wrote: > >> Lastly, I would really like to defer this for 1.2 >> I know I've bent the rules a bit for 1.0 in the past, but its really >> late in the game now. > > Personally, I think the Linux kern

Re: [Pacemaker] RFC: Compacting constraints

2009-11-12 Thread Lars Marowsky-Bree
On 2009-11-12T14:41:49, Andrew Beekhof wrote: > On Wed, Nov 11, 2009 at 3:12 PM, Lars Marowsky-Bree wrote: > > On 2009-11-11T14:46:02, Lars Marowsky-Bree wrote: > > > >> I went through the possible associations here, and I realize that > >> "conjoin" is a rare word in English. In theory though,

Re: [Pacemaker] Resource capacity limit

2009-11-12 Thread Andrew Beekhof
On Wed, Nov 11, 2009 at 1:42 PM, Lars Marowsky-Bree wrote: > On 2009-11-06T12:45:17, Andrew Beekhof wrote: > >> And instead of a limit-utilization option, we'd have >> placement-strategy=(default|utilization|minimal) >> >> Default ::= what we do now >> Utilization ::= what you've implemented > >

Re: [Pacemaker] RFC: Compacting constraints

2009-11-12 Thread Andrew Beekhof
On Wed, Nov 11, 2009 at 3:12 PM, Lars Marowsky-Bree wrote: > On 2009-11-11T14:46:02, Lars Marowsky-Bree wrote: > >> I went through the possible associations here, and I realize that >> "conjoin" is a rare word in English. In theory though, "conjoin" more >> readily implies an order/merge thing go

Re: [Pacemaker] STONITH request failed

2009-11-12 Thread Lars Marowsky-Bree
On 2009-11-12T10:07:00, Maros Timko wrote: > # rpm -qa|egrep 'heart|pacem' > libheartbeat2-2.99.2-6.1 > heartbeat-common-2.99.2-6.1 > heartbeat-resources-2.99.2-6.1 > heartbeat-2.99.2-6.1 > pacemaker-1.0.2-11.1 > libpacemaker3-1.0.2-11.1 Please update to something resembling current versions and

Re: [Pacemaker] ocf:heartbeat:pingd receives wrong value in $__ACTION

2009-11-12 Thread Lars Marowsky-Bree
On 2009-11-11T16:48:36, hj lee wrote: > Here rewriting again with correct name: > > I am using pacemaker-1.0.5 on CentOS 5.3 with openais cluster stack. I set > up ocf:hearbeat:pingd as a clone and enabled monitor operation. The monitor > operation returns $OCF_ERR_UNIMPLEMENTED. I printed $__OC

[Pacemaker] resource stickyness

2009-11-12 Thread Bernd Schubert
Hello, I try to prevent auto-migration back from mds2 to mds1, but somehow resource- stickiness doesn't seem to work. After a failure of mds1 and takeover on mds2, it still migrates back to mds1 when this system comes back primitive MDT_HC3WORK ocf:ddn:lustre_server \ params dev

[Pacemaker] STONITH request failed

2009-11-12 Thread Maros Timko
Hi all, has anybody experienced following. We were simulating the failure of a resource stop with STONITH enabled. The node that failed to stop the resource requested to stonith itself. However, the request was not processed by peer node. It has not logged anything into log file at that time. Inte