[Linux-HA] WARN: unpack_rsc_op:

2007-07-17 Thread Taldevkar, Chetan
Hi all, I am using heartbeat 2.0.8 (two node cluster) and single resource of class 'heartbeat'. My script is configured for start, status and stop calls. When I start cluster lunixha is able to invoke start call on both the nodes. On first node start fails as script calls echo "stopped" fo

[Linux-HA] statement about active-backup mode

2007-07-17 Thread Jan Kalcic
Hi All, in the document http://www.linux-ha.org/IpFailoverChannelBonding a possible concern for HA is indicated as a disagreement with "standby" mode which I can't totally see. There is mentioned that the active-backup mode defines the transmit policy only whereas receiving is always in round rob

Re: [Linux-HA] /var/lib/heartbeat permission all belong to root

2007-07-17 Thread Lars Marowsky-Bree
On 2007-07-17T15:31:05, Brian Reichert <[EMAIL PROTECTED]> wrote: > > Yes, the permissions on this will keep some Heartbeat processes from > > core dumping. Other permission errors can cause other problems. > Couldn't one use setrlimit() to manage RLIMIT_CORE directly? (If > supressing code dump

Re: [Linux-HA] /var/lib/heartbeat permission all belong to root

2007-07-17 Thread Ciro Iriarte
2007/7/16, Lars Marowsky-Bree <[EMAIL PROTECTED]>: On 2007-07-16T16:46:16, Xn Nooby <[EMAIL PROTECTED]> wrote: > Previously I moved my entire /var partition to another drive, copying it as > root (I moved it to a bigger physical partition). That was a mistake, it appears as if you didn't use th

[Linux-HA] Question on lsb status scripts on SuSE

2007-07-17 Thread Andy Kipp
I saw another thread on lsb and status, so I thought I would ask a question that has been bothering me: In my cib, I have a resource configured like so: However when the service fails, I get this error in the syslog and it fails to restart the service. Jul 17 15:48:43 groupwise-2-

Re: [Linux-HA] /var/lib/heartbeat permission all belong to root

2007-07-17 Thread Brian Reichert
On Tue, Jul 17, 2007 at 01:26:19PM -0600, Alan Robertson wrote: > > Should I fix this? I thought maybe its not a problem since Heartbeat seems > > to be otherwise running. > > Yes, the permissions on this will keep some Heartbeat processes from > core dumping. Other permission errors can cause ot

[Linux-HA] Re: [Linux-ha-dev] More thoughts about 2.1.1 - SCHEDULE CHANGE - 23 July, 2007

2007-07-17 Thread Alan Robertson
Alan Robertson wrote: > Hi, > > I have a few things to say here, and a change of mind, which I'll > explain in more detail. And so on... I'm working on it. Tests still look pretty good. Only one problem so far. -- Alan Robertson <[EMAIL PROTECTED]> "Openness is the foundation and preser

Re: [Linux-HA] /var/lib/heartbeat permission all belong to root

2007-07-17 Thread Alan Robertson
Xn Nooby wrote: > I noticed in my /var/log/messages that heartbeat was having a problem > changing in to the cores/nobody directory: > >heartbeat[4895]: ERROR: Cannot chdir to > [/var/lib/heartbeat/cores/nobody]: Permission denied > > > Previously I moved my entire /var partition to another

Re: [Linux-HA] lsb and status

2007-07-17 Thread Lars Marowsky-Bree
On 2007-07-17T10:26:39, Andrés Ghigliazza <[EMAIL PROTECTED]> wrote: > I guess that they can, as in > http://www.linux-ha.org/ClusterInformationBase/Actions there is an > example with resources of class lsb, and monitor operation. Is that > right?. Although the operation is called "monitor", the s

[Linux-HA] Bug on Delay script in HB 2.1.0?

2007-07-17 Thread Sandro Bordacchini
Hello everyone. I'm running a heartbeat v1 cluster with 2.1.0. I'm using a OpenSuse 10.2 (with heartbeat package updated from ha repository). I noticed that Delay script doesn't work correctly. I suppose there is an error on last line: ra_execocf $op;; probably has to be replaced with ra_exe

[Linux-HA] lsb and status

2007-07-17 Thread Andrés Ghigliazza
Hi there, I will soon use linux-ha project in differents clusters, and I would like to know if, LSB scripts can be used with CRM, to monitor the resources, as OCF scripts can. I guess that they can, as in http://www.linux-ha.org/ClusterInformationBase/Actions there is an example with resources o

Re: [Linux-HA] 2.1.1 change in behaviour

2007-07-17 Thread Peter Kruse
Hi, Lars Marowsky-Bree wrote: On 2007-07-17T12:25:48, Peter Kruse <[EMAIL PROTECTED]> wrote: Good question. I assume Andrew has a good explanation when you have the testcase (pe inputs). ;-) done, bug #1648 But, this is safe by definition. true. What problem is this causing for you? I

Re: [Linux-HA] 2.1.1 change in behaviour

2007-07-17 Thread Lars Marowsky-Bree
On 2007-07-17T12:25:48, Peter Kruse <[EMAIL PROTECTED]> wrote: > Hello, > > while testing version 2.1.1. I found a change in behaviour > when a resource in a group failed. If there are resources > a b c d e f in the group G, and e failed this happens: > > 2.0.8: stop f, stop e, start e, start

Re: [Linux-HA] 2.1.1 change in behaviour

2007-07-17 Thread Andrew Beekhof
On 7/17/07, Peter Kruse <[EMAIL PROTECTED]> wrote: Hello, while testing version 2.1.1. I found a change in behaviour when a resource in a group failed. If there are resources a b c d e f in the group G, and e failed this happens: 2.0.8: stop f, stop e, start e, start f 2.1.1: stop f, stop e,

[Linux-HA] 2.1.1 change in behaviour

2007-07-17 Thread Peter Kruse
Hello, while testing version 2.1.1. I found a change in behaviour when a resource in a group failed. If there are resources a b c d e f in the group G, and e failed this happens: 2.0.8: stop f, stop e, start e, start f 2.1.1: stop f, stop e, start a, start b, ..., start f What is the reasoni

[Linux-HA] HA & clustering documentation

2007-07-17 Thread Petteri Hakkarainen
Hi list, What is the best available documentation on setting up and configuring HA and clustering? At the moment I'm interested in (more or less) basic configuration steps, e.g. what needs to be done (and with what tool) to cluster a certain service (DHCP, DNS, web server etc.)? BR, Pete

Re: [Linux-HA] Re: [Linux-ha-dev] Release testing with CTS

2007-07-17 Thread Lars Marowsky-Bree
On 2007-07-16T18:12:28, David Lang <[EMAIL PROTECTED]> wrote: > what I'd suggest is make a _lot_ of virtual machines on one physical > machine and run them at different nice levels. Yeah. > does anyone have some nice small HB configured virtual machines that can be > used rather then having to

Re: [Linux-HA] Re: [Linux-ha-dev] Release testing with CTS

2007-07-17 Thread Lars Marowsky-Bree
On 2007-07-16T16:18:00, David Lang <[EMAIL PROTECTED]> wrote: > I don't think anyone is implying that you are deliberately releaseing buggy > releases, but that's not the same as saying that the releases have all gone > through the same testing. The fact is that those releases have gone through

Re: [Linux-HA] Confusion about MailTo RA and monitoring

2007-07-17 Thread Peter Kruse
Hi, David Lang wrote: there is a second issue with MailTo part of the OCF specs are that it is considered 'safe' to call start or stop multiple times on a RA, with MailTo this will generate multiple e-mails. this isn't a fatal problem, but it is an annoyance (I've had the shutting down on

Re: [Linux-HA] Probably metadata missing

2007-07-17 Thread Andrew Beekhof
On 7/16/07, matilda matilda <[EMAIL PROTECTED]> wrote: Hi all, I'm using HAv2 2.1.0. When I'm doing a 'crm_verify -VV -L' I get the following output: == crm_verify[8659]: 2007/07/16_15:40:14 notice: main: Required feature set: 1.1 c