Re: [Linux-HA] heartbeat-2.1.2-29.1 for Debian Etch amd64

2008-03-21 Thread Raoul Bhatia [IPAX]
hi, why do you need a 2.1.2 build, when this release is known to be buggy and linux-ha 2.1.3 is out? please refer to: * http://www.gossamer-threads.com/lists/linuxha/dev/44755#44755 * http://www.linux-ha.net/DownloadSoftware and * http://download.opensuse.org/repositories/server:/ha-clustering/

Re: [Linux-HA] Enhanced version of showscores and a major update on the score calculation documentation

2008-03-12 Thread Raoul Bhatia [IPAX]
Dominik Klein wrote: http://www.linux-ha.org/ScoreCalculation If I'm not running into some nasty cache problem, then this page has not been updated yet. Use http://wiki.linux-ha.org/ScoreCalculation instead until it gets updated. added two links: http://wiki.linux-ha.org/ScoreCalculation

Re: [Linux-HA] Master/Slave problems

2008-02-26 Thread Raoul Bhatia [IPAX]
Adrian Chapela wrote: [snip] timeout="19s" prereq="nothing"/> I think that this line: name="monitor" interval="20s" timeout="19s" prereq="nothing"/> is the line to config monitoring operations and the time to

Re: AW: [Linux-HA] Some problems with monitoring

2008-02-22 Thread Raoul Bhatia [IPAX]
hi, actually, i never tested in thoroughly. as i enabled it, the cluster (i then had only one node up) reconfigured something and restarted some resources. but then, everything was ok. i noticed some "new" drbd log messages coming every couple of seconds, which i guess are because of the tiny mo

Re: [Linux-HA] Some problems with monitoring

2008-02-22 Thread Raoul Bhatia [IPAX]
Adrian Chapela wrote: It's cancelled because you have a monitor configured only for the demoted/slave state; you need one with role="master" (and a slightly different interval) as well. Can you explain a bit more ?? I dont understand this very well. I will re-read the docs some more... but I

Re: [Linux-HA] OCFS2 on HB 2.1.3 v2

2008-02-19 Thread Raoul Bhatia [IPAX]
Michael Brennen wrote: Can someone give pointers to integrating ocfs2 with heartbeat? The idea is to run ocfs2 as the cluster file system on the real servers running on an iscsi failover backend cluster. Apparently some userspace patches are required to ocfs2 to let hb manage it, but I think

Re: [Linux-HA] MySql OCF

2008-01-29 Thread Raoul Bhatia [IPAX]
hi, On Tue, 2008-01-29 at 11:21 +, prashant wrote: > I want to use failover with mysql server on multi configuration. i do not know for sure but i don't think you can use a mysql_multi configuration with the current OCF file. please look at $OCF_ROOT/resource.d/heartbeat/ (e.g. /usr/lib/ocf/

Re: [Linux-HA] Hint for usage of "updated" OCF resource agent ldirectord

2008-01-18 Thread Raoul Bhatia [IPAX]
Hello Andrea,s On Fri, 2008-01-18 at 01:09 +0100, Andreas Mock wrote: > So, now, if you upgrade to the version in HAv2 2.1.3 and have > no config-file-definition in your current cib, the wrong path > is taken. WARNING: You have to provide the path explicitly > in your cib or change the position o

Re: [Linux-HA] Syntax error on IPaddr2?

2008-01-17 Thread Raoul Bhatia [IPAX]
Hi, On Mon, 2008-01-14 at 12:55 +0100, Yves Schumann wrote: > > there you go: http://hg.linux-ha.org/dev/ > ... > > feel free to edit the contents - it is all wiki based: > > http://wiki.linux-ha.org/ > > > > The changelog should be located at > > http://wiki.linux-ha.org/download/ChangeLog > >

Re: [Linux-HA] Syntax error on IPaddr2?

2008-01-14 Thread Raoul Bhatia [IPAX]
On Mon, 2008-01-14 at 08:54 +0100, Yves Schumann wrote: *snip* > Youre funny! You should try to search for "changelog" on the LinuxHA page. > You will get a link [1] but that page is not existing like many others. I > think it is not very useful if one must download the full package only to > hav

Re: [Linux-HA] Depend on a Ressource that runs on more than one Host (Building a High Available Mailserver)

2008-01-12 Thread Raoul Bhatia [IPAX]
On Sat, 2008-01-12 at 10:25 +0100, Thomas Glanzmann wrote: > Hello, > I have the following scenario: > > - I have two machines (mail-01 and mail-02) > > - Both should run postfix local (supervised by linux-ha) > > - I have a floating ip address that is on mail-01 or on m

Re: [Linux-HA] debian and heartbeat

2008-01-10 Thread Raoul Bhatia [IPAX]
On Thu, 2008-01-10 at 10:21 +0100, Thomas Glanzmann wrote: > Hello, > > > honestly, i would not use this repository for my upgrades as - at > > least in the past - major changes have been introduced during the > > heartbeat 2.1.3 development. for example the constraints were heavily > > modified.

Re: [Linux-HA] debian and heartbeat

2008-01-10 Thread Raoul Bhatia [IPAX]
On Thu, 2008-01-10 at 11:03 +0100, Andrew Beekhof wrote: > ... > > as the development continues, and crm *might* be > > past tense - *has* been > > > extracted from this > > build and will be modified to work with other ha solutions like > > OpanAIS, > > things might get very compilicated if y

Re: [Linux-HA] debian and heartbeat

2008-01-10 Thread Raoul Bhatia [IPAX]
hello thomas, On Thu, 2008-01-10 at 08:24 +0100, Thomas Glanzmann wrote: > I read the manpage. I am still looking for a line that I can put in my > /etc/apt/sources.list . Does someone has such a line? Does someone use > that repository? If that is the case could that one be so kind to post > simp

Re: [Linux-HA] debian and heartbeat

2008-01-09 Thread Raoul Bhatia [IPAX]
hi, On Wed, 2008-01-09 at 13:12 +0100, Michael Schwartzkopff wrote: > Hi, > > In order to install the latest heartbeat I did download all packages from > http://www.ultramonkey.org/download/heartbeat/2.1.3/ > > to /root/debs/ > > and added > > deb file:/root debs/ > > to my sources.list and

Re: [Linux-HA] hb2: making xml manageable

2008-01-08 Thread Raoul Bhatia [IPAX]
On Tue, 2008-01-08 at 09:38 +0100, Jergen Dutch wrote: > "This can be really annoying and can easily result in system > administrators[3] making mistakes either in the middle of the night, > or when under pressure" wrong - use crm_verify to verify your xml against the dtd. e.g. crm_verify -V

Re: [Linux-HA] High load through lrmd and SIGCHLD-warnings

2008-01-03 Thread Raoul Bhatia [IPAX]
hi, i think lrmd had some performance issues in 2.1.2 - i suggest upgrading to 2.1.3 - thou please revisit your configuration as some constraints - especially those related to groups/master-slave/etc. - might break. cheers, raoul Marian Neubert wrote: Hi list, using Heartbeat 2.1.2, there is

Re: [Linux-HA] How to prevent unexpected unmount (stop) of ordered clone Filesystem resource on alive node

2007-12-29 Thread Raoul Bhatia [IPAX]
Takekazu Okamoto wrote: In addition, following change was made. Filesystem RA node2:/usr/lib/ocf/resource.d/heartbeat # diff -u Filesystem.org Filesystem --- Filesystem.org 2007-12-29 20:17:38.0 +0900 +++ Filesystem 2007-12-29 20:18:06.0 +0900 @@ -477,7 +477,7 @@

Re: [Linux-HA] colocation not respected?

2007-12-26 Thread Raoul Bhatia [IPAX]
Rodrigo Pereira wrote: Hi, I have a running cluster, with a couple of Xen VM's on top of DRBD. I have colocation constraints for each Xen VM resource so they only start where the DRBD fs is mounted. Something like: I created another Xen VM, and tried to add it to the cluster. I have all reso

Re: [Linux-HA] Prevent switch if redundanz comes up

2007-12-21 Thread Raoul Bhatia [IPAX]
at my place a similar primitive/constraint configuration is working. differences i spot at my system: mybe the default-resource-stickiness set to INFINITY causes the problem? cheers, raoul -- DI (FH) Raoul Bhatia M.Sc.

Re: [Linux-HA] Prevent switch if redundanz comes up

2007-12-21 Thread Raoul Bhatia [IPAX]
Yves Schumann wrote: Hi there, after a lot of trial and error my heartbeat V2 setup ist working. I have two nodes in a active/passive configuration, lets say "Master" and "Redundance". I've configured a resouce group with some services. If an error occoures, the group switches to "Redundance" as

Re: [Linux-HA] DRBD V8 and Heartbeat V2

2007-12-21 Thread Raoul Bhatia [IPAX]
Terry Hull wrote: I saw the docs on linux-ha.org for using Heartbeat V2 with DRBD 7 in a master / slave arrangement. It also says the configuration is not for version 8 of DRBD. I have a few questions about this: 1) Is there a document that describes doing the same thing with DRBD 8? not t

Re: [Linux-HA] DRBD Config

2007-12-21 Thread Raoul Bhatia [IPAX]
Dominik Klein wrote: Dec 20 12:57:49 mylogin1 drbd[7119]: [7131]: DEBUG: : Calling /sbin/drbdadm -c /etc/drbd.conf state Dec 20 12:57:49 mylogin1 drbd[7119]: [7134]: DEBUG: : Exit code 0 can you c/p what you get when you issue /sbin/drbdadm -c /etc/drbd.conf state by hand? That's a syntax

Re: [Linux-HA] DRBD Config

2007-12-21 Thread Raoul Bhatia [IPAX]
Jochen Lienhard wrote: we are using a two node cluster master/slave with an openSuSE 10.3, heartbeat 2.0.7 and drbd 8.0.6. I tried the configuration from this webpage: http://www.linux-ha.org/DRBD/HowTov2 Dec 20 12:57:49 mylogin1 drbd[7119]: [7131]: DEBUG: : Calling /sbin/drbdadm -c /etc/drbd

Re: [Linux-HA] (no subject)

2007-12-17 Thread Raoul Bhatia [IPAX]
Dejan Muhamedagic wrote: Hi, On Thu, Dec 13, 2007 at 01:33:20PM -0600, David Hostetler wrote: I am looking to provide some failover for my machines, however in order for everything to work correctly the hostname has to be the same on both machines. This is built into our software and cannot be

Re: [Linux-HA] My Xen is too old for your Xen RA!

2007-12-13 Thread Raoul Bhatia [IPAX]
Rodrigo Pereira wrote: I had to change a line on the Xen_Status() function to make it work, otherwise it would not detect status correctly. The line is: echo "${STATUS}" | grep -qs "[r--][-b-][--p]---" I initially escaped the first two "--", did the trick. Then decided to play safer with egrep

Re: R: [Linux-HA] Which version should I go for? 2.1.2 or 2.0.8?

2007-11-28 Thread Raoul Bhatia [IPAX]
Francesco Cristofori wrote: No. 2.0.8 was a disaster of a release - please dont use it. I'm using debian stable and I found only 2.0.7 available. Should I use the 2.1.2 package from testing? Is it more stable than 2.0.7? please use the current interim build for etch, which is available under

Re: [Linux-HA] hb_report heartbeat reporting utility

2007-10-18 Thread Raoul Bhatia [IPAX]
Dejan Muhamedagic wrote: this is not working as webcluster01 refer to the load-balanced ip - chich seems to be a bad thing ;) Well, nodes should definitely have static IP addresses. they have - not webcluster01 but wc1 and wc2. and webcluster01 and webcluster02 are dynamicaly managed by heart

Re: [Linux-HA] DRBD v8

2007-10-18 Thread Raoul Bhatia [IPAX]
Michael Schwartzkopff wrote: Hi, in the HOWTO of Linux-HA2 is mentioned htat the ocf RA of drbd does not support version 8 of drbd. Is that still true? as far as i know, yes :) Is it just not supported / tested? mainly it needs to be tested (by someone who really knows drbd8) and the ocf

Re: [Linux-HA] patch for mysql ocf script

2007-10-17 Thread Raoul Bhatia [IPAX]
Chun Tian (binghe) wrote: mysql ocf script also check the OCF_RESKEY_test_user in /etc/passwd: grep $OCF_RESKEY_test_user /etc/passwd >/dev/null 2>&1 if [ ! $? -eq 0 ]; then ocf_log err "Test user $OCF_RESKEY_test_user doesn't exit"; exit $OCF_ERR_ARGS; fi I dont's t

Re: [Linux-HA] patch for mysql ocf script

2007-10-16 Thread Raoul Bhatia [IPAX]
Raoul Bhatia [IPAX] wrote: do i have to understand why we use OCF_RESKEY_config in one line, and OCF_RESKEY_mysql_config on the next? so i guess its a typo then - i will post a patch to linux-ha-dev. cheers, raoul bhatia

Re: [Linux-HA] patch for mysql ocf script

2007-10-16 Thread Raoul Bhatia [IPAX]
Lars Marowsky-Bree wrote: On 2007-10-15T18:09:04, "Raoul Bhatia [IPAX]" <[EMAIL PROTECTED]> wrote: der dejan, thank you for applying the patches. and thank you for mentioning my name ;) i have another - hopefully for all - handy patch for mysql. it adds "additional_pa

Re: [Linux-HA] Question regarding Pure-FTPd OCF Script

2007-10-16 Thread Raoul Bhatia [IPAX]
Dejan Muhamedagic wrote: On Tue, Oct 16, 2007 at 01:34:40PM +0200, Dejan Muhamedagic wrote: The only way I can see is to introduce, say, a dist variable (OCF_RESKEY_dist) and then put if-else (or case) statements where appropriate: This is not a very good advice. Though not probable, it is po

[Linux-HA] hb_report heartbeat reporting utility

2007-10-16 Thread Raoul Bhatia [IPAX]
i today tried your tool for the first time. i encountered some problems: WARN: ssh does not work to all nodes cp: missing destination file operand after `/tmp/report/webcluster02/' Try `cp --help' for more information. /usr/sbin/hb_report: line 320: 28406 Aborted crm_diff -c -n $

Re: [Linux-HA] patch for mysql ocf script

2007-10-15 Thread Raoul Bhatia [IPAX]
Lars Marowsky-Bree wrote: thanks. Also merged. great! You might consider posting these patches to the dev list though ;-) ok, ok - yet another mailinglist to subscribe to ;) cheers, raoul -- DI (FH) Raoul Bhatia M.Sc.

[Linux-HA] Question regarding Pure-FTPd OCF Script

2007-10-15 Thread Raoul Bhatia [IPAX]
Dear list, on line 117 in ocf::heartbeat::Pure-FTPd, pure-ftpd is started via $OCF_RESKEY_script $OCF_RESKEY_conffile -g $PIDFILE on debian etch, pure-ftpd is started by the wrapper "/usr/sbin/pure-ftpd-wrapper" which reads a different configuration structure from /etc/pure-ftpd/ and expects

Re: [Linux-HA] patch for mysql ocf script

2007-10-15 Thread Raoul Bhatia [IPAX]
on, 15 Oct 2007 10:07:38 +0200, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote: > Hi, > > On Sun, Oct 14, 2007 at 03:41:19PM +0200, Raoul Bhatia [IPAX] wrote: >> hi, >> >> may i propose a patch for the mysql ocf script which should help all >> people who do

Re: [Linux-HA] Heartbeat Shutdown issues

2007-10-15 Thread Raoul Bhatia [IPAX]
for the records: cs 8760aed1fccc [1] adresses the shutdown issue. lets see if this was the only reason or if there are further problems ;) thank you andrew! [1] http://hg.linux-ha.org/dev/rev/8760aed1fccc -- DI (FH) Raoul Bhati

Re: [Linux-HA] lrmd: G_SIG_dispatch ... dispatch function took to long

2007-10-15 Thread Raoul Bhatia [IPAX]
On Mon, 15 Oct 2007 12:14:30 +0200, Lars Marowsky-Bree <[EMAIL PROTECTED]> wrote: > On 2007-10-14T17:23:28, "Raoul Bhatia [IPAX]" <[EMAIL PROTECTED]> wrote: > >> for all of you who cannot wait for new interim builds, you can always >> get a special and/o

Re: [Linux-HA] Linksys Port Forwarding Failover Issue

2007-10-15 Thread Raoul Bhatia [IPAX]
dear darren, i do not know about the Linksys BEFW11S4 routers but i (and others) have head good experiences with the followin (wlan) routers, flashed with openwrt or dd-wrt, which basically is linux with admin interfaces. - linksys wrt64gl - Buffalo WHR-G54S or WHR-HP-G54 - Asus WL-HDD or WL-50

Re: [Linux-HA] lrmd: G_SIG_dispatch ... dispatch function took to long

2007-10-14 Thread Raoul Bhatia [IPAX]
On Sun, 14 Oct 2007 17:16:20 +0200, "Andrew Beekhof" <[EMAIL PROTECTED]> wrote: > On 10/12/07, matilda matilda <[EMAIL PROTECTED]> wrote: >> Andrew! When are you building the next interim release? >> I would be one of the consumers. ;-) > ... > > I have a recurring item in my calendar that goes o

[Linux-HA] Clone colocation fixes (11328:c2183a2caa71)

2007-10-14 Thread Raoul Bhatia [IPAX]
hello andrew, thank you very much for the 11328:c2183a2caa71 changeset. this resolved a number of problems i encountered during the last week and made my setup working again. :) i really appreciate your work - especially your quick replies and fixes. cheers, raoul bhatia -- __

[Linux-HA] patch for mysql ocf script

2007-10-14 Thread Raoul Bhatia [IPAX]
hi, may i propose a patch for the mysql ocf script which should help all people who do not use /etc/passwd and /etc/group for their user accounts (e.g. ldap). i replaced the greps with "getent". on linux, getent ships with glibc. from what i see there is a getent available on freebsd [1], open

Re: [Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-11 Thread Raoul Bhatia [IPAX]
Lars Marowsky-Bree wrote: On 2007-10-10T23:04:31, "Raoul Bhatia [IPAX]" <[EMAIL PROTECTED]> wrote: as i do not know the old states, and am not that familiar with OCF scripts yet, i am not sure how to contribute for the drbd OCF script. i would appreciate any hints on this matt

Re: [Linux-HA] Heartbeat Shutdown issues

2007-10-11 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: On 10/10/07, Raoul Bhatia [IPAX] <[EMAIL PROTECTED]> wrote: hi, every now and then i encounter shutdown issues with heartbeat. right now for example: crmd[23092]: 2007/10/10_22:38:41 info: do_state_transition: (Re)Issuing shutdown request now that we are the D

Re: [Linux-HA] Re: [Linux-ha-announce] Announcing HA/DR Educational Blog

2007-10-11 Thread Raoul Bhatia [IPAX]
Maxim Doucet wrote: You can find the blog here: http://techthoughts.typepad.com/managing_computers/ "Bravo" for such a good initiative! I will follow it with attention. ill to will follow the blog! thanks, raoul bhatia -- __

Re: [Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-10 Thread Raoul Bhatia [IPAX]
Lars Marowsky-Bree wrote: On 2007-10-09T20:53:16, "Raoul Bhatia [IPAX]" <[EMAIL PROTECTED]> wrote: lrmd[21310]: 2007/10/09_20:09:13 info: rsc:ocfs2_www:0: start Filesystem[4261]: 2007/10/09_20:09:13 INFO: Running start for /dev/drbd0 on /data/www Filesystem[4261]:

Re: [Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-10 Thread Raoul Bhatia [IPAX]
Lars Marowsky-Bree wrote: On 2007-10-09T20:09:51, "Raoul Bhatia [IPAX]" <[EMAIL PROTECTED]> wrote: i have a drbd clone: ... You're using drbd8 with the OCF RA for drbd. That will likely not work well in all cases. yes, i too noticed that. i don

Re: [Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-10 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: On 10/10/07, Andrew Beekhof <[EMAIL PROTECTED]> wrote: On 10/10/07, Andrew Beekhof <[EMAIL PROTECTED]> wrote: On 10/9/07, Raoul Bhatia [IPAX] <[EMAIL PROTECTED]> wrote: hi, during my tests with clustered ips+drbd+ocfs2+services, i encountered a strang

Re: [Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-10 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: you're much better off starting a new topic for unrelated issues i've asked lmb to take a look at this since he wrote the ocfs2 extensions for the Filesystem RA On 10/9/07, Raoul Bhatia [IPAX] <[EMAIL PROTECTED]> wrote: another strange thing i encountered

Re: [Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-10 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: ok, the PE now does the correct thing for your input... the trick now is to make it pass all the regression tests as well done http://hg.beekhof.net/lha/crm-dev/rev/0c14cfe57dd9 thank you for your quick fixes - they are very much appreciated! ill take a look if i can

Re: AW: [Linux-HA] Resource Restart Problem

2007-10-09 Thread Raoul Bhatia [IPAX]
Otte, Joerg (NSN - DE/Muenich) wrote: I know, no problem if we were on Linux. But on Solaris it is not easy. When we started the project 2.0.9 was the latest and it had many advantages over 2.0.8. But under Solaris it does not work out of the box (at least 2.0.8 and 2.0.9 didn't). It took about 6

Re: [Linux-HA] (drbd) master/slave monitoring operations

2007-10-09 Thread Raoul Bhatia [IPAX]
Dejan Muhamedagic wrote: moreover, you said that you attached the manual as an html file, but i could not find it in the archives. First time around, it was dropped for some reason (beat me). Then it went through, but I'll send it again. if you resend the manual, i will retry it the next time

Re: [Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-09 Thread Raoul Bhatia [IPAX]
another strange thing i encountered with this setup right now: i issued "crm_resource -C -r ocfs2_www:1 -H webcluster01" and for a very short period of time both ocfs2 clone instances have been active. then heartbeat stopped one of them, which unmounted the ocfs2 filesystem, and it took some tim

[Linux-HA] heartbeat ignoring my rsc_order directive

2007-10-09 Thread Raoul Bhatia [IPAX]
hi, during my tests with clustered ips+drbd+ocfs2+services, i encountered a strange problem. i have a drbd clone: ... with the rsc_order/rsc_colocation rules: using "ptest -L -VV 2>&1|grep -i order|cut -d " " -f 3-|grep ocfs" i find that the ordering is: de

Re: [Linux-HA] Japanese WEB site of linux-ha.org

2007-10-09 Thread Raoul Bhatia [IPAX]
Takayuki Tanaka wrote: We opened Japanese WEB site of "linux-ha.org" today. http://linux-ha.org/ja/HomePage_ja Enjoy!! wow - looks like an impressive work to my european eyes :) keep it up! cheers, raoul bhatia --

Re: [Linux-HA] resource ordering

2007-10-09 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: I'd suggest: http://oss.beekhof.net/~beekhof/heartbeat/docs/Ordering-Explained.pdf i allready tried to understand that document, but - at least for me - its a little hard to understand how things actually work inside the calculation process :) hmmm can you suggest what

Re: [Linux-HA] (drbd) master/slave monitoring operations

2007-10-09 Thread Raoul Bhatia [IPAX]
Dejan Muhamedagic wrote: Hi, On Tue, Oct 09, 2007 at 11:09:54AM +0200, Raoul Bhatia [IPAX] wrote: Raoul Bhatia [IPAX] wrote: my logfiles show: pengine[27942]: 2007/10/09_10:59:01 WARN: process_pe_message: Transition 840: WARNINGs found during PE processing. PEngine Input stored in: /var/lib

Re: [Linux-HA] Interacting with the passive node in a 2-node cluster

2007-10-09 Thread Raoul Bhatia [IPAX]
Radu Handorean wrote: Hello, The system has 2 nodes: NodeA (active) and NodeB (passive). I want to design a scheme for updating the software and it seems that updating the passive node, forcing a failoverm, and updating the now passive (formerly active) node should work (actually seems like t

[Linux-HA] Discussion about Ordering and Colocating of Heartbeat 2.1.2-4+ (Ordering-Explained.pdf)

2007-10-09 Thread Raoul Bhatia [IPAX]
hi, during this "discussion" we will refer to the slieds found at http://oss.beekhof.net/~beekhof/heartbeat/docs/Colocation-Explained.html and/or http://oss.beekhof.net/~beekhof/heartbeat/docs/Colocation-Explained.pdf please feel free to post your questions, suggestions, etc. "below" this thread

Re: [Linux-HA] Fisrt Time HA

2007-10-09 Thread Raoul Bhatia [IPAX]
hi, Alejandro Rios Peña wrote: Sorry to write you off-list, but I'm having troubles whit a hb v2 style cib.xml config whit DRBD OCF agents, and I would really appreciate if you could show me your config to see if I can spot my errors. as far as i know, the relevant parts are... for the drbd

Re: [Linux-HA] (drbd) master/slave monitoring operations

2007-10-09 Thread Raoul Bhatia [IPAX]
Raoul Bhatia [IPAX] wrote: my logfiles show: pengine[27942]: 2007/10/09_10:59:01 WARN: process_pe_message: Transition 840: WARNINGs found during PE processing. PEngine Input stored in: /var/lib/heartbeat/pengine/pe-warn-1886.bz2 pengine[27942]: 2007/10/09_10:59:01 WARN: native_color: Resource

[Linux-HA] (drbd) master/slave monitoring operations

2007-10-09 Thread Raoul Bhatia [IPAX]
hi, as discussed a couple of days ago, monitoring actions do not "happen" by themselves. moreover, i learnd, that one has to specify seperate monitoring actions for different roles. now my questions are: 1) What is the difference between role="Slave" and role="Started"? 2) Why does my heartb

Re: [Linux-HA] resource ordering

2007-10-09 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: I'd suggest: http://oss.beekhof.net/~beekhof/heartbeat/docs/Ordering-Explained.pdf i allready tried to understand that document, but - at least for me - its a little hard to understand how things actually work inside the calculation process :) it seems i got that back

Re: [Linux-HA] resource ordering

2007-10-09 Thread Raoul Bhatia [IPAX]
Raoul Bhatia [IPAX] wrote: ok, so i have got my rsc_order, for starting the filesystem after the drbd device, as below. type="after" action="promote" from="ms_drbd_mysql" /> but this does not work! the ptest output reads: > ... ptest[5279]: 2007/10/09_10:

[Linux-HA] resource ordering

2007-10-09 Thread Raoul Bhatia [IPAX]
hello andrew, hello all, i am once more confused about the inner working of rsc_order. in all emails and in all docs (e.g. on http://hg.linux-ha.org/dev/file/tip/crm/crm-1.0.dtd) one can read: rsc_ordering > Read as: to_action to type action from ok, so i have got my rsc_order, for

Re: [Linux-HA] Fisrt Time HA

2007-10-08 Thread Raoul Bhatia [IPAX]
pfu, just read throu your email with all the information, just to conclude that i cannot help that much as i'm using the new v2 style configuraiton (which is cib.xml and ocf ressource agents) perhaps theres somebody else who can help you with your problem. if you try to use the xml based configu

Re: [Linux-HA] Resource in master state - no monitor operation

2007-10-02 Thread Raoul Bhatia [IPAX]
Phil Manuel wrote: Hi, I don't have an explicit master/slave configuration, just a preferred node. The thread looked to me as if it was referring to an explicit master/slave configuration. Thanks Phil. but you are talking about a promoted ressource = a ressource in mastert/primary state.

Re: [Linux-HA] Resource in master state - no monitor operation

2007-10-01 Thread Raoul Bhatia [IPAX]
Assaf N wrote: Hello, I started a small test cluster using heartbeat 2.1.1. The cluster contains one simple master/slave resource. While playing around with this cluster, I've noticed that whenever the resource is promoted to be the master on a machine, Heartbeat stops calling its monitor op

Re: [Linux-HA] IPaddr2

2007-10-01 Thread Raoul Bhatia [IPAX]
Matt Zagrabelny wrote: I am getting the following warning in my log files. lrmd[10793]: 2007/10/01_16:27:41 info: RA output: (internal_VIP:stop:stderr) Warning: Executing wildcard deletion to stay compatible with old scripts. Explicitly specify the prefix length (192.168.115.33/32) to a

Re: [Linux-HA] multistate resource - master resource not monitored

2007-09-28 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: you have to add another one with role="Master", eg. is this a recommended thing to add to any cib.xml? might this be related to another thread of mine regarding ressources (drbd) sometimes failing to become master/primrary? cheers, raoul --

[Linux-HA] Re: [Linux-ha-dev] [ANNOUNCE] Interim heartbeat packages refreshed (2.1.2-4)

2007-09-28 Thread Raoul Bhatia [IPAX]
hello horms, the file i got is displayed corrupt. i guess something with the headers messed up: --LZvS9be/3tNcYl/X Content-Type: text/plain; charset=utf-8 Content-Disposition: attachment; filename="linux-ha.testlog.bz2" Content-Transfer-Encoding: quoted-printable for all of those who have a p

Re: [Linux-HA] 6 node cluster? Am I doing this right?

2007-09-26 Thread Raoul Bhatia [IPAX]
Dejan Muhamedagic wrote: Hi, On Wed, Sep 26, 2007 at 07:23:43AM -0500, Dave Augustus wrote: On Wed, 2007-09-26 at 00:33 +0200, Dejan Muhamedagic wrote: Hi, On Tue, Sep 25, 2007 at 05:04:16PM -0500, Dave Augustus wrote: 6 servers- all running heartbeat 1 is LVS loadbalancer the other 5 are l

[Linux-HA] reconfiguring network interfaces causes split brain

2007-09-26 Thread Raoul Bhatia [IPAX]
hello, ill try to keep things short so please do not consider it rude: 2 (debian 4.0) nodes: eth0 = external; eth1 = hb channel the cluster has been in the state: Current DC: webcluster02 (917954cd-0285-4fcd-9cd2-671736c4de66) 2 Nodes configured. > ... Node: webcluster01 (49e81295-8e2f-4aeb

[Linux-HA] Re: Release Policy

2007-09-25 Thread Raoul Bhatia [IPAX]
hello max, thank you for your detailed feedback. it is much appreciated! hopefully i'll be able to follow your suggestions ;) cheers, raoul bhatia -- DI (FH) Raoul Bhatia M.Sc. email. [EMAIL PROTECTED] Techni

Release Policy (was: Re: [Linux-HA] 2.1.2 and failover of colocated resources)

2007-09-19 Thread Raoul Bhatia [IPAX]
hi, Andrew Beekhof wrote: > shortly I'll be releasing a revised implementation (including > documentation!) of colocation which will make it much more intuitive > and remove the need for hacks like symmetrical=true > > if anyone wants to try it sooner rather than later, grab the latest > from htt

Re: [Linux-HA] Question regarding primitives and groups inside a clone

2007-09-19 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: i need the complete cib - including the status section cibadmin -Ql > cib.xml ok, sorry about that. it is attached. cheers, raoul bhatia -- DI (FH) Raoul Bhatia M.Sc. email. [EMAIL PRO

Re: [Linux-HA] Question regarding primitives and groups inside a clone

2007-09-19 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: Clone Set: clone_ocfs2_www ocfs2_www:0 (heartbeat::ocf:Filesystem):Stopped ocfs2_www:1 (heartbeat::ocf:Filesystem):Started webcluster01 when using a group instead of the primitive, things do not work as i think they should. the result is, that the group is s

Re: [Linux-HA] HB2 with DRBD (master_slave) and OCFS2 (clone)

2007-09-18 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: perhaps one should update the dtd to include your simple explanation? "I recently added this to the DTD..." i took a look at http://hg.linux-ha.org/dev/file/tip/crm/crm-1.0.dtd but there is no such information - or at least i am unable to find it -- _

Re: [Linux-HA] HB2 with DRBD (master_slave) and OCFS2 (clone)

2007-09-18 Thread Raoul Bhatia [IPAX]
Andrew Beekhof wrote: btw. looks to be the wrong way around. I recently added this to the DTD... Read as: to_action to type action from Which in your case is: start apache2_clone before start clone_ocfs2_www exploring the dtd i find: > * type : Should the action on from occur

Re: [Linux-HA] Starting and Stopping LSB resources and adding Nodesusing CIB.

2007-09-17 Thread Raoul Bhatia [IPAX]
Chad Osmond wrote: What is the correct process for starting an LSB resource, or having CIB recheck a resource to see if it's been started when it's a LSB resource. I have seen LSB resources shown as not running, when they are indeed running. >> from cib.xml or a lsb scripts point of view? F

Re: [Linux-HA] enabling another account to use cibadmin -Q

2007-09-17 Thread Raoul Bhatia [IPAX]
Doug Knight wrote: Since the scripts are automated (i.e. running without a tty), I cannot use the /etc/sudoers file (which I have working as a command line execution). may i correct you on that. you can write something like: # User alias specification User_Alias UNPRIV = raoul # Cmnd alia

Re: [Linux-HA] enabling another account to use cibadmin -Q

2007-09-17 Thread Raoul Bhatia [IPAX]
Doug Knight wrote: Just now, I set something up very similar to this. I put: User_Alias UNPRIV = dknight Cmnd_Alias CIBADMIN = /usr/sbin/cibadmin UNPRIV ALL=NOPASSWD: CIBADMIN Then, I logged into the dknight account, and attempted the following: sudo /usr/sbin/cibadmin -Q Worked, returning th

Re: [Linux-HA] Starting and Stopping LSB resources and adding Nodes using CIB.

2007-09-17 Thread Raoul Bhatia [IPAX]
Chad Osmond wrote: Hi, What is the correct process for starting an LSB resource, or having CIB recheck a resource to see if it's been started when it's a LSB resource. I have seen LSB resources shown as not running, when they are indeed running. from cib.xml or a lsb scripts point of view?

[Linux-HA] Question regarding primitives and groups inside a clone

2007-09-17 Thread Raoul Bhatia [IPAX]
hi, as far as i understand, a clone ressource can be used to tell the crm to start a resource "clone_node_max" on one host and "clone_max" times within the whole cluster. if only one node is started, the clone should therefore run only "clone_node_max" times within the whole cluster. i have a o

Re: [Linux-HA] Repository of ocf scripts?

2007-09-14 Thread Raoul Bhatia [IPAX]
dear list, if you need any hosting (trac or wiki, svn, etc.) for this repository i would be able to offer this to the community. kind regards, raoul bhatia Max Hofer wrote: I think what we need is something like a classification (pots): a) OCF RA delivered by the HA Linux project. Means well

Re: [Linux-HA] HB2 with DRBD (master_slave) and OCFS2 (clone)

2007-09-06 Thread Raoul Bhatia [IPAX]
Raoul Bhatia [IPAX] wrote: hi, i have a problem with hb2, drbd and ocfs2 and two (master-master) nodes: 1) when i start one node: heartbeat is starting and activates the 2 drbd devices. most of the time they become master (primary) but sometimes even this fails (see attached config/logfiles

Re: [Linux-HA] Testing linux-ha

2007-09-06 Thread Raoul Bhatia [IPAX]
Ben Clewett wrote: Are there any Linux guru's there who might know of a way of telling a Linux box to power off without killing processes? Just flush the disks and switch off... Any ideas? :) halt -f ? kind regards, raoul bhatia -- __