Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Simon Horman
On Thu, Mar 05, 2009 at 06:33:33AM +0100, Michael Schwartzkopff wrote:
> Simon Horman schrieb:
>> (...)
>> I agree that it would be good to have a good repository for
>> hb2.99/pacemaker on on Debian Stable/Lenny (as opposed to the efforts
>> to get  hb2.99/pacemaker into Debian experimental and subsequently,
>> Sid/unstable and Squeeze/testing).
>>
>> I may be mistaken, but as pacemaker wasn't included in Lenny I think
>> it will be difficult to get  hb2.99/pacemaker into backports.org,
>> though if that was possible it seems like it would be ideal.
>>
>> If that isn't possible, I wonder if the open build service provided
>> by SuSE would be a good option. I it already has Debian packages,
>> though I'm not sure if it is able to cope with Lenny yet (as opposed
>> to Etch which was Debian stable until quite recently).
>>   
> Hi,
>
> My intension just was to have a useable repository for lenny, the actual  
> debian distribution. If we could somehow bring it into the official  
> repositories, even better.

To clarify, Debian has serveral different distributions at any given time.
Typically experimental, unstable, testing, stable and oldstable.
Lenny is the current stable distribution.

> In my opinion the SuSE build service as it is now is NO option:
> - There are no usable packages for over half a year now. The packages  
> provided had dependencies not resolvable from the normal distribution.
> - There is no package for the i386 architecture, at least not if you add  
> the repository to your sources.
> - Much (!) slower build cycles compared to the SuSE products.
>
> Sorry, but these reasons could be my personal impression. So when the  
> compile the first time ran through (after a lot of bugs) I decided to  
> create my own repository so I could use it my own.

Well, I was suggesting fixing the packages on the open build service.
The main advantage that I was thinking of is that (in theory) they do
builds for multiple architectures automatically.

> @simon: Perhaps you want to host the packages? Of, course I could do  
> this also.

I'm happy to host them, but only if you don't want to.

-- 
Simon Horman
  VA Linux Systems Japan K.K., Sydney, Australia Satellite Office
  H: www.vergenet.net/~horms/ W: www.valinux.co.jp/en

___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Andrew Beekhof
On Thu, Mar 5, 2009 at 06:33, Michael Schwartzkopff  wrote:
> Simon Horman schrieb:
>>
>> (...)
>> I agree that it would be good to have a good repository for
>> hb2.99/pacemaker on on Debian Stable/Lenny (as opposed to the efforts
>> to get  hb2.99/pacemaker into Debian experimental and subsequently,
>> Sid/unstable and Squeeze/testing).
>>
>> I may be mistaken, but as pacemaker wasn't included in Lenny I think
>> it will be difficult to get  hb2.99/pacemaker into backports.org,
>> though if that was possible it seems like it would be ideal.
>>
>> If that isn't possible, I wonder if the open build service provided
>> by SuSE would be a good option. I it already has Debian packages,
>> though I'm not sure if it is able to cope with Lenny yet (as opposed
>> to Etch which was Debian stable until quite recently).
>>
>
> Hi,
>
> My intension just was to have a useable repository for lenny, the actual
> debian distribution. If we could somehow bring it into the official
> repositories, even better.
>
> In my opinion the SuSE build service as it is now is NO option:
> - There are no usable packages for over half a year now. The packages
> provided had dependencies not resolvable from the normal distribution.

Which ones?
I find that very hard to believe given that it builds against vanilla
installs of Etch.

Perhaps there are other targets, such as Lenny, that it could support
too - but thats a separate issue.

> - There is no package for the i386 architecture, at least not if you add the
> repository to your sources.

Yes, not having "proper" Debian repositories is highly annoying.

> - Much (!) slower build cycles compared to the SuSE products.

What exactly do you mean here?
The debian packages are built with the exact same tarballs at the
exact same time as the packages from every other distro.

> Sorry, but these reasons could be my personal impression. So when the
> compile the first time ran through (after a lot of bugs) I decided to create
> my own repository so I could use it my own.
>
> @simon: Perhaps you want to host the packages? Of, course I could do this
> also.
>
>
> Michael.
> ___
> Linux-HA mailing list
> Linux-HA@lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: testing watchdog

2009-03-04 Thread NAKAHIRA Kazutomo

> so if i kill sbd daemon then it should result in system reboot
> as watchdog will reboot the system right?
Yes. That's right.

Watchdog will reboot the system when SBD daemon is killed before
"sbd -d  -D -W message LOCAL exit" is executed.

Best Regards,
NAKAHIRA Kazutomo

Priyanka Ranjan wrote:

Thanks  Kazutomo,
yes i have started sbd daemon like this
sbd -d  -W -D watch.  so if i kill sbd daemon then it should
result in system reboot as watchdog will reboot the system right?


On Thu, Mar 5, 2009 at 11:59 AM, NAKAHIRA Kazutomo <
nakah...@intellilink.co.jp> wrote:


Hi, Priyanka

Are you starting SBD daemon with "-W" option?
If it is so, the SBD daemon writes the "" string
in softdog driver periodically.

Best Regards,
NAKAHIRA Kazutomo


Priyanka Ranjan wrote:


Thanks for reply Michael,
I went through the link you mentioned. it was useful but i could not get
answer of  my question. i want to know which daemon in Heartbeat
writes/updated softdog driver.

Regards,

On Wed, Mar 4, 2009 at 7:22 PM, Michael Schwartzkopff 
wrote:

 Am Mittwoch, 4. März 2009 14:47:32 schrieb Priyanka Ranjan:

Hi All,
when we configure watchdog in sbd stonith. sbd daemon keeps monitoring


the


watchdog and if it finds that watchdog is not updated  then it resets
the
node.  can anyone tell me which daemon update watchdog perodically.

Thanks,


See:
http://www.linux-ha.org/softdog
and the links therein for a beginning.

--
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: mi...@multinet.de
web: www.multinet.de

Sitz der Gesellschaft: 85630 Grasbrunn
Registergericht: Amtsgericht München HRB 114375
Geschäftsführer: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

 ___

Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems



--

NAKAHIRA Kazutomo
NTT DATA INTELLILINK CORPORATION
Open Source Business Unit
Software Services Integration Business Division

___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems



--

NAKAHIRA Kazutomo
NTT DATA INTELLILINK CORPORATION
Open Source Business Unit
Software Services Integration Business Division
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: testing watchdog

2009-03-04 Thread Priyanka Ranjan
Thanks  Kazutomo,
yes i have started sbd daemon like this
sbd -d  -W -D watch.  so if i kill sbd daemon then it should
result in system reboot as watchdog will reboot the system right?


On Thu, Mar 5, 2009 at 11:59 AM, NAKAHIRA Kazutomo <
nakah...@intellilink.co.jp> wrote:

> Hi, Priyanka
>
> Are you starting SBD daemon with "-W" option?
> If it is so, the SBD daemon writes the "" string
> in softdog driver periodically.
>
> Best Regards,
> NAKAHIRA Kazutomo
>
>
> Priyanka Ranjan wrote:
>
>> Thanks for reply Michael,
>> I went through the link you mentioned. it was useful but i could not get
>> answer of  my question. i want to know which daemon in Heartbeat
>> writes/updated softdog driver.
>>
>> Regards,
>>
>> On Wed, Mar 4, 2009 at 7:22 PM, Michael Schwartzkopff > >wrote:
>>
>>  Am Mittwoch, 4. März 2009 14:47:32 schrieb Priyanka Ranjan:
>>>
 Hi All,
 when we configure watchdog in sbd stonith. sbd daemon keeps monitoring

>>> the
>>>
 watchdog and if it finds that watchdog is not updated  then it resets
 the
 node.  can anyone tell me which daemon update watchdog perodically.

 Thanks,

>>> See:
>>> http://www.linux-ha.org/softdog
>>> and the links therein for a beginning.
>>>
>>> --
>>> Dr. Michael Schwartzkopff
>>> MultiNET Services GmbH
>>> Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
>>> Tel: +49 - 89 - 45 69 11 0
>>> Fax: +49 - 89 - 45 69 11 21
>>> mob: +49 - 174 - 343 28 75
>>>
>>> mail: mi...@multinet.de
>>> web: www.multinet.de
>>>
>>> Sitz der Gesellschaft: 85630 Grasbrunn
>>> Registergericht: Amtsgericht München HRB 114375
>>> Geschäftsführer: Günter Jurgeneit, Hubert Martens
>>>
>>> ---
>>>
>>> PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
>>> Skype: misch42
>>> ___
>>> Linux-HA mailing list
>>> Linux-HA@lists.linux-ha.org
>>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>>> See also: http://linux-ha.org/ReportingProblems
>>>
>>>  ___
>> Linux-HA mailing list
>> Linux-HA@lists.linux-ha.org
>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
>> See also: http://linux-ha.org/ReportingProblems
>>
>
>
> --
> 
> NAKAHIRA Kazutomo
> NTT DATA INTELLILINK CORPORATION
> Open Source Business Unit
> Software Services Integration Business Division
>
> ___
> Linux-HA mailing list
> Linux-HA@lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


[Linux-HA] Re: OCF Script for Jboss

2009-03-04 Thread Takenaka Kazuhiro

Hello Stefan.

I am planning to test your jboss RA on the following enviroment.

Redhat 5.2(i386)
heartbeat 2.1.4
jdk-1_5_0_17
jboss-eap-4.3.0.GA_CP03

If anything turns out, I will post it.

Wait my report without haste.

Stefan Schluppeck:

Hi,

I have written a jboss ocf script, to run jboss as active/passive resource.
It is based on the tomcat script. It is tested well under Novell SLES10 SP2
64bit with jboss-4.2.3.GA, and jdk1.6.0_11

Please take a look.





___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

--
Takenaka Kazuhiro 
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Having issues with getting DRBD to work with Pacemaker

2009-03-04 Thread Dominik Klein
Hi

Jerome Yanga wrote:
> Hi!  I am having issues with getting DRBD to work with Pacemaker.  I can get 
> Pacemaker and DRBD run individually but not DRBD managed by Pacemaker.  I 
> tried following the instruction in the site below but the resources will not 
> go online.
> 
> http://clusterlabs.org/wiki/DRBD_HowTo_1.0
> 
> Below is my configuration.
> 
> Installed applications:
> ===
> kernel-2.6.18-128.el5

copy that

> drbd-8.3.0-3
> heartbeat-2.99.2-6.1
> pacemaker-1.0.1-3.1
> 
> 
> 
> drbd.conf:
> ==
> global {
> usage-count no;
> }
> 
> resource r0 {
>   protocol C;
>   handlers {
> pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
> pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
> local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
> outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
> pri-lost "echo pri-lost. Have a look at the log files. | mail -s 'DRBD 
> Alert' root";
> out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh root";
>   }
>   startup {
>  wfc-timeout  0;
>   }
> 
>   disk {
> on-io-error   pass_on;
>   }
>   net {
>  max-buffers 2048;
> after-sb-0pri disconnect;
> after-sb-1pri disconnect;
> after-sb-2pri disconnect;
> rr-conflict disconnect;
>   }
>   syncer {
> rate 100M;
> al-extents 257;
>   }
>   on nomen.esri.com {
> device /dev/drbd0;
> disk   /dev/sda5;
> address192.168.0.1:7789;
> meta-disk  internal;
>   }
>   on rubric.esri.com {
> device/dev/drbd0;
> disk  /dev/sda5;
> address   192.168.0.2:7789;
> meta-disk internal;
>   }
> }
> 
> 
> 
> Cib.xml:
> 
>  have-quorum="1" dc-uuid="a5
> e95310-f27d-418e-9cb9-42e50310f702" epoch="56" num_updates="0" 
> cib-last-written="Wed Mar  4 14:27:59
>  2009">
>   
> 
>   
>  value="1.0.1-node: 6fc5ce830
> 2abf145a02891ec41e5a492efbe8efe"/>
>   
> 
> 
>type="normal"/>
>type="normal"/>
> 
> 
>   
> 
>value="2"/>
>value="true"/>
>name="globally-unique" value="false"
> />
>id="ms-drbd0-meta_attributes-target-role" value="Started"/>
> 
> 
>   
>  name="drbd_resource" value="r0"/>
>   
>   
>  role="Master" timeout="30s"/>
>  role="Slave" timeout="30s"/>
>   
> 
>   
> 
> 
>   
> 
> 
> 
> /var/log/messages:
> ==
> Mar  4 14:27:58 nomen crm_resource: [30167]: info: Invoked: crm_resource 
> --meta -r ms-drbd0 -p target-role -v Started
> Mar  4 14:27:58 nomen cib: [29899]: info: cib_process_xpath: Processing 
> cib_query op for 
> //cib/configuration/resources//*...@id="ms-drbd0"]//meta_attributes//nvpa...@name="target-role"]
>  (/cib/configuration/resources/master/meta_attributes/nvpair[4])
> Mar  4 14:27:59 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
> key=5:5:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_start_0 )
> Mar  4 14:27:59 nomen haclient: on_event:evt:cib_changed
> Mar  4 14:27:59 nomen lrmd: [29900]: info: rsc:drbd0:0: start
> Mar  4 14:27:59 nomen cib: [30168]: info: write_cib_contents: Wrote version 
> 0.56.0 of the CIB to disk (digest: 2365d9802f1b9c55e0ed87b8ebda5db3)
> Mar  4 14:27:59 nomen cib: [30168]: info: retrieveCib: Reading cluster 
> configuration from: /var/lib/heartbeat/crm/cib.xml (digest: 
> /var/lib/heartbeat/crm/cib.xml.sig)
> Mar  4 14:27:59 nomen cib: [29899]: info: Managed write_cib_contents process 
> 30168 exited with return code 0.
> Mar  4 14:27:59 nomen modprobe: FATAL: Module drbd not found.
> Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout)
> Mar  4 14:27:59 nomen mgmtd: [29904]: info: CIB query: cib
> Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout) 
> Could not stat("/proc/drbd"): No such file or directory do you need to load 
> the module? try: modprobe drbd Command 'drbdsetup /dev/drbd0 disk /dev/sda5 
> /dev/sda5 internal --set-defaults --create-device --on-io-error=pass_on' 
> terminated with exit code 20 drbdadm attach r0: exited with code 20
> Mar  4 14:27:59 nomen drbd[30169]: ERROR: r0 start: not in Secondary mode 
> after start.
> Mar  4 14:27:59 nomen lrmd: [29900]: WARN: Managed drbd0:0:start process 
> 30169 exited with return code 1.
> Mar  4 14:27:59 nomen crmd: [29903]: info: process_lrm_event: LRM operation 
> drbd0:0_start_0 (call=3, rc=1, cib-update=13, confirmed=true) complete 
> unknown error
> Mar  4 14:27:59 nomen haclient: on_event: from message queue: evt:cib_changed
> Mar  4 14:27:59 nomen mgmtd: [29904]: info: CIB query: cib
> Mar  4 14:28:00 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
> key=41:6:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_notify_0 )
> Mar  4 14:28:00 nomen lrmd: [29900]: info: rsc:drbd0:0: notify
> Mar  4 14:28:00 nomen lrmd: [29900]: info: Managed drbd0:0:noti

Re: [Linux-HA] Re: testing watchdog

2009-03-04 Thread NAKAHIRA Kazutomo

Hi, Priyanka

Are you starting SBD daemon with "-W" option?
If it is so, the SBD daemon writes the "" string
in softdog driver periodically.

Best Regards,
NAKAHIRA Kazutomo

Priyanka Ranjan wrote:

Thanks for reply Michael,
I went through the link you mentioned. it was useful but i could not get
answer of  my question. i want to know which daemon in Heartbeat
writes/updated softdog driver.

Regards,

On Wed, Mar 4, 2009 at 7:22 PM, Michael Schwartzkopff wrote:


Am Mittwoch, 4. März 2009 14:47:32 schrieb Priyanka Ranjan:

Hi All,
when we configure watchdog in sbd stonith. sbd daemon keeps monitoring

the

watchdog and if it finds that watchdog is not updated  then it resets the
node.  can anyone tell me which daemon update watchdog perodically.

Thanks,

See:
http://www.linux-ha.org/softdog
and the links therein for a beginning.

--
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: mi...@multinet.de
web: www.multinet.de

Sitz der Gesellschaft: 85630 Grasbrunn
Registergericht: Amtsgericht München HRB 114375
Geschäftsführer: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems



--

NAKAHIRA Kazutomo
NTT DATA INTELLILINK CORPORATION
Open Source Business Unit
Software Services Integration Business Division
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Having issues with getting DRBD to work with Pacemaker

2009-03-04 Thread Neil Katin


OK.  I notice you're running drbd 8.3.  The scripts distributed with 
pacemaker 1.0.1 only worked with 8.2
(I'm not sure about 1.0.2).  I made a few tiny changes to the drbd 
script so it would run with 8.3.  I've
attached the changed script.  I also added more logging so you can see 
exactly what is going on in the
drbd OCF script at debug level.  You probably want to change your 
loggging to debug to get all

the output while trying to figure this out.

However, the errors I got from the script were very different from 
yours: I was getting errors

from drbdadm, not failures to load the driver.

The other thing I did was run the OCF scripts "by hand" (you have to set 
a bunch of env variables).  I can't
find the script I used to test drbd, but I've attached one I used for 
mysql; you should be able to adapt it to

your use.  As always, remember "bash -x" is your friend.

   Neil

Jerome Yanga wrote:

Hi Neil!

Yes.  DRBD works outside of Pacemaker.  When I do a "service drbd start" on each node, 
drbd runs properly and are both "Secondary".

jerome
  
  
#!/bin/sh
#
#
#   OCF Resource Agent compliant drbd resource script.
#
# Copyright (c) 2004 - 2007 SUSE LINUX Products GmbH, Lars Marowsky-Bree
#All Rights Reserved.
#
# This program is free software; you can redistribute it and/or modify
# it under the terms of version 2 of the GNU General Public License as
# published by the Free Software Foundation.
#
# This program is distributed in the hope that it would be useful, but
# WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
#
# Further, this software is distributed without any warranty that it is
# free of the rightful claim of any third person regarding infringement
# or the like.  Any license provided herein, whether implied or
# otherwise, applies only to this software file.  Patent licenses, if
# any, provided herein do not apply to combinations of this program with
# other software, or any other product whatsoever.
#
# You should have received a copy of the GNU General Public License
# along with this program; if not, write the Free Software Foundation,
# Inc., 59 Temple Place - Suite 330, Boston MA 02111-1307, USA.
#
#

# OCF instance parameters
#   OCF_RESKEY_drbd_resource
#   OCF_RESKEY_drbdconf
#   OCF_RESKEY_CRM_meta_clone_max
#   OCF_RESKEY_CRM_meta_clone_node_max
#   OCF_RESKEY_master_max
#   OCF_RESKEY_master_node_max


###
# Initialization:

if [ -n "$OCF_DEBUG_LIBRARY" ]; then
. $OCF_DEBUG_LIBRARY
else
. ${OCF_ROOT}/resource.d/heartbeat/.ocf-shellfuncs
fi

###

meta_data() {
cat <


1.1


Master/Slave OCF Resource Agent for DRBD


This resource agent manages a Distributed
Replicated Block Device (DRBD) object as a master/slave
resource. DRBD is a mechanism for replicating storage; please see the
documentation for setup details.




The name of the drbd resource from the drbd.conf file.

drbd resource name





Full path to the drbd.conf file.

Path to drbd.conf





Whether or not to override the hostname with the clone number. This can
be used to create floating peer configurations; drbd will be told to
use node_ as the hostname instead of the real uname,
which can then be used in drbd.conf.

Override drbd hostname






Number of clones of this drbd resource. Do not fiddle with the default.

Number of clones





Clones per node. Do not fiddle with the default.

Number of nodes





Maximum number of active primaries. Do not fiddle with the default.

Number of primaries





Maximum number of primaries per node. Do not fiddle with the default.

Number of primaries per node
















END

exit $OCF_SUCCESS
}

do_cmd() {
local cmd="$*"
ocf_log debug "$RESOURCE: Calling $cmd"
local cmd_out=$($cmd 2>&1)
ret=$?

if [ $ret -ne 0 ]; then
ocf_log err "$RESOURCE: Called $cmd"
ocf_log err "$RESOURCE: Exit code $ret"
ocf_log err "$RESOURCE: Command output: $cmd_out"
else
ocf_log debug "$RESOURCE: Exit code $ret"
ocf_log debug "$RESOURCE: Command output: $cmd_out"
fi

echo $cmd_out

return $ret
}

do_drbdadm() {
local cmd="$DRBDADM -c $DRBDCONF $*"
ocf_log debug "$RESOURCE: Calling $cmd"
local cmd_out=$($cmd 2>&1)
ret=$?
# Trim the garbage drbdadm likes to print when using the node
# override feature:
local cmd_ret=$(echo $cmd_out | sed -e 's/found __DRBD_NODE__.*

Re: [Linux-HA] Re: testing watchdog

2009-03-04 Thread Priyanka Ranjan
Thanks for reply Michael,
I went through the link you mentioned. it was useful but i could not get
answer of  my question. i want to know which daemon in Heartbeat
writes/updated softdog driver.

Regards,

On Wed, Mar 4, 2009 at 7:22 PM, Michael Schwartzkopff wrote:

> Am Mittwoch, 4. März 2009 14:47:32 schrieb Priyanka Ranjan:
> > Hi All,
> > when we configure watchdog in sbd stonith. sbd daemon keeps monitoring
> the
> > watchdog and if it finds that watchdog is not updated  then it resets the
> > node.  can anyone tell me which daemon update watchdog perodically.
> >
> > Thanks,
>
> See:
> http://www.linux-ha.org/softdog
> and the links therein for a beginning.
>
> --
> Dr. Michael Schwartzkopff
> MultiNET Services GmbH
> Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
> Tel: +49 - 89 - 45 69 11 0
> Fax: +49 - 89 - 45 69 11 21
> mob: +49 - 174 - 343 28 75
>
> mail: mi...@multinet.de
> web: www.multinet.de
>
> Sitz der Gesellschaft: 85630 Grasbrunn
> Registergericht: Amtsgericht München HRB 114375
> Geschäftsführer: Günter Jurgeneit, Hubert Martens
>
> ---
>
> PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
> Skype: misch42
> ___
> Linux-HA mailing list
> Linux-HA@lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Michael Schwartzkopff

Simon Horman schrieb:

(...)
I agree that it would be good to have a good repository for
hb2.99/pacemaker on on Debian Stable/Lenny (as opposed to the efforts
to get  hb2.99/pacemaker into Debian experimental and subsequently,
Sid/unstable and Squeeze/testing).

I may be mistaken, but as pacemaker wasn't included in Lenny I think
it will be difficult to get  hb2.99/pacemaker into backports.org,
though if that was possible it seems like it would be ideal.

If that isn't possible, I wonder if the open build service provided
by SuSE would be a good option. I it already has Debian packages,
though I'm not sure if it is able to cope with Lenny yet (as opposed
to Etch which was Debian stable until quite recently).
  

Hi,

My intension just was to have a useable repository for lenny, the actual 
debian distribution. If we could somehow bring it into the official 
repositories, even better.


In my opinion the SuSE build service as it is now is NO option:
- There are no usable packages for over half a year now. The packages 
provided had dependencies not resolvable from the normal distribution.
- There is no package for the i386 architecture, at least not if you add 
the repository to your sources.

- Much (!) slower build cycles compared to the SuSE products.

Sorry, but these reasons could be my personal impression. So when the 
compile the first time ran through (after a lot of bugs) I decided to 
create my own repository so I could use it my own.


@simon: Perhaps you want to host the packages? Of, course I could do 
this also.



Michael.
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Simon Horman
On Wed, Mar 04, 2009 at 09:13:16AM +0100, Michael Schwartzkopff wrote:
> Am Mittwoch, 4. März 2009 07:59:16 schrieb Thomas Mueller:
> > On Wed, 04 Mar 2009 09:13:31 +1100, Simon Horman wrote:
> > > On Tue, Mar 03, 2009 at 04:24:06PM +0100, Michael Schwartzkopff wrote:
> > >> Am Dienstag, 3. März 2009 15:33:38 schrieb Michael Schwartzkopff:
> > >> > Am Dienstag, 3. März 2009 12:32:35 schrieb Michael Schwartzkopff:
> > >> > > Hi,
> > >> > >
> > >> > > I set up a new experimental repository for the debian distro:
> > >> > >
> > >> > > - It contains heartbeat-2, pacemaker and pacemaker-gui - No OpenAIS
> > >> > > for now. Please mail me if you need it. - I compiled it with debian
> > >> > > lenny, so it will not work with etch. - The files are generated
> > >> > > automatically every night. The repository is updated if the compile
> > >> > > was successful. So you will find always the latest versions here.
> > >> > > - The files are only for i386 arch. Sorry, but I do not have access
> > >> > > to a x64 machine. Donations are welcome ;-)
> > >
> > > Hi Michael,
> > >
> > > which debian distro do these packages target? If it is sid or
> > > experimental, are they based off the packages that I have on
> > > http://packages.vergenet.net/debian/experimental/ ? I know that I have
> > > been slow on that front, but it would be good to combine any efforts in
> > > that area.
> >
> > my vote for "combine any efforts in that area". would be nice to have one
> > good repository for hb2.99/pacemaker on debian. if needed, i can compile
> > on amd64.

I agree that it would be good to have a good repository for
hb2.99/pacemaker on on Debian Stable/Lenny (as opposed to the efforts
to get  hb2.99/pacemaker into Debian experimental and subsequently,
Sid/unstable and Squeeze/testing).

I may be mistaken, but as pacemaker wasn't included in Lenny I think
it will be difficult to get  hb2.99/pacemaker into backports.org,
though if that was possible it seems like it would be ideal.

If that isn't possible, I wonder if the open build service provided
by SuSE would be a good option. I it already has Debian packages,
though I'm not sure if it is able to cope with Lenny yet (as opposed
to Etch which was Debian stable until quite recently).

> Ok. If you compile it I can host the files afterwards. Or perhaps Simon wants 
> to help also.

I am happy to help with compiling, but I'm also happy for Thomas to
help out.

Actually, I'm happy with anything that involves more/better Debian
packages out there.

-- 
Simon Horman
  VA Linux Systems Japan K.K., Sydney, Australia Satellite Office
  H: www.vergenet.net/~horms/ W: www.valinux.co.jp/en

___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Simon Horman
On Wed, Mar 04, 2009 at 09:24:24AM +0100, Michael Schwartzkopff wrote:
> Am Mittwoch, 4. März 2009 07:59:16 schrieb Thomas Mueller:
> > On Wed, 04 Mar 2009 09:13:31 +1100, Simon Horman wrote:
> > > On Tue, Mar 03, 2009 at 04:24:06PM +0100, Michael Schwartzkopff wrote:
> > >> Am Dienstag, 3. März 2009 15:33:38 schrieb Michael Schwartzkopff:
> > >> > Am Dienstag, 3. März 2009 12:32:35 schrieb Michael Schwartzkopff:
> > >> > > Hi,
> > >> > >
> > >> > > I set up a new experimental repository for the debian distro:
> > >> > >
> > >> > > - It contains heartbeat-2, pacemaker and pacemaker-gui - No OpenAIS
> > >> > > for now. Please mail me if you need it. - I compiled it with debian
> > >> > > lenny, so it will not work with etch. - The files are generated
> > >> > > automatically every night. The repository is updated if the compile
> > >> > > was successful. So you will find always the latest versions here.
> > >> > > - The files are only for i386 arch. Sorry, but I do not have access
> > >> > > to a x64 machine. Donations are welcome ;-)
> > >
> > > Hi Michael,
> > >
> > > which debian distro do these packages target? If it is sid or
> > > experimental, are they based off the packages that I have on
> > > http://packages.vergenet.net/debian/experimental/ ? I know that I have
> > > been slow on that front, but it would be good to combine any efforts in
> > > that area.
> 
> Hi,
> 
> these packets are compiled on lenny. I wrote a script that gets the latest 
> sources (see http://www.clusterlabs.org/wiki/Install#From_Source), patches 
> them to get rid of openais, compiles and builds the packages. 

Thanks for the clarification.

-- 
Simon Horman
  VA Linux Systems Japan K.K., Sydney, Australia Satellite Office
  H: www.vergenet.net/~horms/ W: www.valinux.co.jp/en

___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Order of starting heartbeat processes on solaris (was crm_mon vs cl_status)

2009-03-04 Thread Harakiri

Ok i figured out why crm_mon didnt work and why crmd wasnt initialized 
correctly.

On solaris, the heartbeat subprocesses are not spawned in the same order as on 
other systems - they take some time to spawn - this is the issue.

My first hack was just put in a sleep in the heartbeat.c before crmd is spawned 
- to wait 20sec so that all other processes were already finished initializing 
- this worked at least most of time.

The real issue however was that you would see

crmd[29618]: 2009/03/05_01:34:24 ERROR: socket_client_channel_new: 
open(/var/run/heartbeat/crm/cib_rw, ...) failure: No such file or directory

which means that crmd was still sometimes initialized before cib or ccm.

Remember - on solaris pipes are used, not sockets.

What i simply did, was hack a while loop in lib/clplumbing/ipcsocket.c for 
sockfd = open(path_name, O_RDWR|O_NONBLOCK);
 till sockfd != -1

this worked perfect, i could see the number of times it tried to open the 
socket, and after a while it was created by another subprocess.

Then crm_mon and cibadmin would always work - also stopping heartbeat would now 
too always work under solaris - it would no longer hang - the issue why it hang 
was that crmd couldnt be stopped - when you killed crmd normally - heartbeat 
could shutdown

The final issue is, that the pipes were not sometimes removed from 
/var/run/heartbeat - this would lead that crmd wouldnt always work - a simply 
fix was in the init.d script to rm the run dir pipes before start.


--- On Wed, 3/4/09, Harakiri  wrote:

> From: Harakiri 
> Subject: Re: [Linux-HA] crm_mon vs cl_status
> To: "General Linux-HA mailing list" , "Andrew 
> Beekhof" 
> Date: Wednesday, March 4, 2009, 11:15 AM
> Thanks for answering, 
> 
> 
> --- On Wed, 3/4/09, Andrew Beekhof
>  wrote:
> 
> > 
> > crm_mon takes other things into account.
> > but without logs or the current cib its impossible to
> say
> > for sure why
> > this is happening.
> 
> 
> after a reboot, or restart the following log information
> are found in ha-debug
> 
> http://pastebin.com/m7d9c71f7
> 
> note the only error is :
> 
> mgmtd[5612]: 2009/03/04_16:58:25 ERROR:
> socket_client_channel_new:
> open(/var/lib/heartbeat/run/heartbeat/lrm_cmd_sock, ...)
> failure: No such file or directory
> 
> but it exists - its probably a race condition and created
> later:
> 
> ls -la /var/lib/heartbeat/run/heartbeat/lrm_cmd_sock
> prwxrwxrwx   1 root root   0 Mar  4 16:58
> /var/lib/heartbeat/run/heartbeat/lrm_cmd_sock|
> 
> At this point, cibadmin etc will not work and hang because
> they cant seem to connect to the crmd, crm_mon will indicate
> the note as offline
> 
> After killing crmd the following log information is found:
> 
> http://pastebin.com/m29a3ec9d
> 
> crmd[5644]: 2009/03/04_17:06:29 info: do_cib_control: CIB
> connection established
> 
> etc
> 
> So it seems that on the initial start crmd does not
> correctly initialize, maybe the cib process has to be
> started before crmd?
> 
> Maybe its related to the issue that under solaris sparc
> PIPES are used instead of sockets for communication
> 
> PIPES were introduced because of this patch
> 
> http://www.mail-archive.com/linux-ha-...@lists.linux-ha.org/msg00307.html
> 
> since i have solaris 10 i tried to use streams but i dont
> find the ucred.h anywere for solaris.
> 
> Any ideas? How can i modify the "Starting child
> client" in different order?
> 
> Thanks
> 
> 
>   
> ___
> Linux-HA mailing list
> Linux-HA@lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems


  
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


[Linux-HA] live migrate

2009-03-04 Thread David Pinkerton H

Having an issue with live migrates:

When I migrate a single domU (ie. crm_resource -M -r domU) the source dom0 
calls "migrate_to" and the target dom0 calls "migrate_from" - as expected.
If I execute several migrates at once, the source dom0 calls "migrate_to" 
whereas the target now calls "migrate_from" for the first domU and "start" for 
the remainder...

All domU's have allowed_migrate set to 1

I added the following code to the xen script to dump the calls/variables.

if [ "${DEBUG_MODE}" -eq "1" ]
then
echo "="  >> /tmp/xen.log
echo "`date +%r`"   >> /tmp/xen.log
echo "$*"   >> /tmp/xen.log
env | grep OCF | sort   >> /tmp/xen.log
fi



Is this behaviour correct?


=
03:37:34 PM
migrate_to
OCF_RA_VERSION_MAJOR=1
OCF_RA_VERSION_MINOR=0
OCF_RESKEY_CRM_meta_id=op_lpdhcp01_stop
OCF_RESKEY_CRM_meta_migrate_source=lpxenhost16
OCF_RESKEY_CRM_meta_migrate_target=lpxenhost15
OCF_RESKEY_CRM_meta_name=stop
OCF_RESKEY_CRM_meta_timeout=30
OCF_RESKEY_allow_migrate=1
OCF_RESKEY_crm_feature_set=2.0
OCF_RESKEY_internal_ip=10.10.202.101
OCF_RESKEY_shutdown_timeout=280
OCF_RESKEY_xmfile=/proj/xenconfigs/lpdhcp01
OCF_RESOURCE_INSTANCE=lpdhcp01
OCF_RESOURCE_PROVIDER=cml
OCF_RESOURCE_TYPE=xen
OCF_ROOT=/usr/lib/ocf

=
03:37:34 PM
start
OCF_RA_VERSION_MAJOR=1
OCF_RA_VERSION_MINOR=0
OCF_RESKEY_CRM_meta_id=op_lddhcp01_start
OCF_RESKEY_CRM_meta_name=start
OCF_RESKEY_CRM_meta_timeout=18
OCF_RESKEY_allow_migrate=1
OCF_RESKEY_crm_feature_set=2.0
OCF_RESKEY_internal_ip=10.10.176.75
OCF_RESKEY_shutdown_timeout=280
OCF_RESKEY_xmfile=/proj/xenconfigs/lddhcp01
OCF_RESOURCE_INSTANCE=lddhcp01
OCF_RESOURCE_PROVIDER=cml
OCF_RESOURCE_TYPE=xen
OCF_ROOT=/usr/lib/ocf







David H Pinkerton
Systems Engineer - Linux Team/Platform Services
745 Springvale Road
Mulgrave 3170
*  8544 6827
*  0488 904 232
*  david.h.pinker...@colesgroup.com.au

Unix Team Site: 
http://portal.cmlconnect.org/portal/wps/portal/retail_support/it/teams/unix_systems





This email and any attachments may contain privileged and confidential 
information
and are intended for the named addressee only. If you have received this e-mail 
in
error, please notify the sender and delete this e-mail immediately. Any
confidentiality, privilege or copyright is not waived or lost because this 
e-mail
has been sent to you in error. It is your responsibility to check this e-mail 
and
any attachments for viruses.  No warranty is made that this material is free 
from
computer virus or any other defect or error.  Any loss/damage incurred by using 
this
material is not the sender's responsibility.  The sender's entire liability 
will be
limited to resupplying the material.

___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


RE: [Linux-HA] Having issues with getting DRBD to work with Pacemaker

2009-03-04 Thread Jerome Yanga
Hi Neil!

Yes.  DRBD works outside of Pacemaker.  When I do a "service drbd start" on 
each node, drbd runs properly and are both "Secondary".

jerome

-Original Message-
From: linux-ha-boun...@lists.linux-ha.org 
[mailto:linux-ha-boun...@lists.linux-ha.org] On Behalf Of Neil Katin
Sent: Wednesday, March 04, 2009 4:00 PM
To: General Linux-HA mailing list
Subject: Re: [Linux-HA] Having issues with getting DRBD to work with Pacemaker


Does drbd work outside of pacemaker?  I suspect perhaps not from these lines in 
your log:

Mar  4 14:27:59 nomen modprobe: FATAL: Module drbd not found.
Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout) 
Could not stat("/proc/drbd"): No such file or directory do you need to load the 
module? try: modprobe drbd Command 'drbdsetup /dev/drbd0 disk /dev/sda5 
/dev/sda5 internal --set-defaults --create-device --on-io-error=pass_on' 
terminated with exit code 20 drbdadm attach r0: exited with code 20
Mar  4 14:27:59 nomen drbd[30169]: ERROR: r0 start: not in Secondary mode after 
start.

Try starting drbd "by hand" with pacemaker turned off; it should come up on 
both nodes, with
both nodes as "secondary".  If it doesn't they you have to fix drbd first 
before trying to
add pacemaker to the mix.

 Neil

Jerome Yanga wrote:
> Hi!  I am having issues with getting DRBD to work with Pacemaker.  I can get 
> Pacemaker and DRBD run individually but not DRBD managed by Pacemaker.  I 
> tried following the instruction in the site below but the resources will not 
> go online.
> 
> http://clusterlabs.org/wiki/DRBD_HowTo_1.0
> 
> Below is my configuration.
> 
> Installed applications:
> ===
> kernel-2.6.18-128.el5
> drbd-8.3.0-3
> heartbeat-2.99.2-6.1
> pacemaker-1.0.1-3.1
> 
> 
> 
> drbd.conf:
> ==
> global {
> usage-count no;
> }
> 
> resource r0 {
>   protocol C;
>   handlers {
> pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
> pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
> local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
> outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
> pri-lost "echo pri-lost. Have a look at the log files. | mail -s 'DRBD 
> Alert' root";
> out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh root";
>   }
>   startup {
>  wfc-timeout  0;
>   }
> 
>   disk {
> on-io-error   pass_on;
>   }
>   net {
>  max-buffers 2048;
> after-sb-0pri disconnect;
> after-sb-1pri disconnect;
> after-sb-2pri disconnect;
> rr-conflict disconnect;
>   }
>   syncer {
> rate 100M;
> al-extents 257;
>   }
>   on nomen.esri.com {
> device /dev/drbd0;
> disk   /dev/sda5;
> address192.168.0.1:7789;
> meta-disk  internal;
>   }
>   on rubric.esri.com {
> device/dev/drbd0;
> disk  /dev/sda5;
> address   192.168.0.2:7789;
> meta-disk internal;
>   }
> }
> 
> 
> 
> Cib.xml:
> 
>  have-quorum="1" dc-uuid="a5
> e95310-f27d-418e-9cb9-42e50310f702" epoch="56" num_updates="0" 
> cib-last-written="Wed Mar  4 14:27:59
>  2009">
>   
> 
>   
>  value="1.0.1-node: 6fc5ce830
> 2abf145a02891ec41e5a492efbe8efe"/>
>   
> 
> 
>type="normal"/>
>type="normal"/>
> 
> 
>   
> 
>value="2"/>
>value="true"/>
>name="globally-unique" value="false"
> />
>id="ms-drbd0-meta_attributes-target-role" value="Started"/>
> 
> 
>   
>  name="drbd_resource" value="r0"/>
>   
>   
>  role="Master" timeout="30s"/>
>  role="Slave" timeout="30s"/>
>   
> 
>   
> 
> 
>   
> 
> 
> 
> /var/log/messages:
> ==
> Mar  4 14:27:58 nomen crm_resource: [30167]: info: Invoked: crm_resource 
> --meta -r ms-drbd0 -p target-role -v Started
> Mar  4 14:27:58 nomen cib: [29899]: info: cib_process_xpath: Processing 
> cib_query op for 
> //cib/configuration/resources//*...@id="ms-drbd0"]//meta_attributes//nvpa...@name="target-role"]
>  (/cib/configuration/resources/master/meta_attributes/nvpair[4])
> Mar  4 14:27:59 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
> key=5:5:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_start_0 )
> Mar  4 14:27:59 nomen haclient: on_event:evt:cib_changed
> Mar  4 14:27:59 nomen lrmd: [29900]: info: rsc:drbd0:0: start
> Mar  4 14:27:59 nomen cib: [30168]: info: write_cib_contents: Wrote version 
> 0.56.0 of the CIB to disk (digest: 2365d9802f1b9c55e0ed87b8ebda5db3)
> Mar  4 14:27:59 nomen cib: [30168]: info: retrieveCib: Reading cluster 
> configuration from: /var/lib/heartbeat/crm/cib.xml (digest: 
> /var/lib/heartbeat/crm/cib.xml.sig)
> Mar  4 14:27:59 nomen cib: [29899]: info: Managed write_cib_contents process 
> 30168 exited with return code 0.
> Mar  4 14:27:59 nomen modprobe: FATAL: Module drbd not found.
> Mar  4 14:27:59 nomen lrmd: [29900]: i

Re: [Linux-HA] Having issues with getting DRBD to work with Pacemaker

2009-03-04 Thread Neil Katin


Does drbd work outside of pacemaker?  I suspect perhaps not from these lines in 
your log:

Mar  4 14:27:59 nomen modprobe: FATAL: Module drbd not found.
Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout) Could not 
stat("/proc/drbd"): No such file or directory do you need to load the module? 
try: modprobe drbd Command 'drbdsetup /dev/drbd0 disk /dev/sda5 /dev/sda5 internal 
--set-defaults --create-device --on-io-error=pass_on' terminated with exit code 20 
drbdadm attach r0: exited with code 20
Mar  4 14:27:59 nomen drbd[30169]: ERROR: r0 start: not in Secondary mode after 
start.

Try starting drbd "by hand" with pacemaker turned off; it should come up on 
both nodes, with
both nodes as "secondary".  If it doesn't they you have to fix drbd first 
before trying to
add pacemaker to the mix.

Neil

Jerome Yanga wrote:

Hi!  I am having issues with getting DRBD to work with Pacemaker.  I can get 
Pacemaker and DRBD run individually but not DRBD managed by Pacemaker.  I tried 
following the instruction in the site below but the resources will not go 
online.

http://clusterlabs.org/wiki/DRBD_HowTo_1.0

Below is my configuration.

Installed applications:
===
kernel-2.6.18-128.el5
drbd-8.3.0-3
heartbeat-2.99.2-6.1
pacemaker-1.0.1-3.1



drbd.conf:
==
global {
usage-count no;
}

resource r0 {
  protocol C;
  handlers {
pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
pri-lost "echo pri-lost. Have a look at the log files. | mail -s 'DRBD Alert' 
root";
out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh root";
  }
  startup {
 wfc-timeout  0;
  }

  disk {
on-io-error   pass_on;
  }
  net {
 max-buffers 2048;
after-sb-0pri disconnect;
after-sb-1pri disconnect;
after-sb-2pri disconnect;
rr-conflict disconnect;
  }
  syncer {
rate 100M;
al-extents 257;
  }
  on nomen.esri.com {
device /dev/drbd0;
disk   /dev/sda5;
address192.168.0.1:7789;
meta-disk  internal;
  }
  on rubric.esri.com {
device/dev/drbd0;
disk  /dev/sda5;
address   192.168.0.2:7789;
meta-disk internal;
  }
}



Cib.xml:


  

  

  


  
  


  

  
  
  
  


  

  
  


  

  


  



/var/log/messages:
==
Mar  4 14:27:58 nomen crm_resource: [30167]: info: Invoked: crm_resource --meta 
-r ms-drbd0 -p target-role -v Started
Mar  4 14:27:58 nomen cib: [29899]: info: cib_process_xpath: Processing cib_query op for 
//cib/configuration/resources//*...@id="ms-drbd0"]//meta_attributes//nvpa...@name="target-role"]
 (/cib/configuration/resources/master/meta_attributes/nvpair[4])
Mar  4 14:27:59 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
key=5:5:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_start_0 )
Mar  4 14:27:59 nomen haclient: on_event:evt:cib_changed
Mar  4 14:27:59 nomen lrmd: [29900]: info: rsc:drbd0:0: start
Mar  4 14:27:59 nomen cib: [30168]: info: write_cib_contents: Wrote version 
0.56.0 of the CIB to disk (digest: 2365d9802f1b9c55e0ed87b8ebda5db3)
Mar  4 14:27:59 nomen cib: [30168]: info: retrieveCib: Reading cluster 
configuration from: /var/lib/heartbeat/crm/cib.xml (digest: 
/var/lib/heartbeat/crm/cib.xml.sig)
Mar  4 14:27:59 nomen cib: [29899]: info: Managed write_cib_contents process 
30168 exited with return code 0.
Mar  4 14:27:59 nomen modprobe: FATAL: Module drbd not found.
Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout)
Mar  4 14:27:59 nomen mgmtd: [29904]: info: CIB query: cib
Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout) Could not 
stat("/proc/drbd"): No such file or directory do you need to load the module? 
try: modprobe drbd Command 'drbdsetup /dev/drbd0 disk /dev/sda5 /dev/sda5 internal 
--set-defaults --create-device --on-io-error=pass_on' terminated with exit code 20 
drbdadm attach r0: exited with code 20
Mar  4 14:27:59 nomen drbd[30169]: ERROR: r0 start: not in Secondary mode after 
start.
Mar  4 14:27:59 nomen lrmd: [29900]: WARN: Managed drbd0:0:start process 30169 
exited with return code 1.
Mar  4 14:27:59 nomen crmd: [29903]: info: process_lrm_event: LRM operation 
drbd0:0_start_0 (call=3, rc=1, cib-update=13, confirmed=true) complete unknown 
error
Mar  4 14:27:59 nomen haclient: on_event: from message queue: evt:cib_changed
Mar  4 14:27:59 nomen mgmtd: [29904]: info: CIB query: cib
Mar  4 14:28:00 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
key=41:6:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_notify_0 )
Mar  4 14:28:00 nomen lrmd: [29900]: info: rsc:drbd0:0: 

[Linux-HA] Having issues with getting DRBD to work with Pacemaker

2009-03-04 Thread Jerome Yanga
Hi!  I am having issues with getting DRBD to work with Pacemaker.  I can get 
Pacemaker and DRBD run individually but not DRBD managed by Pacemaker.  I tried 
following the instruction in the site below but the resources will not go 
online.

http://clusterlabs.org/wiki/DRBD_HowTo_1.0

Below is my configuration.

Installed applications:
===
kernel-2.6.18-128.el5
drbd-8.3.0-3
heartbeat-2.99.2-6.1
pacemaker-1.0.1-3.1



drbd.conf:
==
global {
usage-count no;
}

resource r0 {
  protocol C;
  handlers {
pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
pri-lost "echo pri-lost. Have a look at the log files. | mail -s 'DRBD 
Alert' root";
out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh root";
  }
  startup {
 wfc-timeout  0;
  }

  disk {
on-io-error   pass_on;
  }
  net {
 max-buffers 2048;
after-sb-0pri disconnect;
after-sb-1pri disconnect;
after-sb-2pri disconnect;
rr-conflict disconnect;
  }
  syncer {
rate 100M;
al-extents 257;
  }
  on nomen.esri.com {
device /dev/drbd0;
disk   /dev/sda5;
address192.168.0.1:7789;
meta-disk  internal;
  }
  on rubric.esri.com {
device/dev/drbd0;
disk  /dev/sda5;
address   192.168.0.2:7789;
meta-disk internal;
  }
}



Cib.xml:


  

  

  


  
  


  

  
  
  
  


  

  
  


  

  


  



/var/log/messages:
==
Mar  4 14:27:58 nomen crm_resource: [30167]: info: Invoked: crm_resource --meta 
-r ms-drbd0 -p target-role -v Started
Mar  4 14:27:58 nomen cib: [29899]: info: cib_process_xpath: Processing 
cib_query op for 
//cib/configuration/resources//*...@id="ms-drbd0"]//meta_attributes//nvpa...@name="target-role"]
 (/cib/configuration/resources/master/meta_attributes/nvpair[4])
Mar  4 14:27:59 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
key=5:5:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_start_0 )
Mar  4 14:27:59 nomen haclient: on_event:evt:cib_changed
Mar  4 14:27:59 nomen lrmd: [29900]: info: rsc:drbd0:0: start
Mar  4 14:27:59 nomen cib: [30168]: info: write_cib_contents: Wrote version 
0.56.0 of the CIB to disk (digest: 2365d9802f1b9c55e0ed87b8ebda5db3)
Mar  4 14:27:59 nomen cib: [30168]: info: retrieveCib: Reading cluster 
configuration from: /var/lib/heartbeat/crm/cib.xml (digest: 
/var/lib/heartbeat/crm/cib.xml.sig)
Mar  4 14:27:59 nomen cib: [29899]: info: Managed write_cib_contents process 
30168 exited with return code 0.
Mar  4 14:27:59 nomen modprobe: FATAL: Module drbd not found.
Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout)
Mar  4 14:27:59 nomen mgmtd: [29904]: info: CIB query: cib
Mar  4 14:27:59 nomen lrmd: [29900]: info: RA output: (drbd0:0:start:stdout) 
Could not stat("/proc/drbd"): No such file or directory do you need to load the 
module? try: modprobe drbd Command 'drbdsetup /dev/drbd0 disk /dev/sda5 
/dev/sda5 internal --set-defaults --create-device --on-io-error=pass_on' 
terminated with exit code 20 drbdadm attach r0: exited with code 20
Mar  4 14:27:59 nomen drbd[30169]: ERROR: r0 start: not in Secondary mode after 
start.
Mar  4 14:27:59 nomen lrmd: [29900]: WARN: Managed drbd0:0:start process 30169 
exited with return code 1.
Mar  4 14:27:59 nomen crmd: [29903]: info: process_lrm_event: LRM operation 
drbd0:0_start_0 (call=3, rc=1, cib-update=13, confirmed=true) complete unknown 
error
Mar  4 14:27:59 nomen haclient: on_event: from message queue: evt:cib_changed
Mar  4 14:27:59 nomen mgmtd: [29904]: info: CIB query: cib
Mar  4 14:28:00 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
key=41:6:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_notify_0 )
Mar  4 14:28:00 nomen lrmd: [29900]: info: rsc:drbd0:0: notify
Mar  4 14:28:00 nomen lrmd: [29900]: info: Managed drbd0:0:notify process 30310 
exited with return code 0.
Mar  4 14:28:00 nomen crmd: [29903]: info: process_lrm_event: LRM operation 
drbd0:0_notify_0 (call=4, rc=0, cib-update=14, confirmed=true) complete ok
Mar  4 14:28:00 nomen haclient: on_event: from message queue: evt:cib_changed
Mar  4 14:28:00 nomen haclient: on_event: from message queue: evt:cib_changed
Mar  4 14:28:00 nomen mgmtd: [29904]: info: CIB query: cib
Mar  4 14:28:01 nomen crmd: [29903]: info: do_lrm_rsc_op: Performing 
key=2:6:0:d4b86e31-ca4a-4033-8437-6486622eb19f op=drbd0:0_stop_0 )
Mar  4 14:28:01 nomen lrmd: [29900]: info: rsc:drbd0:0: stop
Mar  4 14:28:01 nomen lrmd: [29900]: info: Managed drbd0:0:stop process 30324 
exited with return code 0.
Mar  4 14:28:01 nomen crmd: [29903]: info: process_lrm_event: LRM operation 

Re: [Linux-HA] crm_mon vs cl_status

2009-03-04 Thread Harakiri

Thanks for answering, 


--- On Wed, 3/4/09, Andrew Beekhof  wrote:

> 
> crm_mon takes other things into account.
> but without logs or the current cib its impossible to say
> for sure why
> this is happening.


after a reboot, or restart the following log information are found in ha-debug

http://pastebin.com/m7d9c71f7

note the only error is :

mgmtd[5612]: 2009/03/04_16:58:25 ERROR: socket_client_channel_new: 
open(/var/lib/heartbeat/run/heartbeat/lrm_cmd_sock, ...) failure: No such file 
or directory

but it exists - its probably a race condition and created later:

ls -la /var/lib/heartbeat/run/heartbeat/lrm_cmd_sock
prwxrwxrwx   1 root root   0 Mar  4 16:58 
/var/lib/heartbeat/run/heartbeat/lrm_cmd_sock|

At this point, cibadmin etc will not work and hang because they cant seem to 
connect to the crmd, crm_mon will indicate the note as offline

After killing crmd the following log information is found:

http://pastebin.com/m29a3ec9d

crmd[5644]: 2009/03/04_17:06:29 info: do_cib_control: CIB connection established

etc

So it seems that on the initial start crmd does not correctly initialize, maybe 
the cib process has to be started before crmd?

Maybe its related to the issue that under solaris sparc PIPES are used instead 
of sockets for communication

PIPES were introduced because of this patch

http://www.mail-archive.com/linux-ha-...@lists.linux-ha.org/msg00307.html

since i have solaris 10 i tried to use streams but i dont find the ucred.h 
anywere for solaris.

Any ideas? How can i modify the "Starting child client" in different order?

Thanks


  
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: testing watchdog

2009-03-04 Thread Michael Schwartzkopff
Am Mittwoch, 4. März 2009 14:47:32 schrieb Priyanka Ranjan:
> Hi All,
> when we configure watchdog in sbd stonith. sbd daemon keeps monitoring the
> watchdog and if it finds that watchdog is not updated  then it resets the
> node.  can anyone tell me which daemon update watchdog perodically.
>
> Thanks,

See:
http://www.linux-ha.org/softdog
and the links therein for a beginning.

-- 
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: mi...@multinet.de
web: www.multinet.de

Sitz der Gesellschaft: 85630 Grasbrunn
Registergericht: Amtsgericht München HRB 114375
Geschäftsführer: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


[Linux-HA] Re: testing watchdog

2009-03-04 Thread Priyanka Ranjan
Hi All,
when we configure watchdog in sbd stonith. sbd daemon keeps monitoring the
watchdog and if it finds that watchdog is not updated  then it resets the
node.  can anyone tell me which daemon update watchdog perodically.

Thanks,

On Tue, Mar 3, 2009 at 5:32 PM, Priyanka Ranjan wrote:

> Hi All,
> i have configured watchdog timer in sbd stonith.  i want to test , whether
> watchdog is resetting  the node or not.  the defualt value of watchdog timer
> (timeout)  is 5 sec , and msgwait (Timeout )  is 10 sec. i thought if i
> could reduce the msgwait to 4 sec , it would result in node reset as
> watchdog will be not get updated in 4 sec. i tried to reduce the msgwait
> timeout with -4 option but it did not went through , i searched and found
> that it is bug in heartbeat-2.99.3-3.2 .   due to some reason i cant upgrade
> heartbeat to latest version now.
>
> is there any other way to test whether watchdog is working fine or not.
>
> Thanks &  Best Regards,
> Priyanka.
>
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Michael Schwartzkopff
Am Mittwoch, 4. März 2009 10:30:56 schrieb Andrew Beekhof:
> On Wed, Mar 4, 2009 at 09:24, Michael Schwartzkopff  
wrote:
> > Hi,
> >
> > these packets are compiled on lenny. I wrote a script that gets the
> > latest sources (see http://www.clusterlabs.org/wiki/Install#From_Source),
> > patches them to get rid of openais, compiles and builds the packages.
> >
> > Sorry for missing OpenAIS support, but I am quite familiar with heartbeat
> > and did not have the time to check out openais. I know that I have to
> > move on somewhen ...
>
> Btw. The latest stable OpenAIS code from upstream SVN (Whitetank
> branch) has all the bits needed to support Pacemaker.
> So there's no longer any need to be using the "modified" versions from
> clusterlabs.org- just go straight to the upstream project.


Good news! I have to get used to OpenAIS ...

-- 
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: mi...@multinet.de
web: www.multinet.de

Sitz der Gesellschaft: 85630 Grasbrunn
Registergericht: Amtsgericht München HRB 114375
Geschäftsführer: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


RE: [Linux-HA] HA fails when stopping master director

2009-03-04 Thread Alejandro Sánchez Meroño
Hello again, 

I would like to enclose these log lines from /var/log/messages on director1 and 
director2 for if it might give a clue: 

*** From /var/log/messages on director1 ***

After typing: "/etc/init.d/heartbeat start" on director1:

Mar  4 10:15:32 director1 ldirectord[22403]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf status
Mar  4 10:15:32 director1 ldirectord[22403]: Exiting with exit_status 3: 
Exiting from ldirectord status
Mar  4 10:15:33 director1 ldirectord[22440]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf status
Mar  4 10:15:33 director1 ldirectord[22440]: Exiting with exit_status 3: 
Exiting from ldirectord status
Mar  4 10:15:34 director1 ldirectord[22456]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf start
Mar  4 10:15:34 director1 ldirectord[22456]: Starting Linux Director 
v1.186-ha-2.1.3 as daemon
Mar  4 10:15:34 director1 ldirectord[22458]: Added virtual server: 
172.25.146.31:80
Mar  4 10:15:34 director1 kernel: [144096.978975] IPVS: stopping backup sync 
thread 22208 ...
Mar  4 10:15:34 director1 kernel: [144097.227697] IPVS: sync thread started: 
state = MASTER, mcast_ifn = eth0, syncid = 0
Mar  4 10:15:35 director1 ldirectord[22458]: Added fallback server: 
127.0.0.1:80 (172.25.146.31:80) (Weight set to 1)
Mar  4 10:15:35 director1 ldirectord[22458]: Quiescent real server: 
172.25.146.38:80 (172.25.146.31:80) (Weight set to 0)
Mar  4 10:15:35 director1 ldirectord[22458]: Quiescent real server: 
172.25.146.37:80 (172.25.146.31:80) (Weight set to 0)
Mar  4 10:15:36 director1 ldirectord[22458]: Restored real server: 
172.25.146.37:80 (172.25.146.31:80) (Weight set to 1)
Mar  4 10:15:36 director1 ldirectord[22458]: Deleted fallback server: 
127.0.0.1:80 (172.25.146.31:80)
Mar  4 10:15:36 director1 ldirectord[22458]: Restored real server: 
172.25.146.38:80 (172.25.146.31:80) (Weight set to 1)

After typing: "/etc/init.d/heartbeat start" on director2

Mar  4 10:20:16 director1 ldirectord[22859]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf status
Mar  4 10:20:17 director1 ldirectord[22859]: ldirectord for 
/etc/ha.d/ldirectord.cf is running with pid: 22458
Mar  4 10:20:17 director1 ldirectord[22859]: Exiting from ldirectord status
Mar  4 10:20:17 director1 ldirectord[22875]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf start

After typing: "/etc/init.d/heartbeat stop" on director1

Mar  4 10:26:12 director1 kernel: [144734.478909] IPVS: stopping master sync 
thread 22530 ...
Mar  4 10:26:12 director1 kernel: [144734.693492] IPVS: sync thread started: 
state = BACKUP, mcast_ifn = eth0, syncid = 0
Mar  4 10:26:12 director1 ldirectord[23144]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf stop
Mar  4 10:26:13 director1 ldirectord[22458]: Purged real server (stop): 
172.25.146.37:80 (172.25.146.31:80)
Mar  4 10:26:13 director1 ldirectord[22458]: Purged real server (stop): 
172.25.146.38:80 (172.25.146.31:80)
Mar  4 10:26:13 director1 ldirectord[22458]: Purged virtual server (stop): 
172.25.146.31:80
Mar  4 10:26:13 director1 ldirectord[22458]: Linux Director Daemon terminated 
on signal: TERM


*** From /var/log/messages on director1 ***

After typing: "/etc/init.d/heartbeat start" on director2

Mar  4 10:18:43 director2 ldirectord[23274]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf status
Mar  4 10:18:43 director2 ldirectord[23274]: Exiting with exit_status 3: 
Exiting from ldirectord status
Mar  4 10:19:09 director2 ldirectord[23905]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf stop

After typing: "/etc/init.d/heartbeat stop" on director1

Mar  4 10:25:07 director2 ldirectord[23996]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf status
Mar  4 10:25:07 director2 ldirectord[23996]: Exiting with exit_status 3: 
Exiting from ldirectord status
Mar  4 10:25:08 director2 ldirectord[24012]: Invoking ldirectord invoked as: 
/etc/ha.d/resource.d/ldirectord ldirectord.cf start
Mar  4 10:25:08 director2 ldirectord[24012]: Starting Linux Director 
v1.186-ha-2.1.3 as daemon
Mar  4 10:25:08 director2 ldirectord[24014]: Added virtual server: 
172.25.146.31:80
Mar  4 10:25:08 director2 ldirectord[24014]: Added fallback server: 
127.0.0.1:80 (172.25.146.31:80) (Weight set to 1)
Mar  4 10:25:09 director2 ldirectord[24014]: Quiescent real server: 
172.25.146.38:80 (172.25.146.31:80) (Weight set to 0)
Mar  4 10:25:09 director2 ldirectord[24014]: Quiescent real server: 
172.25.146.37:80 (172.25.146.31:80) (Weight set to 0)
Mar  4 10:25:09 director2 ldirectord[24014]: Restored real server: 
172.25.146.37:80 (172.25.146.31:80) (Weight set to 1)
Mar  4 10:25:09 director2 ldirectord[24014]: Deleted fallback server: 
127.0.0.1:80 (172.25.146.31:80)
Mar  4 10:25:10 director2 ldirectord[24014]: Restored real se

Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Andrew Beekhof
On Wed, Mar 4, 2009 at 09:24, Michael Schwartzkopff  wrote:
> Hi,
>
> these packets are compiled on lenny. I wrote a script that gets the latest
> sources (see http://www.clusterlabs.org/wiki/Install#From_Source), patches
> them to get rid of openais, compiles and builds the packages.
>
> Sorry for missing OpenAIS support, but I am quite familiar with heartbeat and
> did not have the time to check out openais. I know that I have to move on
> somewhen ...

Btw. The latest stable OpenAIS code from upstream SVN (Whitetank
branch) has all the bits needed to support Pacemaker.
So there's no longer any need to be using the "modified" versions from
clusterlabs.org- just go straight to the upstream project.
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] crm_mon vs cl_status

2009-03-04 Thread Andrew Beekhof
On Wed, Mar 4, 2009 at 01:18, Harakiri  wrote:
>
> Hi,
>
> i got 2.1.4 to work on sparc solaris 10, the only issue left is that crm_mon 
> reports wrong node status (node as offline). Whereas cl_status works more 
> reliably in indicating that the local node is online.

crm_mon takes other things into account.
but without logs or the current cib its impossible to say for sure why
this is happening.

>
> To get the crm_mon status uptodate i can kill the crmd, then hb automatically 
> starts crmd again and the crm_mon status is the correct one.
>
> However, when i use cl_status while heartbeat is offline - no error and no 
> message is shown - on debian for example, an error is shown like  ERROR: 
> Cannot signon with heartbeat.
>
> So while cl_status seems to reliably show the node as online, the tool does 
> not work as expected when hb is offline.
>
> Whats the difference between the two now, when should you use cl_status and 
> when crm_mon --one-shot ?
>
>
>
> ___
> Linux-HA mailing list
> Linux-HA@lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] New experimental debian repository

2009-03-04 Thread Michael Schwartzkopff
Am Dienstag, 3. März 2009 12:32:35 schrieb Michael Schwartzkopff:
> Hi,
>
> I set up a new experimental repository for the debian distro:
>
> - It contains heartbeat-2, pacemaker and pacemaker-gui
> - No OpenAIS for now. Please mail me if you need it.
> - I compiled it with debian lenny, so it will not work with etch.
> - The files are generated automatically every night. The repository is
> updated if the compile was successful. So you will find always the latest
> versions here.
> - The files are only for i386 arch. Sorry, but I do not have access to a
> x64 machine. Donations are welcome ;-)
>
> You can include it directly in your /etc/apt/sources with:
> deb   http://www.multinet.de/ experimental main
>
> Of corse you also download the files ans install them with dpkg -i
>
> Please mail me about any errors. The compile went OK, but you never know.
>
> Have fun!

Hi,

I see some access to the wrong directories in my apache log. You find the files 
under:

http://www.multinet.de/debian/dists/

for the latest experimental files see:
http://www.multinet.de/debian/dists/experimental/main/binary-i386/
-- 
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: mi...@multinet.de
web: www.multinet.de

Sitz der Gesellschaft: 85630 Grasbrunn
Registergericht: Amtsgericht München HRB 114375
Geschäftsführer: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Michael Schwartzkopff
Am Mittwoch, 4. März 2009 07:59:16 schrieb Thomas Mueller:
> On Wed, 04 Mar 2009 09:13:31 +1100, Simon Horman wrote:
> > On Tue, Mar 03, 2009 at 04:24:06PM +0100, Michael Schwartzkopff wrote:
> >> Am Dienstag, 3. März 2009 15:33:38 schrieb Michael Schwartzkopff:
> >> > Am Dienstag, 3. März 2009 12:32:35 schrieb Michael Schwartzkopff:
> >> > > Hi,
> >> > >
> >> > > I set up a new experimental repository for the debian distro:
> >> > >
> >> > > - It contains heartbeat-2, pacemaker and pacemaker-gui - No OpenAIS
> >> > > for now. Please mail me if you need it. - I compiled it with debian
> >> > > lenny, so it will not work with etch. - The files are generated
> >> > > automatically every night. The repository is updated if the compile
> >> > > was successful. So you will find always the latest versions here.
> >> > > - The files are only for i386 arch. Sorry, but I do not have access
> >> > > to a x64 machine. Donations are welcome ;-)
> >
> > Hi Michael,
> >
> > which debian distro do these packages target? If it is sid or
> > experimental, are they based off the packages that I have on
> > http://packages.vergenet.net/debian/experimental/ ? I know that I have
> > been slow on that front, but it would be good to combine any efforts in
> > that area.

Hi,

these packets are compiled on lenny. I wrote a script that gets the latest 
sources (see http://www.clusterlabs.org/wiki/Install#From_Source), patches 
them to get rid of openais, compiles and builds the packages. 

Sorry for missing OpenAIS support, but I am quite familiar with heartbeat and 
did not have the time to check out openais. I know that I have to move on 
somewhen ...

If you want I can send you my scripts and patches.

Greetings,

-- 
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: mi...@multinet.de
web: www.multinet.de

Sitz der Gesellschaft: 85630 Grasbrunn
Registergericht: Amtsgericht München HRB 114375
Geschäftsführer: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Re: [Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Michael Schwartzkopff
Am Mittwoch, 4. März 2009 07:59:16 schrieb Thomas Mueller:
> On Wed, 04 Mar 2009 09:13:31 +1100, Simon Horman wrote:
> > On Tue, Mar 03, 2009 at 04:24:06PM +0100, Michael Schwartzkopff wrote:
> >> Am Dienstag, 3. März 2009 15:33:38 schrieb Michael Schwartzkopff:
> >> > Am Dienstag, 3. März 2009 12:32:35 schrieb Michael Schwartzkopff:
> >> > > Hi,
> >> > >
> >> > > I set up a new experimental repository for the debian distro:
> >> > >
> >> > > - It contains heartbeat-2, pacemaker and pacemaker-gui - No OpenAIS
> >> > > for now. Please mail me if you need it. - I compiled it with debian
> >> > > lenny, so it will not work with etch. - The files are generated
> >> > > automatically every night. The repository is updated if the compile
> >> > > was successful. So you will find always the latest versions here.
> >> > > - The files are only for i386 arch. Sorry, but I do not have access
> >> > > to a x64 machine. Donations are welcome ;-)
> >
> > Hi Michael,
> >
> > which debian distro do these packages target? If it is sid or
> > experimental, are they based off the packages that I have on
> > http://packages.vergenet.net/debian/experimental/ ? I know that I have
> > been slow on that front, but it would be good to combine any efforts in
> > that area.
>
> my vote for "combine any efforts in that area". would be nice to have one
> good repository for hb2.99/pacemaker on debian. if needed, i can compile
> on amd64.

Ok. If you compile it I can host the files afterwards. Or perhaps Simon wants 
to help also.

Michael
-- 
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: mi...@multinet.de
web: www.multinet.de

Sitz der Gesellschaft: 85630 Grasbrunn
Registergericht: Amtsgericht München HRB 114375
Geschäftsführer: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


[Linux-HA] Re: New experimental debian repository

2009-03-04 Thread Thomas Mueller
On Wed, 04 Mar 2009 09:13:31 +1100, Simon Horman wrote:

> On Tue, Mar 03, 2009 at 04:24:06PM +0100, Michael Schwartzkopff wrote:
>> Am Dienstag, 3. März 2009 15:33:38 schrieb Michael Schwartzkopff:
>> > Am Dienstag, 3. März 2009 12:32:35 schrieb Michael Schwartzkopff:
>> > > Hi,
>> > >
>> > > I set up a new experimental repository for the debian distro:
>> > >
>> > > - It contains heartbeat-2, pacemaker and pacemaker-gui - No OpenAIS
>> > > for now. Please mail me if you need it. - I compiled it with debian
>> > > lenny, so it will not work with etch. - The files are generated
>> > > automatically every night. The repository is updated if the compile
>> > > was successful. So you will find always the latest versions here.
>> > > - The files are only for i386 arch. Sorry, but I do not have access
>> > > to a x64 machine. Donations are welcome ;-)


> 
> Hi Michael,
> 
> which debian distro do these packages target? If it is sid or
> experimental, are they based off the packages that I have on
> http://packages.vergenet.net/debian/experimental/ ? I know that I have
> been slow on that front, but it would be good to combine any efforts in
> that area.

my vote for "combine any efforts in that area". would be nice to have one 
good repository for hb2.99/pacemaker on debian. if needed, i can compile 
on amd64.

- Thomas

___
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems