Re: [Pacemaker] communication channels howto

2010-08-26 Thread Dan Frincu
lxnf9...@comcast.net wrote: On Thu, 26 Aug 2010, lxnf9...@comcast.net wrote: On Thu, 26 Aug 2010, Dan Frincu wrote: In OpenAIS for example, in /etc/ais/openais.conf you have a directive called interface. In this directive you specify a ringnumber, bindnetaddr, mcastaddr and mcastport. C

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Michael Smith
On Thu, 26 Aug 2010, Tim Serong wrote: > > for now I have stonith-enabled="false" in > > my CIB. Is there a way to make clvmd/dlm respect it? > > No. At least, I don't think so, and/or I hope not :) I think I'd consider it a bug: I've disabled stonith, so dlm shouldn't wait forever for a fe

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Tim Serong
On 8/27/2010 at 01:49 PM, Michael Smith wrote: > On Thu, 26 Aug 2010, Tim Serong wrote: > > > > Aug 26 18:31:51 xen-test1 cluster-dlm[8870]: fence_node_time: Node > > > 236655788/xen-test2 has not been shot yet > > > Do you have STONITH configured? Note that it says "xen-test2 has not

[Pacemaker] Resource stop during migration

2010-08-26 Thread Michael Smith
Hi, I have a pacemaker setup using the Xen resource agent and I've found something weird during migration: if a VM is in the middle of live-migrating from node 1 to node 2, and I stop the resource in crm, pacemaker forgets about the migration and immediately thinks the resource is stopped, alt

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Michael Smith
On Thu, 26 Aug 2010, Tim Serong wrote: > > Aug 26 18:31:51 xen-test1 cluster-dlm[8870]: fence_node_time: Node > > 236655788/xen-test2 has not been shot yet > Do you have STONITH configured? Note that it says "xen-test2 has not > been shot yet" and "clvmd ... not fenced". It's just going to si

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Tim Serong
On 8/27/2010 at 08:50 AM, Michael Smith wrote: >> Xinwei Hu writes: > > > > > That sounds worrying actually. > > > I think this is logged as bug 585419 on SLES' bugzilla. > > > If you can reproduce this issue, it worths to reopen it I think. > > I've got a pair of fully patched SLES11

Re: [Pacemaker] About specifications of on-fail="block".

2010-08-26 Thread renayama19661014
Hi Andrew, I registered this problem on Bugzilla. * http://developerbugs.linux-foundation.org/show_bug.cgi?id=2476 Best Regards, Hideo Yamauchi. --- renayama19661...@ybb.ne.jp wrote: > Hi, > > I compared movement in a version of pacemaker about this problem. > > * 1.0.9-74392a28b7f31d7dd

Re: [Pacemaker] A demand for the expected votes indication and a question.

2010-08-26 Thread renayama19661014
Hi Andrew, > crm_mon shouldn't really display expected votes for heartbeat > clusters... they're not used in any way when heartbeat is in use. > expected votes is only relevant for ver: 0 of the pacemaker/corosync plugin. > in the future pacemaker will obtain quorum information directly from > cor

Re: [Pacemaker] [PATCH]The changing of the log level of pengine process.

2010-08-26 Thread renayama19661014
Hi Andrew, Thank you for comment. > Why not simply remove the if(was_processing_error) block? > Its just a summary message, the place that set was_processing_error > will also have logged an error. Is this meaning to abolish the next code? - if(was_processing_error) { -

Re: [Pacemaker] Best way to find master node

2010-08-26 Thread Bob Schatz
Thanks - filed as 2477 Thanks, Bob - Original Message From: Andrew Beekhof To: The Pacemaker cluster resource manager Sent: Wed, August 25, 2010 11:22:24 PM Subject: Re: [Pacemaker] Best way to find master node On Wed, Aug 25, 2010 at 6:39 PM, Bob Schatz wrote: > Yes it does. Ok.

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Michael Smith
> Xinwei Hu writes: > > > That sounds worrying actually. > > I think this is logged as bug 585419 on SLES' bugzilla. > > If you can reproduce this issue, it worths to reopen it I think. I've got a pair of fully patched SLES11 SP1 nodes and they're showing what I guess is the same behaviour: if

Re: [Pacemaker] clmvd hangs on node1 if node2 is fenced

2010-08-26 Thread Michael Smith
Xinwei Hu writes: > 2010/8/16 Rainer Lutz : > > Xinwei Hu writes: > >> This sounds a like a fixed issue for SLE11SP1 indeed. > > Well it is not fixed with SP1, but with some Patch after SP1 - don`t know > > which thou, as the clvmd is the same for SP1 before and after Online > > Patches. > T

[Pacemaker] resource locations for cloned resources (asymmetric cluster)

2010-08-26 Thread Bernd Schubert
Hi all, I'm trying to start a pingd clone resource on an asymmetric cluster. I specified locations, but it still refuses to start pingd === [r...@vrhel5-mds1 ha.d]# cat pingd.cib primitive pingdnet1 ocf:pacemaker:pingd \ params

[Pacemaker] resource locations for cloned resources (asymmetric cluster)

2010-08-26 Thread Bernd Schubert
Hi all, I'm trying to start a pingd clone resource on an asymmetric cluster. I specified locations, but it still refuses to start pingd === [r...@vrhel5-mds1 ha.d]# cat pingd.cib primitive pingdnet1 ocf:pacemaker:pingd \ params h

Re: [Pacemaker] communication channels howto

2010-08-26 Thread lxnf98mm
On Thu, 26 Aug 2010, lxnf9...@comcast.net wrote: On Thu, 26 Aug 2010, Dan Frincu wrote: In OpenAIS for example, in /etc/ais/openais.conf you have a directive called interface. In this directive you specify a ringnumber, bindnetaddr, mcastaddr and mcastport. Configuring 2 communication chann

Re: [Pacemaker] how to keep ftp connection when swap from primary to secondary

2010-08-26 Thread Liang.Ma
Hi, I installed ipvsadm and ran ipvsadm --start-daemon=master --mcast-interface=eth0 in master node and ipvsadm --start-daemon=backup --mcast-interface=eth0 in backup node. But still i lost ftp connection during node swap. Liang Ma Contractuel | Consultant | SED Systems Inc. Ground Systems

Re: [Pacemaker] how to keep ftp connection when swap from primary to secondary

2010-08-26 Thread Liang.Ma
Thank you Raoul. I will try it. Liang Ma Contractuel | Consultant | SED Systems Inc. Ground Systems Analyst Agence spatiale canadienne | Canadian Space Agency 6767, Route de l'Aéroport, Longueuil (St-Hubert), QC, Canada, J3Y 8Y9 Tél/Tel : (450) 926-5099 | Téléc/Fax: (450) 926-5083 Courriel/E-mail

Re: [Pacemaker] communication channels howto

2010-08-26 Thread lxnf98mm
On Thu, 26 Aug 2010, Dan Frincu wrote: In OpenAIS for example, in /etc/ais/openais.conf you have a directive called interface. In this directive you specify a ringnumber, bindnetaddr, mcastaddr and mcastport. Configuring 2 communication channels means using adding rrp_mode: passive, ringnumber

Re: [Pacemaker] communication channels howto

2010-08-26 Thread Dan Frincu
In OpenAIS for example, in /etc/ais/openais.conf you have a directive called interface. In this directive you specify a ringnumber, bindnetaddr, mcastaddr and mcastport. Configuring 2 communication channels means using adding rrp_mode: passive, ringnumber 0 and ringnumber 1, two interface direc

Re: [Pacemaker] how to keep ftp connection when swap from primary to secondary

2010-08-26 Thread Raoul Bhatia [IPAX]
On 08/26/2010 04:42 PM, liang...@asc-csa.gc.ca wrote: > I have followed the guide in “Clusters from Scratch” written by Andrew > Beekhof and successfully setup an Active/Passive pair of cluster > servers. The cluster runs in Fedora 13 and includes services like > apache, vsftpd and nfs. Drbd is use

Re: [Pacemaker] how to keep ftp connection when swap from primary to secondary

2010-08-26 Thread Liang.Ma
Thanks Jimbob for your quick response. So no one bothers to develop something to keep the ftp state in RAM synchronized? Or maybe it is not possible? Liang Ma Contractuel | Consultant | SED Systems Inc. Ground Systems Analyst Agence spatiale canadienne | Canadian Space Agency 6767, Route de l'

Re: [Pacemaker] how to keep ftp connection when swap from primary to secondary

2010-08-26 Thread jimbob palmer
The ftp connection is in the ram of the machine that dies, so for this to work you'd need to synchronise state to the other machine the whole time. I don't know of a cluster aware ftp server that can do this. 2010/8/26 : > Hi There, > > > > I have followed the guide in “Clusters from Scratch” wri

[Pacemaker] drbd diskless -> failover to other node

2010-08-26 Thread jimbob palmer
How can I configure pacemaker to failover when the primary node goes diskless? Many thanks. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting

[Pacemaker] how to keep ftp connection when swap from primary to secondary

2010-08-26 Thread Liang.Ma
Hi There, I have followed the guide in "Clusters from Scratch" written by Andrew Beekhof and successfully setup an Active/Passive pair of cluster servers. The cluster runs in Fedora 13 and includes services like apache, vsftpd and nfs. Drbd is used to allow data consistence during a failover

Re: [Pacemaker] Shared Storage

2010-08-26 Thread Ruiyuan Jiang
Hi, Andrew Understood that. I am asking any recommendation for storage management under Packmaker. Ryan -Original Message- From: Andrew Beekhof [mailto:and...@beekhof.net] Sent: Thursday, August 26, 2010 3:43 AM To: The Pacemaker cluster resource manager Subject: Re: [Pacemaker] Shared

Re: [Pacemaker] IPaddr2 not failing-over

2010-08-26 Thread Vince Gabriel
Thanks Andrew! I followed the example, it's working as expected! -Vince -- Vince Gabriel Field Technical Analyst SGI office: 361.729.9151 cell: 409.392.8083 > -Original Message- > From: Andrew Beekhof [mailto:and...@beekhof.net] > Sent: Thursday, August 26, 2010 2:48 AM > To: The

[Pacemaker] communication channels howto

2010-08-26 Thread lxnf98mm
The DRBD manual says It is absolutely vital to configure at least two independent OpenAIS communication channels for this functionality to work correctly. My Google'n has not yielded any results in the how to do this department I have DRBD configured and working properly with one channel Where

Re: [Pacemaker] Quorum disk?

2010-08-26 Thread Ciro Iriarte
2010/8/26 Dejan Muhamedagic : > Hi, > > On Wed, Aug 25, 2010 at 05:01:51PM -0400, Ciro Iriarte wrote: >> Hi, I'm planning to use OpanAIS+Pacemaker on SLES11-SP1 and would like >> to know if it's possible to use a quorum disk in a two-node cluster. >> The idea is to avoid adding a third node just fo

Re: [Pacemaker] Help controlling target-role additions to the cib after a start or stop of a resource?

2010-08-26 Thread Andrew Beekhof
On Wed, Aug 25, 2010 at 5:49 PM, Vince Gabriel wrote: > The primitive below is part of a group. If the resource (HA1-ip) is > start/stop independently of its group, the target-role="Started" is added to > the CIB. Is there a way to prevent that from happening? No. > > primitive HA1-ip ocf:heartb

Re: [Pacemaker] revision patch of crm_mon

2010-08-26 Thread Andrew Beekhof
Looks good, i'll apply today 2010/8/26 Yuusuke IIDA : > Hi, Andrew > > I made a patch to revise an attribute information indication function of > crm_mon. > > The following is a revision point. >  - I sort the attribute information indication of the node in ascending order > and display it. >  -

Re: [Pacemaker] Designated reaction of Pacemaker to monitor-op returning rc=7 (OCF_NOT_RUNNING)

2010-08-26 Thread Andrew Beekhof
On Thu, Aug 26, 2010 at 10:42 AM, Dejan Muhamedagic wrote: > Hi, > > On Thu, Aug 26, 2010 at 08:20:46AM +0200, Andrew Beekhof wrote: >> On Wed, Aug 25, 2010 at 4:00 PM, Dejan Muhamedagic >> wrote: >> > Hi, >> > >> > On Tue, Aug 24, 2010 at 05:19:23PM +0200, Cnut Jansen wrote: >> >> Hi, >> >> >>

[Pacemaker] revision patch of crm_mon

2010-08-26 Thread Yuusuke IIDA
Hi, Andrew I made a patch to revise an attribute information indication function of crm_mon. The following is a revision point. - I sort the attribute information indication of the node in ascending order and display it. - A state does not display the attribute information of the node of OFFLI

Re: [Pacemaker] Quorum disk?

2010-08-26 Thread Dejan Muhamedagic
Hi, On Wed, Aug 25, 2010 at 05:01:51PM -0400, Ciro Iriarte wrote: > Hi, I'm planning to use OpanAIS+Pacemaker on SLES11-SP1 and would like > to know if it's possible to use a quorum disk in a two-node cluster. > The idea is to avoid adding a third node just for quorum... No quorum disks, sorry. T

Re: [Pacemaker] Designated reaction of Pacemaker to monitor-op returning rc=7 (OCF_NOT_RUNNING)

2010-08-26 Thread Dejan Muhamedagic
Hi, On Thu, Aug 26, 2010 at 08:20:46AM +0200, Andrew Beekhof wrote: > On Wed, Aug 25, 2010 at 4:00 PM, Dejan Muhamedagic > wrote: > > Hi, > > > > On Tue, Aug 24, 2010 at 05:19:23PM +0200, Cnut Jansen wrote: > >> Hi, > >> > >> just (for now) a short question for to make sure I didn't miss anythin

Re: [Pacemaker] Designated reaction of Pacemaker to monitor-op returning rc=7 (OCF_NOT_RUNNING)

2010-08-26 Thread Dejan Muhamedagic
Hi, On Wed, Aug 25, 2010 at 08:56:08PM +0200, Cnut Jansen wrote: > Am 25.08.2010 16:00, schrieb Dejan Muhamedagic: > > Hi, > > > > On Tue, Aug 24, 2010 at 05:19:23PM +0200, Cnut Jansen wrote: > >> Hi, > >> > >> just (for now) a short question for to make sure I didn't miss anything: > >> What's t

Re: [Pacemaker] cluster-dlm: set_fs_notified: set_fs_notified no nodeid 1812048064#012

2010-08-26 Thread Dejan Muhamedagic
Hi, On Thu, Aug 26, 2010 at 09:36:10AM +0200, Andrew Beekhof wrote: > On Wed, Aug 18, 2010 at 6:24 PM, Roberto Giordani > wrote: > > Hello, > > I'll explain what’s happened after a network black-out > > I've a cluster with pacemaker on Opensuse 11.2 64bit > > > > Last updated: Wed A

Re: [Pacemaker] IPaddr2 not failing-over

2010-08-26 Thread Andrew Beekhof
On Wed, Aug 11, 2010 at 10:55 PM, Vince Gabriel wrote: > Hi everyone, > > I have new cluster that is works exceptionally well with the exception of > the IPaddr2 virtual interfaces initiated failovers. If the interface is > downed or cable disconnected, a failover never happens. I’ve attempted to

Re: [Pacemaker] Shared Storage

2010-08-26 Thread Andrew Beekhof
On Thu, Aug 19, 2010 at 9:22 PM, Ruiyuan Jiang wrote: > Hi, > > My testing two node (openais and corosync) cluster is up and running (RHEL > v5.5). Now I'd like to create LVM disk storage for the cluster for failover. > The cluster is attached to an EMC Symmetrix SAN. The same LUNs from the EMC

Re: [Pacemaker] Howto upgrade Pacemaker cluster from Version: 1.0.2 to the last released on clusterlabs

2010-08-26 Thread Andrew Beekhof
On Wed, Aug 18, 2010 at 11:15 PM, Roberto Giordani wrote: > Hello, > I'd like to know how is it possible to upgrade a running cluster > pacemaker on Opensuse 11.2 version 1.02 to the last available on clusterlabs > using dlm + ocfs2 too The problem is that the versions of pacemaker on clusterlabs

Re: [Pacemaker] cluster-dlm: set_fs_notified: set_fs_notified no nodeid 1812048064#012

2010-08-26 Thread Andrew Beekhof
On Wed, Aug 18, 2010 at 6:24 PM, Roberto Giordani wrote: > Hello, > I'll explain what’s happened after a network black-out > I've a cluster with pacemaker on Opensuse 11.2 64bit > > Last updated: Wed Aug 18 18:13:33 2010 > Current DC: nodo1 (nodo1) > Version: 1.0.2-ec6b0bbee1f3aa72c4c

Re: [Pacemaker] Monitor of LVM resources problem

2010-08-26 Thread Andrew Beekhof
On Tue, Aug 17, 2010 at 8:09 PM, wrote: > I have a 3 node cluster running Xen resources on SLES11sp1 with HAE. The > nodes are connected to a SAN and Pacemaker controls the start of the shared > disk. From time to time, monitor of LVM volume groups or ocfs2 file system > fails : this triggers a s

Re: [Pacemaker] Email Notifications for Pacemaker

2010-08-26 Thread Andrew Beekhof
Essentially you need to run crm_mon as a daemon with the appropriate options (check the man page). You can either do this as a resource (IIRC there is an RA already) or start it manually. On Wed, Aug 18, 2010 at 2:56 PM, Mike A Meyer wrote: > Hello, > > I didn't find any documentation on setting

Re: [Pacemaker] Question regarding patch for negative master scores

2010-08-26 Thread Andrew Beekhof
On Fri, Aug 6, 2010 at 7:31 AM, Bob Schatz wrote: > The fix for bug 2358 was: > >         http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/79eee5b16ef3 > > If I am running pacemaker-1.0.6-1 is it safe to only apply this change or am I > asking for trouble? If it applies you're probably safe. Th

Re: [Pacemaker] A demand for the expected votes indication and a question.

2010-08-26 Thread Andrew Beekhof
On Fri, Aug 6, 2010 at 7:25 AM, wrote: > Hi, > > Our user uses corosync and Pacemaker. > > > Last updated: Fri Aug  6 13:25:37 2010 > Stack: openais > Current DC: srv01 - partition with quorum > Version: 1.1.2-230655711dc7b8579747ddeafc6f39247f8e87fc > 3 Nodes configured, 3 expected

Re: [Pacemaker] [Problem]Cib cannot update an attribute by 16 node constitution.

2010-08-26 Thread Andrew Beekhof
On Fri, Aug 6, 2010 at 4:36 AM, nozawat wrote: > Hi Andrew, > >> Is this data all from the CIB process? >> What was the load from the other processes like? (Ie. using top) > The cib process was approximately 100% CPU rates of use, > but most of the other processes did not use a CPU when they confi

Re: [Pacemaker] [PATCH]The changing of the log level of pengine process.

2010-08-26 Thread Andrew Beekhof
On Fri, Aug 6, 2010 at 3:47 AM, wrote: > Hi Andrew, > > Thank you for comment. > >> np :-) >> >> Maybe it would be easier to show the logs and/or crm_mon output with >> and without the patch. > > However, our many users watch error log. > And some users do not like trouble to be notified of in er