[Linux-ha-dev] [ANNOUNCE] Interim heartbeat packages refreshed (2.1.2-24)

2007-11-26 Thread Andrew Beekhof
Just a quick note to say that the packages at http://software.opensuse.org/download/server:/ha-clustering were refreshed today after sufficiently (see pending bugs below) passing automated testing. This will be the last interim release for 2007. I hope they've been useful and we'll be

Re: [Linux-ha-dev] [ANNOUNCE] Interim heartbeat packages refreshed (2.1.2-24)

2007-11-26 Thread Serge Dubrouski
There was an official release planned for December 10th. Is it still on schedule? On Nov 26, 2007 4:49 AM, Andrew Beekhof [EMAIL PROTECTED] wrote: Just a quick note to say that the packages at http://software.opensuse.org/download/server:/ha-clustering were refreshed today after

Re: [Linux-ha-dev] [ANNOUNCE] Interim heartbeat packages refreshed (2.1.2-24)

2007-11-26 Thread Andrew Beekhof
On Nov 26, 2007, at 6:13 PM, Serge Dubrouski wrote: I just tried to build RPM packages in CentOS-5 and got following error: error: types must match error: /usr/src/redhat/SPECS/heartbeat.spec:87: parseExpressionBoolean returns -1 error: Name field must be present in package: (main package)

Re: [Linux-ha-dev] [ANNOUNCE] Interim heartbeat packages refreshed (2.1.2-24)

2007-11-26 Thread Serge Dubrouski
One more problem is that find-lang.sh script (line 426 in heartbeat.spec) fails on my system. The reason could be that I don't have previous versions of Heartbeat installed on the system where I build RPMs. For now I just commented out that line and was able to get RPMs built. On Nov 26, 2007

Re: [Linux-ha-dev] [ANNOUNCE] Interim heartbeat packages refreshed (2.1.2-24)

2007-11-26 Thread Serge Dubrouski
Ok, here it is: Line 87 Instead of :%if is_rhel 6 It should be: %if %{is_rhel} 6 Line 181 Instead of: %define LIBNET_DEVEL It should be: %define LIBNET_DEVEL , Line On Nov 26, 2007 10:26 AM, Serge Dubrouski [EMAIL PROTECTED] wrote: Source. I already fixed that

Re: [Linux-HA] Mounting Filing System

2007-11-26 Thread Andrew Beekhof
configuration? logs? kinda hard to help without either of those two things... On Nov 25, 2007, at 7:06 PM, Jason Snyder wrote: Software: Ubuntu 7.10 Gusty Gibbon (virtual), Heartbeat 2.1.2 (using CRM), DRBD 8.03 (all are versions that come with Ubuntu 7.10), and VMWare 1.0.3 hosting the

Re: [Linux-HA] Fencing prevents resource from failing over

2007-11-26 Thread Andrew Beekhof
On Nov 26, 2007, at 6:25 AM, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Hi, I've a 2 node active/passive cluster ( active node=active , passive node=standby) using heartbeat 2.0.8 . I recently enabled stonith . The stonith device is an rsh device that tries to restart the cluster node.

RE: [Linux-HA] Fencing prevents resource from failing over

2007-11-26 Thread abhishek.bagchi
Thanks Andrew, My comments are inline... -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Andrew Beekhof Sent: Monday, November 26, 2007 1:44 PM To: General Linux-HA mailing list Subject: Re: [Linux-HA] Fencing prevents resource from failing over On Nov

[Linux-HA] Heartbeat OCF IPaddr doesn't support restarts ?!

2007-11-26 Thread abhishek.bagchi
Hi , I'm using heartbeat version 2.0.8 wherein I've configured an IP address alias resource using the heartbeat provided ocf resource IPaddr. However, inspite of having an operation to monitor this resource ( monitor operation is default in all resources ), the resource is not restarted when

[Linux-HA] pingd config

2007-11-26 Thread Paul Surgeon
Hi I've read the pingd doc (http://www.linux-ha.org/pingd) more than 10 times now and I can't figure out how to set up a simple working pingd config. What I'm trying to do is ping a group of IPs and then use that as a constraint to transfer control to another cluster if none of the ping nodes

[Linux-HA] switch active/standby if resource fails is monit a cure?

2007-11-26 Thread holgi
Hi, i have a running hearbeat 1.x /drbd 0.7x cluster. It provides imap and samba services to a lan. What i am in need for, is that the nodes switch if one resouce fails let's say nmbd the nodes should switch. Can i do that monit ? Or must i migrate to hearbeat 2.x ? regards

Re: [Linux-HA] lrmd consuming too much cpu and a lot of logs

2007-11-26 Thread Dejan Muhamedagic
Hi, On Mon, Nov 26, 2007 at 08:30:43AM +0100, Frank wrote: Date: Fri, 23 Nov 2007 13:52:05 +0100 From: Dejan Muhamedagic [EMAIL PROTECTED] Subject: Re: [Linux-HA] lrmd consuming too much cpu and a lot of logs To: General Linux-HA mailing list linux-ha@lists.linux-ha.org Message-ID: [EMAIL

Re: [Linux-HA] Handling HBA SAN

2007-11-26 Thread Emmanuel Lacour
On Wed, Nov 21, 2007 at 05:33:18PM +, Wojciech Turek wrote: Hi, You need STONITH. I use IPMI for this purpose. STONITH will power down node that have problems this will ensure that only one node can access the file system. Thanks, for this advice, I'm now trying to set it up. I

[Linux-HA] V2 auto failback bug or miss config

2007-11-26 Thread Adrian Revill
Hi Im trying to set up a DRBD/Heartbeat pair using V2 I have a working V1 configuration, where if the active node (nodeA) is stopped/dies the standby node (nodeB) takes over the DRBD resources. Then when the stopped/dead node (nodeA) is restarted, the resources stay on nodeB till manually

[Linux-HA] Re: pingd config

2007-11-26 Thread Paul Surgeon
I managed to get a working pingd config going but now I need a way to differentiate between interfaces or be able to spawn pingd for a ping group. I have two ping_groups in my ha.cf file and I need to monitor each group for connectivity. ping_group int_ping_grp 172.17.1.10 172.17.1.11 ping_group

Re: [Linux-HA] Announcing: Sphinx Search daemon OCF resource agent

2007-11-26 Thread Dejan Muhamedagic
Hi, On Mon, Nov 26, 2007 at 08:22:18AM +0100, Christian Rish?j wrote: Dear Linux-HA readers I hereby publish an OCF resource agent for the Sphinx Search daemon [1]. Please use and redistribute as you wish. Many thanks for the contribution. I made some comments, just look for the DM

Re: [Linux-HA] Heartbeat OCF IPaddr doesn't support restarts ?!

2007-11-26 Thread Dejan Muhamedagic
Hi, On Mon, Nov 26, 2007 at 02:30:52PM +0530, [EMAIL PROTECTED] wrote: Hi , I'm using heartbeat version 2.0.8 wherein I've configured an IP address alias resource using the heartbeat provided ocf resource IPaddr. However, inspite of having an operation to monitor this resource ( monitor

Re: [Linux-HA] Fencing prevents resource from failing over

2007-11-26 Thread Lars Marowsky-Bree
On 2007-11-26T10:55:25, [EMAIL PROTECTED] wrote: Hi, I've a 2 node active/passive cluster ( active node=active , passive node=standby) using heartbeat 2.0.8 . I recently enabled stonith . The stonith device is an rsh device that tries to restart the cluster node. What is an rsh stonith

[Linux-HA] [ANNOUNCE] Interim heartbeat packages refreshed (2.1.2-24)

2007-11-26 Thread Andrew Beekhof
Just a quick note to say that the packages at http://software.opensuse.org/download/server:/ha-clustering were refreshed today after sufficiently (see pending bugs below) passing automated testing. This will be the last interim release for 2007. I hope they've been useful and we'll be

RE: [Linux-HA] Heartbeat OCF IPaddr doesn't support restarts ?!

2007-11-26 Thread abhishek.bagchi
Hi Dejan and everyone, My apologies! Ipaddr works FINE. I probably didn't have the monitor operation earlier in my configuration, which I have now. I read somewhere that all resources are monitored by default , but don't know what is the default monitor interval is. Probably it's a large value.

Re: [Linux-HA] Heartbeat OCF IPaddr doesn't support restarts ?!

2007-11-26 Thread Dejan Muhamedagic
Hi, On Mon, Nov 26, 2007 at 05:32:43PM +0530, [EMAIL PROTECTED] wrote: Hi Dejan and everyone, My apologies! Ipaddr works FINE. I probably didn't have the monitor operation earlier in my configuration, which I have now. I read somewhere that all resources are monitored by default , but don't

RE: [Linux-HA] Fencing prevents resource from failing over

2007-11-26 Thread abhishek.bagchi
Hi Andrew, I just modified my stonith device to work in both online and offline mode. The stonith operation (standby - active) is successful with the active node cable unplugged and it seems the standby node tries to start the resource, but fails. Log is attached. But there's not enough logs to

[Linux-HA] Firewall (active/passive) with two Switch for side

2007-11-26 Thread Sim
Hello everybody! I'm trying to configure 2 firewall (ACTIVE/PASSIVE) with Heartbeat, but I have found some trouble. I have 2 different switch, connected by a Ethernet port and with different management IP, for each firewall side. I don't know which IP add to ping of Hearbeat to monitor Ethernet

Re: [Linux-HA] Fencing prevents resource from failing over

2007-11-26 Thread Andrew Beekhof
On Nov 26, 2007, at 2:38 PM, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Hi Andrew, I just modified my stonith device to work in both online and offline mode. The stonith operation (standby - active) is successful with the active node cable unplugged and it seems the standby node tries to

Re: [Linux-HA] Heartbeat vs CIB

2007-11-26 Thread Andrew Beekhof
On Nov 23, 2007, at 3:25 PM, Audet, Jean-Michel wrote: I am currently using Heartbeat v2 with a cib.xml file. I need an Active/Standby (Master/Slave) configuration I am a newbie and I need to connect my software to the heartbeat daemons using the client library and then do check pointing

Re: [Linux-HA] Announcing: Sphinx Search daemon OCF resource agent

2007-11-26 Thread Christian Rishøj
Hi, On 26 Nov 2007, at 12:01, Dejan Muhamedagic wrote: Hi, On Mon, Nov 26, 2007 at 08:22:18AM +0100, Christian Rish?j wrote: Dear Linux-HA readers I hereby publish an OCF resource agent for the Sphinx Search daemon [1]. Please use and redistribute as you wish. Many thanks for the

Re: [Linux-HA] Mounting Filing System

2007-11-26 Thread Jason Snyder
Ah, its cibadmin, not cibadm. That explains a lot ;-) Here is the config that I am having trouble with mounting the drbd partition (it comes up primary/secondary correctly according to /proc/drbd and I can manually mount it with `mount -o noatime /dev/drbd0 /ha/apache`): cib admin_epoch=0

RE: [Linux-HA] re:ERROR: Message hist queue is filling up

2007-11-26 Thread Scott Mann
Hi Arun, Thanks for this tip. I actually had looked at this previously, turned off both iptables and ip6tables and rebooted. I just tried it again. Sadly no luck. So, it doesn't have to do with iptables that I can tell. It is odd that this problem manifests only with one system and not the

RE: [Linux-HA] Heartbeat vs CIB

2007-11-26 Thread Audet, Jean-Michel
Thanks for the feedback. I had already used the walking node functions: Here is the kind of information I am able to get: MyNode1 type=normal MyNode1 status=active MyNode2 type=normal MyNode2 status=active 192.168.0.1 type=ping MyNode1 status=ping However, I would like to know which one is

Re: [Linux-HA] Fencing prevents resource from failing over

2007-11-26 Thread Dejan Muhamedagic
Hi, On Mon, Nov 26, 2007 at 04:14:07PM +0100, Andrew Beekhof wrote: On Nov 26, 2007, at 2:38 PM, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Hi Andrew, I just modified my stonith device to work in both online and offline mode. The stonith operation (standby - active) is successful

Re: [Linux-HA] Heartbeat vs CIB

2007-11-26 Thread Andrew Beekhof
On Nov 26, 2007, at 5:47 PM, Audet, Jean-Michel wrote: Thanks for the feedback. I had already used the walking node functions: Here is the kind of information I am able to get: MyNode1 type=normal MyNode1 status=active MyNode2 type=normal MyNode2 status=active 192.168.0.1 type=ping MyNode1

Re: Re: [Linux-HA] How to control a resource with an environment variable?

2007-11-26 Thread Atanas Dyulgerov
Hi Max, I haven't received such complete answer before. Thank you very much for the useful information. I understood how attributes work. A shell script performing some operations/monitoring sets attributes by using attrd_updater or crm_attribite. The score for a resource can be changed with

RE: [Linux-HA] Heartbeat vs CIB

2007-11-26 Thread Audet, Jean-Michel
Hi again, I am trying to merge the code from crm_mon since Friday and I am always hitting the wall on function set_working_set_defaults. My code simply crash there and stop. CRM_MON application compile and work fine. Any idea what can cause the problem? Thanks again, Jean-Michel

[Linux-HA] Use of C++

2007-11-26 Thread Audet, Jean-Michel
Hi, I am currently use the hb_api.h (heartbeat API) to add some node management in my application. My application is in C++ and I always get errors on compilation with reserved C++ word like new, delete, class, protected, private etc... I am not a C++ guru... I cannot find a way

RE: [Linux-HA] Newbee problems to bring it V2.0 up and running

2007-11-26 Thread Bill Eaton
My next step was to try the GUI to create a cib file to have a look at this file and see whats happen. But if I start the GUI I can't connect to any of my nodes, not even localhost!? It says always Can't connect to server. So whats going on there or whats wrong? I tried this on SUSE 10.3

RE: [Linux-HA] Heartbeat vs CIB

2007-11-26 Thread Audet, Jean-Michel
When I crash, I get the following messages in my /var/log/messages heartbeat: ERROR: ipc_bufpool_update: magic number in head does not match.Something very bad happened, abort now, fearside pid 27029 heartbeat: ERROR: magic=616d6a2f, expected=abcd heartbeat: info: pool: refcount=1,

Re: [Linux-HA] Mounting Filing System

2007-11-26 Thread Dejan Muhamedagic
Hi, On Mon, Nov 26, 2007 at 08:04:17AM -0800, Jason Snyder wrote: Ah, its cibadmin, not cibadm. That explains a lot ;-) Here is the config that I am having trouble with mounting the drbd partition (it comes up primary/secondary correctly according to /proc/drbd and I can manually mount

Re: [Linux-HA] ERROR: Message hist queue is filling up

2007-11-26 Thread Dejan Muhamedagic
On Sun, Nov 25, 2007 at 02:50:34PM -0500, Scott Mann wrote: Hi, I started getting this message on 1 system in a 2 node hb cluster AFTER installing 2.1.2 via the fc8 rpms (yum install heartbeat*, so both heartbeat and heartbeat-devel). I actually installed the rpms on two freshly installed

RE: [Linux-HA] ERROR: Message hist queue is filling up

2007-11-26 Thread Scott Mann
Dejan, On Sun, Nov 25, 2007 at 02:50:34PM -0500, Scott Mann wrote: Hi, I started getting this message on 1 system in a 2 node hb cluster AFTER installing 2.1.2 via the fc8 rpms (yum install heartbeat*, so both heartbeat and heartbeat-devel). I actually installed the rpms on two freshly

Re: [Linux-HA] switch active/standby if resource fails is monit a cure?

2007-11-26 Thread Pranav Peshwe
On Nov 27, 2007 12:55 PM, Pranav Peshwe [EMAIL PROTECTED] wrote: On Nov 26, 2007 1:39 PM, holgi [EMAIL PROTECTED] wrote: Hi, i have a running hearbeat 1.x /drbd 0.7x cluster. It provides imap and samba services to a lan. What i am in need for, is that the nodes switch if one resouce

Re: [Linux-HA] switch active/standby if resource fails is monit a cure?

2007-11-26 Thread Pranav Peshwe
On Nov 26, 2007 1:39 PM, holgi [EMAIL PROTECTED] wrote: Hi, i have a running hearbeat 1.x /drbd 0.7x cluster. It provides imap and samba services to a lan. What i am in need for, is that the nodes switch if one resouce fails let's say nmbd the nodes should switch. Can i do that monit ? Or

[Linux-HA] When there is an unmanaged resource, Heartbeat cannot stop.

2007-11-26 Thread HIDEO YAMAUCHI
Hi, I tested the addition of the resource from GUI. First cib.xml does not have the resource. I used the latest development edition.(Heartbeat-Dev-68a5c0c53078) I operated it in order of next. 1)I am connected to the DC node in GUI. 2)I add a FileSystem resource without appointing a parameter.