Re: [Linux-ha-dev] RFC: Roadmap for 2.2.0

2008-02-07 Thread Andrew Beekhof
On Feb 6, 2008, at 4:55 PM, Lars Marowsky-Bree wrote: On 2008-02-06T16:28:41, Andrew Beekhof [EMAIL PROTECTED] wrote: There don't seem to be any (I think we exhausted the testing discussion)... are you waiting for me to do it? Sure, feel free to; at least I have no objections. I wonder

Re: [Linux-ha-dev] RFC: Roadmap for 2.2.0

2008-02-07 Thread Lars Marowsky-Bree
On 2008-02-07T10:45:00, Andrew Beekhof [EMAIL PROTECTED] wrote: Saying Requires: pacemaker doesn't seem like a good idea though True, people who wish to run v1 only don't need Pacemaker installed. Regards, Lars -- Teamlead Kernel, SuSE Labs, Research and Development SUSE LINUX Products

Re: [Linux-HA] DRBD 8.0 under Debian Etch?

2008-02-07 Thread Dominik Klein
Short question: Does anyone here have DRBD8 running with heartbeat under Etch? Short answer: Yes. Version 8.0.8, upgrading to 8.0.9 within the next days. I use the OCF RA to manage drbd as a Master/Slave Resource. Regards Dominik ___ Linux-HA

Re: [Linux-HA] Re: Where can I find documentation on the CIB XML?

2008-02-07 Thread Andrew Beekhof
On Feb 6, 2008, at 9:48 PM, Michael Toler wrote: [EMAIL PROTECTED]@[EMAIL PROTECTED] Send the bloody e-mail, find the documentation on my own. Happens every time. I haven't located my problem yet, so if anyone see's the issue with why it's failing back ever time, go ahead and respond.

Re: [Linux-HA] Supervise but don't stop a resource

2008-02-07 Thread Andrew Beekhof
On Feb 6, 2008, at 9:59 PM, Szu-Ching Peckner wrote: how about in your resource agent script, under stop case, just don't stop the process, maybe just echo something to the log. That way when heartbeat stop, it just write info to the log but doesn't actually stop anything That sounds

Re: [Linux-HA] Config Problem - I'm going blind staring at it!

2008-02-07 Thread Andrew Beekhof
On Feb 6, 2008, at 7:12 PM, Paul Court wrote: Never mind, Rebooted both nodes and they are working now... ?!? If you have any other comments about my config though, feel free to chime up! well we tried to allocate it, but for some reason it failed on both nodes:

Re: [Linux-HA] Q: Difference between after and before order rules

2008-02-07 Thread Andrew Beekhof
On Feb 7, 2008, at 8:52 AM, [EMAIL PROTECTED] wrote: Hi all, what's the defined difference between the two order rules 1: A before B 2: B after A nothing at all :-) It seemed like a good idea to support both ways but it just adds confusion (which will be addressed when we cleanup the

[Linux-HA] Implement OCF MySQL Monitoring function or not?

2008-02-07 Thread Lino Moragon
Hi list, I'm running a HA 2 node active / passive cluster with drbd and mysql. I was told about the ocf monitoring function of the yet available mysql ocf script. Could you tell me the advantages or disadvantages of implementing this monitoring function? I mean surely it does have benefits, but

[Linux-HA] OCF RESKEY environment variables

2008-02-07 Thread Adrian Chapela
Hello, I am still testing my MySQL Master - Slave configuration. I am invertigating to do a OCF script to do this config. Where is OCF_RESKEY variables ?? This variables isn't in .ocf-returncode. Thank you! ___ Linux-HA mailing list

Re: [Linux-HA] quorumd: Problem with certificates

2008-02-07 Thread maloja01
Did you use the correct cn (certificate attribute cn must be equal to the cluster name)? If you use the cluster name mycluster and your quorum server could be reached with a special name (dont remeber it know, but you can strace it easyly) you can also use quorumdtest as a clien test program to

Re: [Linux-HA] OCF RESKEY environment variables

2008-02-07 Thread Andrew Beekhof
they're set based on the resource options you set in cib.xml On Feb 7, 2008, at 11:54 AM, Adrian Chapela wrote: Hello, I am still testing my MySQL Master - Slave configuration. I am invertigating to do a OCF script to do this config. Where is OCF_RESKEY variables ?? This variables isn't

Re: [Linux-HA] Implement OCF MySQL Monitoring function or not?

2008-02-07 Thread Andrew Beekhof
On Feb 7, 2008, at 10:08 AM, Lino Moragon wrote: Hi list, I'm running a HA 2 node active / passive cluster with drbd and mysql. I was told about the ocf monitoring function of the yet available mysql ocf script. well if its an OCF script it needs to implement some kind of monitoring -

Re: [Linux-HA] quorumd: Problem with certificates

2008-02-07 Thread Michael Schwartzkopff
Am Donnerstag, 7. Februar 2008 10:47 schrieb [EMAIL PROTECTED]: Did you use the correct cn (certificate attribute cn must be equal to the cluster name)? If you use the cluster name mycluster and your quorum server could be reached with a special name (dont remeber it know, but you can strace

Re: [Linux-HA] quorumd: Problem with certificates

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 12:33:58PM +0100, Michael Schwartzkopff wrote: Am Donnerstag, 7. Februar 2008 10:47 schrieb [EMAIL PROTECTED]: Did you use the correct cn (certificate attribute cn must be equal to the cluster name)? If you use the cluster name mycluster and your quorum server

Re: [Linux-HA] crm crash on centos 4.5

2008-02-07 Thread Dejan Muhamedagic
On Thu, Feb 07, 2008 at 10:07:58AM +0100, Andrew Beekhof wrote: On Feb 7, 2008, at 12:57 AM, Lars Marowsky-Bree wrote: On 2008-02-06T15:11:28, Tao Yu [EMAIL PROTECTED] wrote: Running the heartbeat 2.1.2 Core: #0 0x003b9602e21d in raise () from /lib64/tls/libc.so.6 #1

Re: [Linux-HA] Re: Re: Error compiling heartbeat 2.1.3-2

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 09:14:39AM +0100, Ilkan Akcakaya wrote: Hallo Andrew, thanks for the advice: it worked ... but I got a new error. After ./ConfigureMe configure make all install conf_yacc.y: In function ?yyparse?: conf_yacc.y:65: error: ?yyrcsid? undeclared (first use in this

Re: [Linux-HA] Implement OCF MySQL Monitoring function or not?

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 12:22:18PM +0100, Andrew Beekhof wrote: On Feb 7, 2008, at 10:08 AM, Lino Moragon wrote: Hi list, I'm running a HA 2 node active / passive cluster with drbd and mysql. I was told about the ocf monitoring function of the yet available mysql ocf script. well if

Re: [Linux-HA] OCF RESKEY environment variables

2008-02-07 Thread Adrian Chapela
Andrew Beekhof escribió: they're set based on the resource options you set in cib.xml Yes... I find now. I am thinking in gets mysql actual OCF resource and improve with MultiState.. What do you think about ?? I think could be a solution for me. On Feb 7, 2008, at 11:54 AM, Adrian Chapela

Re: [Linux-HA] quorumd: Problem with certificates

2008-02-07 Thread Michael Schwartzkopff
Am Donnerstag, 7. Februar 2008 13:03 schrieb Dejan Muhamedagic: Well, xen02:~# /usr/lib/heartbeat/quorumdtest Segmentation fault Can you provide a backtrace. Hi, I don't know if this help you but here is the bt from gdb: xen02:~# cat backtrace.log #0 0x0804902a in ?? () #1 0x08049661

Re: [Linux-HA] Windows port of haclient

2008-02-07 Thread Luis Martin-Santos
Wouldn't it be great if there were a web based heartbeat gui? Does any documentation about the client protocol exist? Maybe I give it a try... El 07/02/2008, a las 13:55, Xinwei Hu escribió: 2008/2/6, Andrew Beekhof [EMAIL PROTECTED]: is the client still usable on linux with these changes?

[Linux-HA] Resources are stopped for a few seconds when bringing secondary node out of standby

2008-02-07 Thread Paul Court
When I use crm_standby -U node2 -v false to bring a secondary node out of standby my esources are stopped for a few seconds on the active node and then restarted (still on the active node). Also, when running the hb_report program I got the following. (I am running Ubuntu 7.10 but I have

[Linux-HA] clones for Load Balancing?

2008-02-07 Thread Abraham Iglesias
hi all, i recently read about clones and there was something that interested me a lot: i read: Clones were initially conceived as a convienient way to start N instances of an IP resource and have them distributed throughout the cluster for load balancing. They have turned out to quite useful

Re: [Linux-HA] Windows port of haclient

2008-02-07 Thread Andrew Beekhof
On Feb 7, 2008, at 1:55 PM, Xinwei Hu wrote: 2008/2/6, Andrew Beekhof [EMAIL PROTECTED]: is the client still usable on linux with these changes? It is now. ;) any chance you could attach a diff of the changes? assuming they don't break anything for non-windows users, I dont see any

Re: [Linux-HA] Windows port of haclient

2008-02-07 Thread Luis Martin-Santos
Well, my idea was more like: heartbeat-libs+bins --- python-twisted-based-http(s)-server - (xml) -- rich javascript client UI that xml part could be: a) cib.xml directly parsed on the javascript side for presenting the cluster structure b) commands going to the httpd (configuration

Re: [Linux-HA] Windows port of haclient

2008-02-07 Thread Miroslav Strugarevic
On Feb 7, 2008 4:26 PM, Dejan Muhamedagic [EMAIL PROTECTED] wrote: Hi, On Thu, Feb 07, 2008 at 02:10:23PM +0100, Luis Martin-Santos wrote: Wouldn't it be great if there were a web based heartbeat gui? Hi, that would be great. Even If someone with good Python and Glade skills could make GUI a

[Linux-HA] ClusterIP

2008-02-07 Thread Thomas Glanzmann
Hello, I would like to do a Cluster-IP Setup with SLES 10. A few things are unclear for me. With ClusterIP you have one IP address that is shared on two or more nodes. It useally uses a multicast mac address. Both nodes see all traffic. But when one node goes down how does the other node see that

Re: [Linux-HA] ClusterIP

2008-02-07 Thread Michael Schwartzkopff
Am Donnerstag 07 Februar 2008 19:11 schrieb Thomas Glanzmann: Hello, I would like to do a Cluster-IP Setup with SLES 10. A few things are unclear for me. With ClusterIP you have one IP address that is shared on two or more nodes. It useally uses a multicast mac address. Both nodes see all

Re: [Linux-HA] problems with quorumd

2008-02-07 Thread Sebastian Reitenbach
Hi, Dejan Muhamedagic [EMAIL PROTECTED] wrote: Hi, On Thu, Feb 07, 2008 at 05:00:14PM +0100, Sebastian Reitenbach wrote: Hi, I have a 4 node cluster, and wanted to setup a quorum server, so that I do not need three running cluster nodes to get quorum. The quorumd IP address is

Re: [Linux-HA] propagate value similliar to pingd

2008-02-07 Thread Lars Marowsky-Bree
On 2008-02-07T19:00:55, Thomas Glanzmann [EMAIL PROTECTED] wrote: Hello, I would like to write a script similiar to pingd that is spawnd and populates a value in the cib that I can build a rule on. What do I have to do to obtain the above. Concrete questions are: - What do I have

Re: [Linux-HA] ClusterIP

2008-02-07 Thread Thomas Glanzmann
Hello, thank you a lot for the feedback! Now I understand how the failover works. Has someone a ready to use cib.xml that I can use for testing. I am going to try my luck right now and come back in an hour or so with my findings. It would be nice if someone could comment on them. Thomas

Re: [Linux-HA] ClusterIP

2008-02-07 Thread Lars Marowsky-Bree
On 2008-02-07T19:11:26, Thomas Glanzmann [EMAIL PROTECTED] wrote: Hello, I would like to do a Cluster-IP Setup with SLES 10. A few things are unclear for me. With ClusterIP you have one IP address that is shared on two or more nodes. It useally uses a multicast mac address. Both nodes see

Re: [Linux-HA] Heartbeat and RH Cluster Suite

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 06:28:52PM +0200, Atanas Dyulgerov wrote: Hi, I'm not going to compare both solutions. I'm creating a cluster with 3 nodes - two active and a passive. Two special applications are going to work on both active nodes. If either of them fails it will be 'migrated'

Re: [Linux-HA] problems with quorumd

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 05:00:14PM +0100, Sebastian Reitenbach wrote: Hi, I have a 4 node cluster, and wanted to setup a quorum server, so that I do not need three running cluster nodes to get quorum. The quorumd IP address is a shared IP on another two node cluster. I've done the

[Linux-HA] Heartbeat and RH Cluster Suite

2008-02-07 Thread Atanas Dyulgerov
Hi, I'm not going to compare both solutions. I'm creating a cluster with 3 nodes - two active and a passive. Two special applications are going to work on both active nodes. If either of them fails it will be 'migrated' to the standby node. Those applications use heavily the storage device.

Re: [Linux-HA] Heartbeat and RH Cluster Suite

2008-02-07 Thread Michael Brennen
On Thu, 7 Feb 2008, Atanas Dyulgerov wrote: I'm not going to compare both solutions. I'm creating a cluster with 3 nodes - two active and a passive. Two special applications are going to work on both active nodes. If either of them fails it will be 'migrated' to the standby node. Those

Re: [Linux-HA] Heartbeat and RH Cluster Suite

2008-02-07 Thread Paul Court
So my question is, is there an NBD solution (an GNBD alternative) which works with Heartbeat? I couldn't find such. So no way for me to share a block device over the network in my Heartbeat cluster. I have to switch to RHCS... Do you plan to implement/adopt such NBD and Global File System

Re: [Linux-HA] Resources are stopped for a few seconds when bringing secondary node out of standby

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 04:57:49PM +0100, Lars Marowsky-Bree wrote: On 2008-02-07T16:33:48, Dejan Muhamedagic [EMAIL PROTECTED] wrote: This is very strange. Are you sure that you're running bash and not dash? It's such a mishmash with all those xxshes. I'm off to check that dash

Re: [Linux-HA] Resources are stopped for a few seconds when bringing secondary node out of standby

2008-02-07 Thread Lars Marowsky-Bree
On 2008-02-07T16:33:48, Dejan Muhamedagic [EMAIL PROTECTED] wrote: This is very strange. Are you sure that you're running bash and not dash? It's such a mishmash with all those xxshes. I'm off to check that dash thing. Why not simply specify #!/bin/bash explicitly? Other scripts require it

Re: [Linux-HA] clones for Load Balancing?

2008-02-07 Thread Adrian Chapela
Dominik Klein escribió: But I you really want to do something serious about load balancing have a look at: - Linux Virtual Server with localhost feature(!) - ldirectord as Linux-HA resource And/or keepalived. I recommend you this. It is IMHO the simplest way to work with Linux Virtual Server

[Linux-HA] Re: Where can I find documentation on the CIB XML?

2008-02-07 Thread Michael Toler
[In reply to] On Feb 6, 2008 Maxim Veksler wrote: You haven't attached the full cib.xml you have, in general you should define default-resource-stickiness to prevent your resources jump without true failures. Here is the ha.cf file and the entire CIB file. I have set the

Re: [Linux-HA] propagate value similliar to pingd

2008-02-07 Thread Adrian Chapela
Thomas Glanzmann escribió: Hello, I would like to write a script similiar to pingd that is spawnd and populates a value in the cib that I can build a rule on. What do I have to do to obtain the above. Concrete questions are: - What do I have to put in the cib to spawn such an 'agent'?

Re: [Linux-HA] Implement OCF MySQL Monitoring function or not?

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 05:50:20PM +0100, Andreas Kurz wrote: On Feb 7, 2008 10:08 AM, Lino Moragon [EMAIL PROTECTED] wrote: Hi list, I'm running a HA 2 node active / passive cluster with drbd and mysql. I was told about the ocf monitoring function of the yet available mysql ocf

Re: [Linux-HA] Implement OCF MySQL Monitoring function or not?

2008-02-07 Thread Andreas Kurz
On Feb 7, 2008 10:08 AM, Lino Moragon [EMAIL PROTECTED] wrote: Hi list, I'm running a HA 2 node active / passive cluster with drbd and mysql. I was told about the ocf monitoring function of the yet available mysql ocf script. The base monitor function in the mysql RA does only a check if

Re: [Linux-HA] Windows port of haclient

2008-02-07 Thread Lars Marowsky-Bree
On 2008-02-07T16:26:10, Dejan Muhamedagic [EMAIL PROTECTED] wrote: The documentation is rather scarce, I'm afraid. The current GUI is based on python and, if you speak python, you could take a look there. Another option would be to just invoke external programs such as crm_mon,

Re: [Linux-HA] crm crash on centos 4.5

2008-02-07 Thread Tao Yu
It will be extremely helpful if we could know the bug number related to this problem. Thanks! On Feb 7, 2008 7:05 AM, Dejan Muhamedagic [EMAIL PROTECTED] wrote: On Thu, Feb 07, 2008 at 10:07:58AM +0100, Andrew Beekhof wrote: On Feb 7, 2008, at 12:57 AM, Lars Marowsky-Bree wrote: On

Re: [Linux-HA] Resources are stopped for a few seconds when bringing secondary node out of standby

2008-02-07 Thread Dejan Muhamedagic
Hi, On Thu, Feb 07, 2008 at 01:13:12PM +, Paul Court wrote: When I use crm_standby -U node2 -v false to bring a secondary node out of standby my esources are stopped for a few seconds on the active node and then restarted (still on the active node). Also, when running the hb_report

Re: [Linux-HA] clones for Load Balancing?

2008-02-07 Thread Michael Schwartzkopff
Am Donnerstag, 7. Februar 2008 14:59 schrieb Abraham Iglesias: hi all, i recently read about clones and there was something that interested me a lot: i read: Clones were initially conceived as a convienient way to start N instances of an IP resource and have them distributed throughout the

[Linux-HA] Failed Authentication

2008-02-07 Thread Paul Roberts
I have a 2 node cluster that is setup and I'm testing. Nothing too tricky here but I don't (pretty sure they are not) talking to each other. Feb 7 20:56:47 x4100-cta-dev heartbeat: [ID 796187 daemon.warning] [20406]: WARN: string2msg_ll: node [v215-cta-dev] failed authentication

[Linux-HA] propagate value similliar to pingd

2008-02-07 Thread Thomas Glanzmann
Hello, I would like to write a script similiar to pingd that is spawnd and populates a value in the cib that I can build a rule on. What do I have to do to obtain the above. Concrete questions are: - What do I have to put in the cib to spawn such an 'agent'? - How do I propagate

Re: [Linux-HA] ClusterIP

2008-02-07 Thread Thomas Glanzmann
Hello again, here comes by cib.xml for a clusterip. But the ressource stickiness is not working for me. When I shoutdown ha-2, the two clone instances stay on ha-1. Any ideas? Before sending this e-mail I used the following command to set some location constraints: crm_resource -M -r ip0:0 -H

Re: [Linux-HA] Patches for apache RA

2008-02-07 Thread Dejan Muhamedagic
Hi, On Mon, Feb 04, 2008 at 04:16:15PM +0900, [EMAIL PROTECTED] wrote: Content-Description: Mail message body Hi All, I think that there are some problems in the apache RA. My computer environment is as follows # /usr/sbin/httpd -v Server version: Apache/2.2.3 Server built: Aug 6

[Linux-HA] problems with quorumd

2008-02-07 Thread Sebastian Reitenbach
Hi, I have a 4 node cluster, and wanted to setup a quorum server, so that I do not need three running cluster nodes to get quorum. The quorumd IP address is a shared IP on another two node cluster. I've done the following tests, the quorumd from a 2.1.2 version of heartbeat, the cluster

Re: [Linux-HA] crm crash on centos 4.5

2008-02-07 Thread Tao Yu
Thanks for all the information. I do believe we have the debuginfo package installed. Will try to reproduce this and try to get the ha-debug file. Could someone point me to the code section for this problem? I want to dig a little deep into heartbeat implementation. :) Thanks again!

Re: [Linux-HA] ClusterIP

2008-02-07 Thread Lars Marowsky-Bree
On 2008-02-07T22:43:50, Thomas Glanzmann [EMAIL PROTECTED] wrote: Hello again, here comes by cib.xml for a clusterip. But the ressource stickiness is not working for me. When I shoutdown ha-2, the two clone instances stay on ha-1. Any ideas? Before sending this e-mail I used the following

[Linux-HA] 3-node-setup

2008-02-07 Thread Ludwig Ruderstaller
Hi, I'm new to the list. I have here a, AFAIK, uncommen setup. Sorry the whole explanation is a little bit long, but i want to give you all information in the hope of your help :-) All servers are connected over a backend switch and a frontendswitch to the world. The Backend Network use

[Linux-HA] Samba and High Availability

2008-02-07 Thread Christopher Barry
All, When I began creating an active/passive samba server, I knew the session state information issue was going to make transparent fail-over unlikely. I figured minimal downtime was the best I could hope for. But I've been pleasantly surprised that it does in fact fail-over amazingly

Re: [Linux-HA] ClusterIP

2008-02-07 Thread Thomas Glanzmann
Hallo Lars Uhm, what do you think should happen when you shutdown ha-2 - of course they stat on ha-1 in that case? I meant that I shut it down temporarily and if it comes back again the clones stay both on one node instead of going back again. I don't know what you're saying here ;-) I

Re: [Linux-HA] Samba and High Availability

2008-02-07 Thread Thomas Glanzmann
Hello Christopher, Everything I have read about samba and HA made it seem like this was not possible. Are others doing this too? Can you think of some good tests to try to stress it (short of accessing a database or something). I imagine a fail-over during a large copy operation would fail,

Re: [Linux-HA] 3-node-setup

2008-02-07 Thread Andrew Beekhof
On Feb 8, 2008, at 2:48 AM, Ludwig Ruderstaller wrote: Hi, I'm new to the list. I have here a, AFAIK, uncommen setup. Sorry the whole explanation is a little bit long, but i want to give you all information in the hope of your help :-) Having 3 nodes isn't supported when using

Re: [Linux-HA] crm crash on centos 4.5

2008-02-07 Thread Andrew Beekhof
bug #1546 On Feb 7, 2008, at 10:08 PM, Tao Yu wrote: It will be extremely helpful if we could know the bug number related to this problem. Thanks! On Feb 7, 2008 7:05 AM, Dejan Muhamedagic [EMAIL PROTECTED] wrote: On Thu, Feb 07, 2008 at 10:07:58AM +0100, Andrew Beekhof wrote: On Feb

Re: [Linux-HA] Re: Where can I find documentation on the CIB XML?

2008-02-07 Thread Andrew Beekhof
On Feb 7, 2008, at 4:16 PM, Michael Toler wrote: [In reply to] On Feb 6, 2008 Maxim Veksler wrote: You haven't attached the full cib.xml you have, in general you should define default-resource-stickiness to prevent your resources jump without true failures. Here is the ha.cf file and

Re: [Linux-HA] OCF RESKEY environment variables

2008-02-07 Thread Andrew Beekhof
On Feb 7, 2008, at 12:53 PM, Adrian Chapela wrote: Andrew Beekhof escribió: they're set based on the resource options you set in cib.xml Yes... I find now. I am thinking in gets mysql actual OCF resource and improve with MultiState.. What do you think about ?? I think could be a solution