Re: [Pacemaker] pacemaker fails to start drbd using ocf:linbit:drbd

2010-07-01 Thread martin . braun
Hi Bart, my guess is that you did forget the start-delay attribute for the monitor operations, that's why you see the time-out error message. Here is an example: op monitor interval=20 role=Slave timeout=20 start-delay=1m \ op monitor interval=10 role=Master timeout=20

Re: [Pacemaker] One or a few dedicated installations

2010-06-22 Thread martin . braun
Hi Martin, I don't have a big cluster but I do have a few (5-15) active/passive clusters with each having 2 nodes. I have a similar setup running : one 6 node cluster with 3 active/passive pairs and it is working fine. When you add more node configuration will get a bit long and unclear but

Re: [Pacemaker] VMWare Guest as Cluster Nodes - how to do stonith

2010-05-21 Thread martin . braun
Hi Sebastian, Do you have any experiences with the libvirt? It sounds very promising to me. Normally a simple soap call to reboot the guest would be enough to have kind of stonith. I have some experiences with libvirt, but not yet with the ESX hypervisordriver. However with that driver

Re: [Pacemaker] VMWare Guest as Cluster Nodes - how to do stonith

2010-05-20 Thread martin . braun
Hi Sebastian, guests. Does anybody have experiences or advises on how to configure stonith for vmware guests or maybe there is another solution to build a reliable virtual cluster. See this thread:

Re: [Pacemaker] How SuSEfirewall2 affects on openais startup?

2010-05-12 Thread martin . braun
Hi Aleksey, Don't know much about SuSEfirewall2, but can't see how when it starts should influence the cluster, as long as it lets the traffic we need through. And what to do? First of all check the logs (/var/log/messages and /var/log/firewall). When you can't see anything suspicious, I

Re: [Pacemaker] 2 web and 2 db servers both with drbd. Questions concerning the set-up

2010-05-05 Thread martin . braun
Hi Oliver, I almost have a set-up which looks like this: Master sideSlave side -virtual IP-- | | web1drbd/pacemaker--web2 | | internal internal |

Re: [Pacemaker] Failover constraint problem

2010-04-19 Thread martin . braun
Hi Sandor, 1. If I migrate apache-group resorce to another node then nfs_client won't release the /mnt mount point (I know according to this config it should not). Refering to teh time-out message below, is it possible that stopping the nfs-client takes more than 20 sec? Perhaps you should

[Pacemaker] Howto handle opt-in clusters WAS: Re: resource monitor operations on wrong nodes

2010-04-16 Thread martin . braun
Hi, I have a non-symmetric cluster (symmetric-cluster=false) with four nodes. We still check _every_ node to be sure the resources aren't already running there. OK, that is reasonable - but I have trouble with the logic of the messages: they are listed as failed actions, however if the

[Pacemaker] resource monitor operations on wrong nodes

2010-04-09 Thread martin . braun
Hi, I have a non-symmetric cluster (symmetric-cluster=false) with four nodes. On two nodes I have allowed a resource group: location grpFS-pref1 grpFS 200: wdf-ux-0040 location grpFS-pref2 grpFS 200: wdf-ux-0041 grpFS is configured as: group grpFS resFS resVIP resAPP the other nodes are

Re: [Pacemaker] disable failover when doing orderly reboot

2010-04-01 Thread martin . braun
Hi Gerry, Stop all resources running on the node going to be shut down. That's what you want in the end, isn't it? That should also work: set the second node to standby and do the reboot on the primary - when primary is up again you'll set the secondary back online. Best, Martin Gerry

[Pacemaker] Feedback: Website Updates

2010-03-24 Thread martin . braun
Hi, first of all: the succesful redesign of the clusterlabs website looks very nice and the splash page is a really good idea! However I found some small teething troubles: * Explore tab: - The link Site updates is not working yet. - I would also suggest to add a direct link to the wiki (Home)

[Pacemaker] Design: 8 vs 4x2 nodes Cluster

2010-03-18 Thread martin . braun
Hi there, I want to realize a rather complex setup, so I have a couple of questions: The cluster (as a shared nothing variant) should provide: * 4 services (=server) depending on each other. * 3 of them can only be realized as active/passive failover, synched with DRBD (M/S) * The servers

[Pacemaker] pacemaker-mgmt RPM for centos?

2010-02-10 Thread martin . braun
Hi all, I am setting up a pacemaker (1.0.7) cluster (corosync) with centos-5.4 - I used the repo: http://www.clusterlabs.org/rpm/epel-5/i386/. However I can't find the rpms for the Python GUI. Is there a rpm build for epel-5 or do I have to compile from source? If yes - which version of

Re: [Pacemaker] The best way to take a CIB backup and rooback to a defined state

2009-10-06 Thread martin . braun
Hi, - cibadmin -C --xml-file backupfile.xml This is a way, but maybe not the best? I found this a convenient way: - crm configure show backup_config.xml - crash!! - cibadmin -E --force - edit configuration to delete last-lrm-refresh etc. - crm configure backup_config.xml It's just

[Pacemaker] OCF resource agent: template for python?

2009-08-14 Thread martin . braun
Hi List, we would like to use a python script as an OCF resource agent script. Do you know any free templates for this? Are there any disadvantages to consider when not using bash as a script language for OCF? Thanks in advance, Martin InterComponentWare AG: Vorstand: Peter Reuschel

Re: [Pacemaker] monitoring events exported through web interface?

2009-06-24 Thread martin . braun
Hi Dimitar, I need to publish certain resource states in my cluster through a web interface which can be accessed remotely by load balancers and other health-checking software. Is there a standard way to do this? The standard way should be SNMP- nearly every health-checking software

[Pacemaker] API for programmatically controlling a heartbeat 3 cluster?

2009-06-18 Thread martin . braun
Hi All, I want to be able to configure the CIB of a heartbeat/pacemaker cluster programmatically (Java). I don't want to reinvent the PythonGUI - I just want to be able to change some basic settings like the Virtual-IP Address (i.e. changing resource's attributes), set a node to standby or