Re: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase
Hi,

Supplemental explanation:

We have a movie to promote Pacemaker in Japan.
http://linux-ha.sourceforge.jp/wp/

Here is the long version (about 40 seconds).
http://www.youtube.com/watch?v=D141O2U_nyc

The main purpose of this movie is to let people know about Pacemaker, because many people still confuse Pacemaker and Heartbeat.

Now our boss has got an additional budget to make an English version of this movie (I was really surprised at this; he might have some magic wand), so that's why we need your help :)
Please let me know if you have a good catchphrase for Pacemaker!

By the way, the literal translation of the Japanese phrases is like this:

  Something is missing...
  Computer systems have used Heartbeat for a reliable zero downtime solution.
  Now, Heartbeat is evolving to the next level of high availability and robustness.
  That's Pacemaker!
  Pacemaker is the next generation of high availability clustering for Linux.
  Pacemaker achieves maximum availability for your cluster services by detecting and recovering from node and service-level failures.
  Now, for your high availability clusters, Use Pacemaker!

I know Pacemaker is the cluster manager and Heartbeat is the messaging layer, so there might be some misleading phrases, for example "evolving" or "next generation". If there is a clear phrase that explains the connection between Pacemaker, Heartbeat and Corosync, that would be the best one.

Thanks,
Junko
NTT DATA INTELLILINK CORPORATION

On 2010/12/07, at 1:19, Angelo Höngens wrote:

> You mean a 'tagline' of some sort? Here's some from the top of my head:
>
> - keeps your business running
> - high-availability for the masses
> - makes systems administrators sleep at night
> - keeps your bits pumping
> - maximum uptime, minimum cost
> - robust, reliable and r...?
> - An open source cluster suite
>
> --
> With kind regards,
> Angelo Höngens
> Systems Administrator
> --
> NetMatch
> tourism internet software solutions
> Ringbaan Oost 2b
> 5013 CA Tilburg
> T: +31 (0)13 5811088
> F: +31 (0)13 5821239
> mailto:a.hong...@netmatch.nl
> http://www.netmatch.nl
> --
>
> > -----Original Message-----
> > From: Andrew Beekhof [mailto:and...@beekhof.net]
> > Sent: maandag 6 december 2010 16:45
> > To: pacemaker@oss.clusterlabs.org
> > Subject: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase
> >
> > I am in need of a catchphrase for the Pacemaker project.
> > Having no imagination for things not related to C, I turn to you the
> > community :-)
> >
> > Ideas anyone?

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
Re: [Pacemaker] lampp support
Hi,

Thank you for your reply. lampp is Linux, Apache, MySQL and PHP. I have used it from the beginning, so I want to add it to the cluster.

> Hi,
>
> By lampp you mean Linux, Apache, MySQL PHP/Perl/Python? And what's stopping you from using the apache and mysql RA's? You want some kind of all-in-wonder? Try a resource group.
>
> HTH
>
> jiaju liu wrote:
> > Hi all
> > I want to add lampp into my HA cluster. Unfortunately, I have not
> > found a lampp RA. Does this mean I should write the RA myself, or is
> > my resource-agents version too old?
> >
> > My package versions are:
> >
> > cluster-glue-libs-devel-1.0.5-1.el5
> > cluster-glue-1.0.5-1.el5
> > cluster-glue-libs-1.0.5-1.el5
> >
> > heartbeat-libs-3.0.3-2.el5
> > heartbeat-devel-3.0.3-2.el5
> > heartbeat-3.0.3-2.el5
> >
> > openais-1.1.0-1.el5
> > openaislib-1.1.0-1.el5
> > openaislib-devel-1.1.0-1.el5
> >
> > corosynclib-devel-1.2.2-1.1.el5
> > corosynclib-1.2.2-1.1.el5
> > corosync-1.2.2-1.1.el5
> >
> > pacemaker-libs-1.0.8-6.1.el5
> > pacemaker-1.0.8-6.1.el5
> > pacemaker-libs-devel-1.0.8-6.1.el5
Re: [Pacemaker] OCF RA dev guide: final heads up
Florian,

Comments below with [BS]

Thanks,

Bob

----- Original Message -----
From: Florian Haas
To: pacemaker@oss.clusterlabs.org
Sent: Mon, December 6, 2010 7:25:28 AM
Subject: Re: [Pacemaker] OCF RA dev guide: final heads up

Hello Bob,

On 2010-12-03 20:12, Bob Schatz wrote:
> Florian,
>
> Thanks for writing this!
>
> I already found one or two errors related to return codes in my agent based on your document. :)
>
> I have not read the entire document but I do have these comments:
>
> 1. Does this document apply to all versions of the agent framework or only certain versions (hopefully all in one place)? I think the document should have a section which specifies which versions are covered. Also, if certain areas only apply to a certain version then a "Note" should be mentioned in the section.
>
> 2. In Section 3.8 OCF_NOT_RUNNING, how can a monitor return OCF_FAILED_MASTER? Is there an environment variable passed to the monitor action which says "I think you are a master - tell me if you are or are not"?

No, the very purpose of monitor is to _find out_ the status of the resource. If the resource can query its own master/slave status, it should do so, and then if it is both a master and failed, it should return OCF_FAILED_MASTER.

[BS] Okay. That makes sense now.

> 3. In Section 5.3 monitor action, it would be nice if you show how a OCF_FAILED_MASTER is returned.

Hm. Let me defer that for a little bit.

[BS] Sounds good

> 4. Sections 5.8 migrate_to action and 5.9 migrate_from action, do these apply to master/slave resources also or only to primitive resources?

Good question, and indeed I don't know. It's conceivable that a clone set (remember, m/s are just clones with a little extra) has a clone-max that is less than the number of nodes in the cluster, and supports migration, and therefore a clone instance should be able to live-migrate to a different node. I have no clue whether it's indeed implemented that way, though.

Andrew, maybe you can shed some extra light on this?

> 5. Section 5.10 notify action, I think you want to add a note/reference to the Pacemaker Configuration Explained section "10.3.3.9 Proper Interpretation of Notification Environment Variables". (Section name may be different as I was looking at 1.0 from about a year ago).

Good idea. I'll put that on my to-do list.

> 6. Section 8.4 Specifying a master preference, starting in at least version 1.0.9.1 of Pacemaker it is possible to specify a negative master score. I think it would be good to add this to the example as well as a note about which version has this functionality since it was broken in 1.0.6.

Don't you think this would just royally confuse people?

[BS] You are probably right. I guess you don't want to document "bugs" and workarounds from past releases in the current manual. That makes sense.

Florian
[Pacemaker] announcement for glue 1.0.7 and heartbeat 3.0.4
Hello,

The new releases of Heartbeat (3.0.4) and cluster-glue (1.0.7) are out. Please upgrade at the earliest possible opportunity.

The highlights:

- cluster-glue
  - external stonith plugins now report real exit codes (with Pacemaker 1.1 it could happen that, say, a failed monitor operation is reported as success to the upper layers)
  - improved logging for stonith with Pacemaker 1.1 (should get rid of the annoying "parse error" log messages by fence_legacy)
  - new external/nut stonith plugin (Network UPS Tools) (thanks to William Seligman)
  - rename dtr|rts parameter to dtr_rts in rcd_serial; it was necessary to change it, otherwise it cannot be used in XML; NB: this is a regression for v1 users
  - optimization in logd and cl_log of logging to files
  - (note for Ubuntu users) the new ubuntu startup lrmd plugin is included but unfortunately not functional, it will have to wait for one of the next releases

- Heartbeat
  - various optimizations at the communication layer
  - better support for Pacemaker 1.1 (make sure to use traditional compression)
  - new mcast6 UDP IPv6 communication plugin

You can get the current tarballs here:

Heartbeat: http://hg.linux-ha.org/heartbeat-STABLE_3_0/archive/STABLE-3.0.4.tar.bz2
cluster-glue: http://hg.linux-ha.org/glue/archive/glue-1.0.7.tar.bz2

Enjoy!

Lars Ellenberg
Florian Haas
Dejan Muhamedagic
Re: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase
You mean a 'tagline' of some sort? Here's some from the top of my head:

- keeps your business running
- high-availability for the masses
- makes systems administrators sleep at night
- keeps your bits pumping
- maximum uptime, minimum cost
- robust, reliable and r...?
- An open source cluster suite

--
With kind regards,
Angelo Höngens
Systems Administrator
--
NetMatch
tourism internet software solutions
Ringbaan Oost 2b
5013 CA Tilburg
T: +31 (0)13 5811088
F: +31 (0)13 5821239
mailto:a.hong...@netmatch.nl
http://www.netmatch.nl
--

> -----Original Message-----
> From: Andrew Beekhof [mailto:and...@beekhof.net]
> Sent: maandag 6 december 2010 16:45
> To: pacemaker@oss.clusterlabs.org
> Subject: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase
>
> I am in need of a catchphrase for the Pacemaker project.
> Having no imagination for things not related to C, I turn to you the
> community :-)
>
> Ideas anyone?
Re: [Pacemaker] ClusterMon
I think I've found the problem - comments below.

On 12/05/2010 08:26 PM, Ryan Steele wrote:
> Hi folks,
>
> <...snip...>
>
> crm(live)configure# primitive ResourceMonitor ocf:pacemaker:ClusterMon params pidfile="/var/run/crm_mon.pid" htmlfile="/var/tmp/crm_mon.html" extra_options="-T o...@example.com -F 'Cluster Monitor ' -H smtp.example.com:25 -P '[LDAP Cluster]: Resource Changes Detected'" op monitor interval="10s" timeout="20s"

The issue here is that crm_mon.c uses smtp_set_reverse_path(), which expects only a mailbox name, e.g. 'clustermoni...@example.com'. It does not appear to accept other variants, such as 'Cluster Monitor '; doing that results in the entire --mail-from argument being interpreted within angle brackets, which turns 'Cluster Monitor ' into <'Cluster Monitor' '>, which is a malformed sender address.

Would the devs find it reasonable to accept a patch that uses something like regcomp() to check and see if the argument provided already contains a 'From: ', and if so, uses smtp_set_header() instead of smtp_set_reverse_path()? That would maintain backwards compatibility with the current style & give one the ability to form more complex From: addresses, since smtp_set_header() is passed through unchanged.

Thanks,
Ryan
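The decision Ryan proposes (bare mailbox versus a fuller From: value) can be sketched in shell to show the branching. This is only an illustration of the idea, not the C patch itself: the function name `classify_mail_from`, the matching rule (presence of an `<...@...>` part) and the sample addresses are all assumptions made for this sketch.

```shell
#!/bin/sh
# classify_mail_from VALUE
# Hypothetical helper: prints "header" when VALUE looks like
# "Display Name <user@host>" (the case that would go to smtp_set_header()),
# and "mailbox" for anything else (the case for smtp_set_reverse_path()).
classify_mail_from() {
    case "$1" in
        *"<"*"@"*">"*) echo header ;;
        *)             echo mailbox ;;
    esac
}

# Sample values; both addresses are made-up placeholders.
classify_mail_from 'Cluster Monitor <monitor@example.com>'
classify_mail_from 'monitor@example.com'
```

A real patch would make the same two-way choice inside crm_mon.c, so existing configurations that pass a bare mailbox would keep their current behaviour.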
[Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase
I am in need of a catchphrase for the Pacemaker project.
Having no imagination for things not related to C, I turn to you the community :-)

Ideas anyone?
Re: [Pacemaker] OCF RA dev guide: final heads up
Hello Bob,

On 2010-12-03 20:12, Bob Schatz wrote:
> Florian,
>
> Thanks for writing this!
>
> I already found one or two errors related to return codes in my agent based on your document. :)
>
> I have not read the entire document but I do have these comments:
>
> 1. Does this document apply to all versions of the agent framework or only certain versions (hopefully all in one place)? I think the document should have a section which specifies which versions are covered. Also, if certain areas only apply to a certain version then a "Note" should be mentioned in the section.
>
> 2. In Section 3.8 OCF_NOT_RUNNING, how can a monitor return OCF_FAILED_MASTER? Is there an environment variable passed to the monitor action which says "I think you are a master - tell me if you are or are not"?

No, the very purpose of monitor is to _find out_ the status of the resource. If the resource can query its own master/slave status, it should do so, and then if it is both a master and failed, it should return OCF_FAILED_MASTER.

> 3. In Section 5.3 monitor action, it would be nice if you show how a OCF_FAILED_MASTER is returned.

Hm. Let me defer that for a little bit.

> 4. Sections 5.8 migrate_to action and 5.9 migrate_from action, do these apply to master/slave resources also or only to primitive resources?

Good question, and indeed I don't know. It's conceivable that a clone set (remember, m/s are just clones with a little extra) has a clone-max that is less than the number of nodes in the cluster, and supports migration, and therefore a clone instance should be able to live-migrate to a different node. I have no clue whether it's indeed implemented that way, though.

Andrew, maybe you can shed some extra light on this?

> 5. Section 5.10 notify action, I think you want to add a note/reference to the Pacemaker Configuration Explained section "10.3.3.9 Proper Interpretation of Notification Environment Variables". (Section name may be different as I was looking at 1.0 from about a year ago).

Good idea. I'll put that on my to-do list.

> 6. Section 8.4 Specifying a master preference, starting in at least version 1.0.9.1 of Pacemaker it is possible to specify a negative master score. I think it would be good to add this to the example as well as a note about which version has this functionality since it was broken in 1.0.6.

Don't you think this would just royally confuse people?

Florian
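Florian's answer to item 2 (monitor works out the state itself and reports it through its exit code) might look roughly like the following. This is a minimal sketch, not text from the dev guide: the `myres_*` probe functions and the `MYRES_*` variables are hypothetical stand-ins for whatever a real agent would query, and they read plain variables only so the sketch runs on its own. The numeric return codes are the standard OCF values.

```shell
#!/bin/sh
# Standard OCF return codes used by this sketch.
OCF_SUCCESS=0
OCF_ERR_GENERIC=1
OCF_NOT_RUNNING=7
OCF_RUNNING_MASTER=8
OCF_FAILED_MASTER=9

# Hypothetical probes: a real agent would ask its own service here.
myres_is_running() { [ "${MYRES_RUNNING:-no}" = "yes" ]; }
myres_is_master()  { [ "${MYRES_ROLE:-slave}" = "master" ]; }
myres_is_healthy() { [ "${MYRES_HEALTHY:-yes}" = "yes" ]; }

# monitor: discover the state, then map it to an OCF return code.
myres_monitor() {
    # Cleanly stopped: nothing is running on this node.
    myres_is_running || return $OCF_NOT_RUNNING

    if myres_is_master; then
        # Master and broken: the case asked about in item 2.
        myres_is_healthy || return $OCF_FAILED_MASTER
        return $OCF_RUNNING_MASTER
    fi

    # Running as slave: either failed or fine.
    myres_is_healthy || return $OCF_ERR_GENERIC
    return $OCF_SUCCESS
}
```

No environment variable tells the agent which role it is expected to be in; the function derives the role and the health from the resource and lets the cluster compare the result with what it expected.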
Re: [Pacemaker] OCF RA dev guide: final heads up
On 2010-12-03 12:21, Brett Delle Grazie wrote:
> May I just say:
>
> Thank you :)
>
> Very nice work.

Quite welcome, thanks for the compliments.

Cheers,
Florian
Re: [Pacemaker] pacemaker segfault
Hi,

On Mon, Dec 06, 2010 at 03:11:03PM +0300, ruslan usifov wrote:
> hello
>
> I run pacemaker on Ubuntu (Ubuntu 10.04.1 LTS) with corosync. I installed it
> from apt, and my pacemaker version is:
>
> r...@storage0:/var/log# dpkg -l | grep 'pacemaker'
> ii pacemaker 1.0.8+hg15494-2ubuntu2 HA cluster resource manager
>
> and I have the following problem with pacemaker, with the following configuration:
>
> r...@storage0:/var/log# crm configure show
> node storage0
> node storage1
> primitive drbd_web ocf:linbit:drbd \
>         params drbd_resource="web" \
>         op monitor interval="10s" timeout="60s"
> primitive iscsi_ip ocf:heartbeat:IPaddr2 \
>         params ip="192.168.17.19" nic="eth1:1" cidr_netmask="24" \
>         op monitor interval="10s" \
>         meta target-role="Started"
> primitive iscsi_web_target ocf:heartbeat:iSCSITarget \
>         params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \
>         op monitor interval="10s" timeout="30s" depth="0" \
>         meta target-role="Started"
> primitive iscsi_web_target_lun1 ocf:heartbeat:iSCSILogicalUnit \
>         params lun="1" path="/dev/drbd1" target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \
>         op monitor interval="10s" timeout="30s"
> group iscsi iscsi_ip iscsi_web_target iscsi_web_target_lun1
> ms ms_drbd_web drbd_web \
>         meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> colocation iscsi_on_drbd inf: ms_drbd_web:Master iscsi
> order iscsi_target_after_drbd inf: ms_drbd_web:promote iscsi_web_target
> order iscsi_target_lun_after_iscsi_target inf: iscsi_web_target iscsi_web_target_lun1
> property $id="cib-bootstrap-options" \
>         dc-version="1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd" \
>         cluster-infrastructure="openais" \
>         expected-quorum-votes="2" \
>         stonith-enabled="false" \
>         no-quorum-policy="ignore"
> rsc_defaults $id="rsc-options" \
>         resource-stickiness="100"
>
> When I shut down node storage1, node storage0 doesn't accept the Master drbd
> role, so the output from crm_mon -1 looks like this:
>
> Last updated: Mon Dec 6 15:04:18 2010
> Stack: openais
> Current DC: storage0 - partition WITHOUT quorum
> Version: 1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd
> 2 Nodes configured, 2 expected votes
> 2 Resources configured.
>
> Online: [ storage0 ]
> OFFLINE: [ storage1 ]
>
>  Master/Slave Set: ms_drbd_web
>      Slaves: [ storage0 ]
>      Stopped: [ drbd_web:1 ]
>  Resource Group: iscsi
>      iscsi_ip (ocf::heartbeat:IPaddr2): Started storage0
>      iscsi_web_target (ocf::heartbeat:iSCSITarget): Started storage0
>      iscsi_web_target_lun1 (ocf::heartbeat:iSCSILogicalUnit): Started storage0 FAILED
>
> Failed actions:
>     iscsi_web_target_lun1_start_0 (node=storage0, call=91, rc=1, status=complete): unknown error
>
> and when I try to promote the node I get the following error:
>
> crm(live)resource# promote ms_drbd_web
> Error performing operation: Remote node did not respond
>
> and periodically in /var/log/messages I see the following errors:
>
> Dec 6 14:49:35 storage0 kernel: [ 5048.618562] pengine[8584]: segfault at 8 ip b76ad094 sp bf8261d0 error 4 in libpengine.so.3.0.0[b76a2000+32000]
> Dec 6 14:50:37 storage0 kernel: [ 5111.505491] pengine[8681]: segfault at 0 ip b7831ef3 sp bfd28b30 error 4 in libpengine.so.3.0.0[b7821000+32000]
> Dec 6 14:51:41 storage0 kernel: [ 5174.746349] pengine[8770]: segfault at 8 ip b7751094 sp bfe1ccb0 error 4 in libpengine.so.3.0.0[b7746000+32000]
>
> Why doesn't pacemaker switch the role of the live node to master? And why
> does the segfault happen?

Looks like you ran into problems because of segfaults. I suspect that the segfault has been fixed in the meantime, but hard to say unless you show the backtrace. Best to open a bugzilla with your vendor.

Thanks,

Dejan

> Please help
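Short of a full backtrace, the kernel log lines already allow one small calculation: subtracting the mapping base (the first number in the square brackets) from the instruction pointer gives the offset of the crash inside libpengine.so.3.0.0. The helper below is a sketch; the name `segv_offset` is made up for this note, and the result is only meaningful as a file offset under the assumption that the reported mapping starts at the beginning of the library.

```shell
#!/bin/sh
# segv_offset IP BASE
# Prints IP - BASE in hex; both arguments are hex strings without "0x",
# exactly as they appear in the kernel segfault lines.
segv_offset() {
    printf '%x\n' $(( 0x$1 - 0x$2 ))
}

segv_offset b76ad094 b76a2000   # prints b094
segv_offset b7831ef3 b7821000   # prints 10ef3
segv_offset b7751094 b7746000   # prints b094
```

Two of the three crashes land on the same offset, which suggests a repeatable fault rather than random corruption. With the matching debug symbols installed, a tool such as addr2line (for example `addr2line -e` with the path to the library and the offset) may be able to name the function, which would be useful detail to include in the bug report.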
[Pacemaker] pacemaker segfault
hello

I run pacemaker on Ubuntu (Ubuntu 10.04.1 LTS) with corosync. I installed it from apt, and my pacemaker version is:

r...@storage0:/var/log# dpkg -l | grep 'pacemaker'
ii pacemaker 1.0.8+hg15494-2ubuntu2 HA cluster resource manager

and I have the following problem with pacemaker, with the following configuration:

r...@storage0:/var/log# crm configure show
node storage0
node storage1
primitive drbd_web ocf:linbit:drbd \
        params drbd_resource="web" \
        op monitor interval="10s" timeout="60s"
primitive iscsi_ip ocf:heartbeat:IPaddr2 \
        params ip="192.168.17.19" nic="eth1:1" cidr_netmask="24" \
        op monitor interval="10s" \
        meta target-role="Started"
primitive iscsi_web_target ocf:heartbeat:iSCSITarget \
        params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \
        op monitor interval="10s" timeout="30s" depth="0" \
        meta target-role="Started"
primitive iscsi_web_target_lun1 ocf:heartbeat:iSCSILogicalUnit \
        params lun="1" path="/dev/drbd1" target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \
        op monitor interval="10s" timeout="30s"
group iscsi iscsi_ip iscsi_web_target iscsi_web_target_lun1
ms ms_drbd_web drbd_web \
        meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
colocation iscsi_on_drbd inf: ms_drbd_web:Master iscsi
order iscsi_target_after_drbd inf: ms_drbd_web:promote iscsi_web_target
order iscsi_target_lun_after_iscsi_target inf: iscsi_web_target iscsi_web_target_lun1
property $id="cib-bootstrap-options" \
        dc-version="1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd" \
        cluster-infrastructure="openais" \
        expected-quorum-votes="2" \
        stonith-enabled="false" \
        no-quorum-policy="ignore"
rsc_defaults $id="rsc-options" \
        resource-stickiness="100"

When I shut down node storage1, node storage0 doesn't accept the Master drbd role, so the output from crm_mon -1 looks like this:

Last updated: Mon Dec 6 15:04:18 2010
Stack: openais
Current DC: storage0 - partition WITHOUT quorum
Version: 1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd
2 Nodes configured, 2 expected votes
2 Resources configured.

Online: [ storage0 ]
OFFLINE: [ storage1 ]

 Master/Slave Set: ms_drbd_web
     Slaves: [ storage0 ]
     Stopped: [ drbd_web:1 ]
 Resource Group: iscsi
     iscsi_ip (ocf::heartbeat:IPaddr2): Started storage0
     iscsi_web_target (ocf::heartbeat:iSCSITarget): Started storage0
     iscsi_web_target_lun1 (ocf::heartbeat:iSCSILogicalUnit): Started storage0 FAILED

Failed actions:
    iscsi_web_target_lun1_start_0 (node=storage0, call=91, rc=1, status=complete): unknown error

and when I try to promote the node I get the following error:

crm(live)resource# promote ms_drbd_web
Error performing operation: Remote node did not respond

and periodically in /var/log/messages I see the following errors:

Dec 6 14:49:35 storage0 kernel: [ 5048.618562] pengine[8584]: segfault at 8 ip b76ad094 sp bf8261d0 error 4 in libpengine.so.3.0.0[b76a2000+32000]
Dec 6 14:50:37 storage0 kernel: [ 5111.505491] pengine[8681]: segfault at 0 ip b7831ef3 sp bfd28b30 error 4 in libpengine.so.3.0.0[b7821000+32000]
Dec 6 14:51:41 storage0 kernel: [ 5174.746349] pengine[8770]: segfault at 8 ip b7751094 sp bfe1ccb0 error 4 in libpengine.so.3.0.0[b7746000+32000]

Why doesn't pacemaker switch the role of the live node to master? And why does the segfault happen?

Please help
Re: [Pacemaker] [lvs-users] is it possible to have ldirector and real cluster server on same physical machine?
Check the /var/log/ldirectord.log file for errors and check you can manually start it yourself:

rcldirectord restart

I've had to compile a Perl module myself for ldirector in SLES 11 HAE:

http://www.clusterlabs.org/wiki/Load_Balanced_MySQL_Replicated_Cluster#Missing_Perl_Socket6

You also need lvs_support="true" in your ldirectord resource.

I've added this to the pacemaker list as it may be more suited for support there.

Darren Mansell

-----Original Message-----
From: lvs-users-boun...@linuxvirtualserver.org [mailto:lvs-users-boun...@linuxvirtualserver.org] On Behalf Of Mrvka Andreas
Sent: 06 December 2010 08:44
To: LinuxVirtualServer.org users mailing list.
Subject: Re: [lvs-users] is it possible to have ldirector and real cluster server on same physical machine?

Hello list,

Sorry, I didn't succeed last week in deploying the cluster. Can anybody please show me the error? It has to be somewhere very deep inside.

I only want to have a two-node cluster with Apache load balanced and failing over. It shouldn't be that complex - but where is the error? Maybe the solution or these configs will help others.

Here is my ldirectord.cf (with TABs):

autoreload = yes
checkinterval = 10
checktimeout = 3
logfile = "/var/log/ldirectord.log"
quiescent = yes
virtual = 10.10.11.60:80
        checktype = negotiate
        fallback = 127.0.0.1:80
        protocol = tcp
        real = 10.10.11.61:80 gate
        real = 10.10.11.62:80 gate
        receive = "Still alive"
        request = "test.html"
        scheduler = wlc
        service = http

My crm configure:

node linlbtemp01
node linlbtemp02
primitive ClusterIP ocf:heartbeat:IPaddr2 \
        operations $id="ClusterIP-operations" \
        op monitor interval="5s" timeout="20s" \
        params ip="10.10.11.60" nic="lo" cidr_netmask="16" lvs_support="true"
primitive Virtual-IP-Apache ocf:heartbeat:IPaddr2 \
        params lvs_support="true" ip="10.10.11.60" cidr_netmask="16" broadcast="255.255.255.255" \
        op monitor interval="1m" timeout="10s" \
        meta migration-threshold="10"
primitive apache ocf:heartbeat:apache \
        op monitor interval="30s" timeout="10s" \
        meta migration-threshold="10" target-role="Started" \
        params configfile="/etc/apache2/httpd.conf" httpd="/usr/sbin/httpd" testurl="/test.html"
primitive ldirectord ocf:heartbeat:ldirectord \
        params configfile="/etc/ha.d/ldirectord.cf" \
        op monitor interval="2m" timeout="20s" \
        meta migration-threshold="10" target-role="Started"
group Load-Balancing Virtual-IP-Apache ldirectord
clone cl-apache apache
location Prefer-Node1 ldirectord \
        rule $id="prefer-node1-rule" 100: #uname eq linlbtemp01
property $id="cib-bootstrap-options" \
        dc-version="1.1.2-ecb1e2ea172ba2551f0bd763e557fccde68c849b" \
        cluster-infrastructure="openais" \
        expected-quorum-votes="2"

My /etc/sysctl:

# Disable response to broadcasts.
# You don't want yourself becoming a Smurf amplifier.
net.ipv4.icmp_echo_ignore_broadcasts = 1
# enable route verification on all interfaces
net.ipv4.conf.all.rp_filter = 1
# enable ipV6 forwarding
#net.ipv6.conf.all.forwarding = 1
# increase the number of possible inotify(7) watches
fs.inotify.max_user_watches = 65536
# avoid deleting secondary IPs on deleting the primary IP
#net.ipv4.conf.default.promote_secondaries = 1
#net.ipv4.conf.all.promote_secondaries = 1
#net.ipv4.conf.lo.arp_ignore = 1
#net.ipv4.conf.lo.arp_announce = 2
#net.ipv4.conf.all.arp_ignore = 1
#net.ipv4.conf.all.arp_announce = 2
net.ipv4.conf.all.arp_ignore = 1
net.ipv4.conf.eth0.arp_ignore = 1
net.ipv4.conf.all.arp_announce = 2
net.ipv4.conf.eth0.arp_announce = 2
net.ipv4.ip_forward = 1

My ifcfg-lo:

IPADDR=127.0.0.1
NETMASK=255.0.0.0
NETWORK=127.0.0.0
BROADCAST=127.255.255.255
IPADDR_2=127.0.0.2/8
STARTMODE=onboot
USERCONTROL=no
FIREWALL=no
IPADDR_0=10.10.11.60 #VIP
NETMASK_0=255.255.255.255
NETWORK_0=10.10.11.0
BROADCAST_0=10.10.11.255
LABEL_0=0

Actually it seems that my ldirectord started from openais does not start. Can anybody point me to the error?

Thanks a lot in advance.
Andrew

-----Original Message-----
From: lvs-users-boun...@linuxvirtualserver.org [mailto:lvs-users-boun...@linuxvirtualserver.org] On Behalf Of darren.mans...@opengi.co.uk
Sent: Freitag, 3. Dezember 2010 14:53
To: lvs-us...@linuxvirtualserver.org
Subject: Re: [lvs-users] is it possible to have ldirector and real cluster server on same physical machine?

Glad it helped. This is my original howto for this kind of setup:

http://www.clusterlabs.org/wiki/Load_Balanced_MySQL_Replicated_Cluster

darren

-----Original Message-----
From: lvs-users-boun...@linuxvirtualserver.org [mailto:lvs-users-boun...@linuxvirtualserver.org] On Behalf Of Mrvka Andreas
Sent: 03 December 2010 13:46
To: 'LinuxVirtualServer.org users mailing list.'
Subject: Re: [lvs-users] is it possible to have ldirector and real cluster server on same physical machine?

Hi Darren, thank
Re: [Pacemaker] lampp support
Hi,

By lampp you mean Linux, Apache, MySQL PHP/Perl/Python? And what's stopping you from using the apache and mysql RA's? You want some kind of all-in-wonder? Try a resource group.

HTH

jiaju liu wrote:
> Hi all
> I want to add lampp into my HA cluster. Unfortunately, I have not
> found a lampp RA. Does this mean I should write the RA myself, or is
> my resource-agents version too old?
>
> My package versions are:
>
> cluster-glue-libs-devel-1.0.5-1.el5
> cluster-glue-1.0.5-1.el5
> cluster-glue-libs-1.0.5-1.el5
>
> heartbeat-libs-3.0.3-2.el5
> heartbeat-devel-3.0.3-2.el5
> heartbeat-3.0.3-2.el5
>
> openais-1.1.0-1.el5
> openaislib-1.1.0-1.el5
> openaislib-devel-1.1.0-1.el5
>
> corosynclib-devel-1.2.2-1.1.el5
> corosynclib-1.2.2-1.1.el5
> corosync-1.2.2-1.1.el5
>
> pacemaker-libs-1.0.8-6.1.el5
> pacemaker-1.0.8-6.1.el5
> pacemaker-libs-devel-1.0.8-6.1.el5

--
Dan FRINCU
Systems Engineer
CCNA, RHCE
Streamwide Romania
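The resource-group suggestion could look something like the configuration printed by the script below. It is a sketch under assumptions: the resource names (lamp_mysql, lamp_apache, lamp), the monitor intervals and the Apache config path are placeholders chosen for illustration, and the script only prints the text so it can be reviewed before anything is entered at the crm(live)configure# prompt.

```shell
#!/bin/sh
# Print a crm-shell configuration for a LAMP-style resource group.
# Names, timeouts and the configfile path are hypothetical placeholders.
print_lamp_group() {
    cat <<'EOF'
primitive lamp_mysql ocf:heartbeat:mysql \
        op monitor interval="30s" timeout="30s"
primitive lamp_apache ocf:heartbeat:apache \
        params configfile="/etc/httpd/conf/httpd.conf" \
        op monitor interval="30s" timeout="20s"
group lamp lamp_mysql lamp_apache
EOF
}

print_lamp_group
```

Members of a group are kept on the same node and started in the order listed, so MySQL comes up before Apache here; PHP needs no resource of its own when it runs inside Apache.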