Re: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase

2010-12-06 Thread Junko IKEDA

Hi,

Supplemental explanation;
We have the movie to promote Pacemaker in Japan.
http://linux-ha.sourceforge.jp/wp/

Here is the long version(about 40 seconds).
http://www.youtube.com/watch?v=D141O2U_nyc

The main purpose of this movie is to let people know about Pacemaker,
because many people still confuse Pacemaker and Heartbeat.

Now, our boss got the additional budget to make the English version of  
this movie,

(I was really surprised at this, he might have some magic wand.)
so that's why, we need your help :)
Please let me know if you have a good catchphrase for Pacemaker!

By the way, the literal translation of a Japanese phrase is like this;
Something is missing...

Computer systems have used Heartbeat for a reliable zero downtime  
solution.
Now, Heartbeat is evolving to the next level of high availability and  
robustness.

That's Pacemaker!
Pacemaker is the next generation of high availability clustering for  
Linux.
Pacemaker achieves maximum availability for your cluster services by  
detecting and recovering from node and service-level failures.

Now, for your high availability clusters,
Use Pacemaker!

I know Pacemaker is cluster manager, and Heartbeat is messaging layer,
so there might be some misleading phrases, for example, "evolving" or  
"next generation".
If there is the clear phrase to explain the connection of Pacemaker,  
Heartbeat and Corosync,

it's the best one.

Thanks,
Junko

NTT DATA INTELLILINK CORPORATION


On 2010/12/07, at 1:19, Angelo Höngens wrote:

You mean a 'tagline' of some sort? Here's some from the top of my  
head:


- keeps your business running
- high-availability for the masses
- makes systems administrators sleep at night
- keeps your bits pumping
- maximum uptime, minimum cost
- robust, reliable and r...?
- An open source cluster suite


--


With kind regards,


Angelo Höngens

Systems Administrator

--
NetMatch
tourism internet software solutions

Ringbaan Oost 2b
5013 CA Tilburg
T: +31 (0)13 5811088
F: +31 (0)13 5821239

mailto:a.hong...@netmatch.nl
http://www.netmatch.nl
--



-Original Message-
From: Andrew Beekhof [mailto:and...@beekhof.net]
Sent: maandag 6 december 2010 16:45
To: pacemaker@oss.clusterlabs.org
Subject: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase

I am in need of a catchphrase for the Pacemaker project.
Having no imagination for things not related to C, I turn to you the
community :-)

Ideas anyone?

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started:
http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-
foundation.org/enter_bug.cgi?product=Pacemaker


___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker



___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] lampp support

2010-12-06 Thread jiaju liu

Hi,Thank you for your reply lampp is Linux Apache and MySQL PHP, I use it at 
begin, so I want to add it in cluster


Hi,

By lampp you mean Linux, Apache, MySQL PHP/Perl/Python?

And what's stopping you from using the apache and mysql RA's? You want 
some kind of all-in-wonder? Try a resource group.

HTH

jiaju liu wrote:
>
>     Hi all
>     I want to add lampp into my HA cluster, unfortunately, I have not
>     find lampp RA, is this mean I should write RA by myself or my
>     resource-agent version is too old?
>
>     my packages version are
>
>      
>
>     cluster-glue-libs-devel-1.0.5-1.el5
>     cluster-glue-1.0.5-1.el5
>     cluster-glue-libs-1.0.5-1.el5
>
>      
>
>     heartbeat-libs-3.0.3-2.el5
>     heartbeat-devel-3.0.3-2.el5
>     heartbeat-3.0.3-2.el5 
>
>     openais-1.1.0-1.el5
>     openaislib-1.1.0-1.el5
>     openaislib-devel-1.1.0-1.el5
>
>      
>
>     corosynclib-devel-1.2.2-1.1.el5
>     corosynclib-1.2.2-1.1.el5
>     corosync-1.2.2-1.1.el5
>
>      
>
>     pacemaker-libs-1.0.8-6.1.el5
>     pacemaker-1.0.8-6.1.el5
>     pacemaker-libs-devel-1.0.8-6.1.el5
>
>
>  



  ___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] OCF RA dev guide: final heads up

2010-12-06 Thread Bob Schatz
Florain,

Comments below with [BS]


Thanks,

Bob


- Original Message 
From: Florian Haas 
To: pacemaker@oss.clusterlabs.org
Sent: Mon, December 6, 2010 7:25:28 AM
Subject: Re: [Pacemaker] OCF RA dev guide: final heads up

Hello Bob,

On 2010-12-03 20:12, Bob Schatz wrote:
> Florian,
> 
> Thanks for writing this!
> 
> I already found one or two errors related to return codes in my agent based 
> on 

> your document. :)
> 
> I have not read the entire document but I do have these comments:
> 
> 1.Does this document apply to all versions of the agent framework or only 
> certain versions(hopefully all in one place)?  I think the document should 
> have 
>
> a section which specifies which versions are covered.  Also, if certain areas 
> only apply to a certain version then a "Note" should be mentioned in the 
> section.
> 
> 2.In Section 3.8  OCF_NOT_RUNNING, how can a monitor return 
> OCF_FAILED_MASTER?   Is there an environment variable passed to the monitor 
> action which says "I think you are a master - tell me if you or are not"?

No, the very purpose of monitor is to _find out_ the status of the
resource. If the resource can query its own master/slave status, it
should do so, and then if it is both a master and failed, it should
return OCF_FAILED_MASTER.


[BS] Okay.  That makes sense now.

> 3.In Section 5.3 monitor action, it would be nice if you show how a 
> OCF_FAILED_MASTER is returned.

Hm. Let me defer that for a little bit.


[BS] Sounds good

> 4.Sections 5.8 migrate_to action and 5.9 migrate_from action, do these 
>apply 
>
> to master/slave resources also or only to primitive resources?

Good question, and indeed I don't know. It's conceivable that a clone
set (remember, m/s are just clones with a little extra) has a clone-max
that is less than the number of nodes in the cluster, and supports
migration, and therefore a clone instance should be able to live-migrate
to a different node. I have no clue whether it's indeed implemented that
way, though.

Andrew, maybe you can shed some extra light on this?

> 5.Section 5.10 notify action, I think you to want to add a note/reference 
>to 
>
> the Pacemaker Configuration Explained section "10.3.3.9 Proper Interpretation 
>of 
>
> Notification Environment Variables".  (Section name may be different as I was 
> looking at 1.0 from about a year ago).

Good idea. I'll put that on my to-do list.

> 6.Section 8.4 Specifying a master preference, starting in at least 
> version 

> of Pacemaker 1.0.9.1 it is possible to specify a negative master score.  I 
>think 
>
> it would be good to add this to the example as well as a note about which 
> version has this functionality since it was broken in 1.0.6.

Don't you think this would just royally confuse people?


[BS] You are probably right.  I guess you don't want to document "bugs" and 
workarounds from past releases in the current manual.   That makes sense.


Florian


  

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


[Pacemaker] announcement for glue 1.0.7 and heartbeat 3.0.4

2010-12-06 Thread Dejan Muhamedagic
Hello,

The new releases of Heartbeat (3.0.4) and cluster-glue (1.0.7)
are out. Please upgrade at the earliest possible opportunity.

The highlights:

- cluster-glue

- external stonith plugins report now real exit codes (with
  Pacemaker 1.1 it could happen that say failed monitor
  operation is reported as success to the upper layers)
- improved logging for stonith with Pacemaker 1.1 (should get
  rid of the annoying "parse error" log messages by 
fence_legacy)
- new external/nut stonith plugin (Network UPS Tools) (thanks
  to William Seligman)
- rename dtr|rts parameter to dtr_rts in rcd_serial; it was
  necessary to change it, otherwise it cannot be used in XML;
  NB: this is a regression for v1 users
- optimization in logd and cl_log of logging to files
- (note for Ubuntu users) the new ubuntu startup lrmd
  plugin is included but unfortunately not functional,
  it will have to wait for one of the next releases


- Heartbeat

- various optimizations at the communication layer
- better support for Pacemaker 1.1
  (make sure to use traditional compression)
- new mcast6 UDP IPv6 communication plugin

You can get the current tarballs here:

Heartbeat:
http://hg.linux-ha.org/heartbeat-STABLE_3_0/archive/STABLE-3.0.4.tar.bz2

cluster-glue:
http://hg.linux-ha.org/glue/archive/glue-1.0.7.tar.bz2

Enjoy!

Lars Ellenberg
Florian Haas
Dejan Muhamedagic

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase

2010-12-06 Thread Angelo Höngens
You mean a 'tagline' of some sort? Here's some from the top of my head:
 
- keeps your business running
- high-availability for the masses
- makes systems administrators sleep at night
- keeps your bits pumping
- maximum uptime, minimum cost
- robust, reliable and r...?
- An open source cluster suite


-- 

 
With kind regards,
 
 
Angelo Höngens
 
Systems Administrator
 
--
NetMatch
tourism internet software solutions
 
Ringbaan Oost 2b
5013 CA Tilburg
T: +31 (0)13 5811088
F: +31 (0)13 5821239
 
mailto:a.hong...@netmatch.nl
http://www.netmatch.nl
--


> -Original Message-
> From: Andrew Beekhof [mailto:and...@beekhof.net]
> Sent: maandag 6 december 2010 16:45
> To: pacemaker@oss.clusterlabs.org
> Subject: [Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase
> 
> I am in need of a catchphrase for the Pacemaker project.
> Having no imagination for things not related to C, I turn to you the
> community :-)
> 
> Ideas anyone?
> 
> ___
> Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started:
> http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://developerbugs.linux-
> foundation.org/enter_bug.cgi?product=Pacemaker

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] ClusterMon

2010-12-06 Thread Ryan Steele

I think I've found the problem - comments below.

On 12/05/2010 08:26 PM, Ryan Steele wrote:

Hi folks,



<...snip...>

crm(live)configure# primitive ResourceMonitor ocf:pacemaker:ClusterMon 
params pidfile="/var/run/crm_mon.pid" htmlfile="/var/tmp/crm_mon.html" 
extra_options="-T o...@example.com -F 'Cluster Monitor 
' -H smtp.example.com:25 -P '[LDAP 
Cluster]: Resource Changes Detected'" op monitor interval="10s" 
timeout="20s"



The issue here is that crm_mon.c uses smtp_set_reverse_path(), which 
expects only a mailbox name, e.g. 'clustermoni...@example.com'.  It does 
not appear to accept other variants, such as 'Cluster Monitor 
'; doing that results in the entire 
--mail-from argument being interpreted within angle brackets, which 
turns 'Cluster Monitor ' into <'Cluster 
Monitor' '>, which is a malformed sender 
address.


Would the devs find it reasonable to accept a patch that uses something 
like regcomp() to check and see if the argument provided already 
contains a 'From: ', and if so, uses smtp_set_header() instead of 
smtp_set_reverse_path()?  That would maintain backwards compatibility 
with the current style & give one the ability to form more complex From: 
addresses, since smtp_set_header() is passed through unchanged.


Thanks,
Ryan

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


[Pacemaker] Crowdsourcing - Need a Pacemaker catchphrase

2010-12-06 Thread Andrew Beekhof
I am in need of a catchphrase for the Pacemaker project.
Having no imagination for things not related to C, I turn to you the
community :-)

Ideas anyone?

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] OCF RA dev guide: final heads up

2010-12-06 Thread Florian Haas
Hello Bob,

On 2010-12-03 20:12, Bob Schatz wrote:
> Florian,
> 
> Thanks for writing this!
> 
> I already found one or two errors related to return codes in my agent based 
> on 
> your document. :)
> 
> I have not read the entire document but I do have these comments:
> 
> 1.Does this document apply to all versions of the agent framework or only 
> certain versions(hopefully all in one place)?  I think the document should 
> have 
> a section which specifies which versions are covered.  Also, if certain areas 
> only apply to a certain version then a "Note" should be mentioned in the 
> section.
> 
> 2.In Section 3.8  OCF_NOT_RUNNING, how can a monitor return 
> OCF_FAILED_MASTER?   Is there an environment variable passed to the monitor 
> action which says "I think you are a master - tell me if you or are not"?

No, the very purpose of monitor is to _find out_ the status of the
resource. If the resource can query its own master/slave status, it
should do so, and then if it is both a master and failed, it should
return OCF_FAILED_MASTER.

> 3.In Section 5.3 monitor action, it would be nice if you show how a 
> OCF_FAILED_MASTER is returned.

Hm. Let me defer that for a little bit.

> 4.Sections 5.8 migrate_to action and 5.9 migrate_from action, do these 
> apply 
> to master/slave resources also or only to primitive resources?

Good question, and indeed I don't know. It's conceivable that a clone
set (remember, m/s are just clones with a little extra) has a clone-max
that is less than the number of nodes in the cluster, and supports
migration, and therefore a clone instance should be able to live-migrate
to a different node. I have no clue whether it's indeed implemented that
way, though.

Andrew, maybe you can shed some extra light on this?

> 5.Section 5.10 notify action, I think you to want to add a note/reference 
> to 
> the Pacemaker Configuration Explained section "10.3.3.9 Proper Interpretation 
> of 
> Notification Environment Variables".  (Section name may be different as I was 
> looking at 1.0 from about a year ago).

Good idea. I'll put that on my to-do list.

> 6.Section 8.4 Specifying a master preference, starting in at least 
> version 
> of Pacemaker 1.0.9.1 it is possible to specify a negative master score.  I 
> think 
> it would be good to add this to the example as well as a note about which 
> version has this functionality since it was broken in 1.0.6.

Don't you think this would just royally confuse people?

Florian



signature.asc
Description: OpenPGP digital signature
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] OCF RA dev guide: final heads up

2010-12-06 Thread Florian Haas
On 2010-12-03 12:21, Brett Delle Grazie wrote:
> May I just say:
> 
> Thank you :)
> 
> Very nice work.

Quite welcome, thanks for the compliments.

Cheers,
Florian



signature.asc
Description: OpenPGP digital signature
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] pacemaker segfault

2010-12-06 Thread Dejan Muhamedagic
Hi,

On Mon, Dec 06, 2010 at 03:11:03PM +0300, ruslan usifov wrote:
> hello
> 
> I run pacemaker on ubuntu (Ubuntu 10.04.1 LTS) with corosync, i installed it
> from apt, and my pacemaker version is:
> 
> r...@storage0:/var/log# dpkg -l | grep 'pacemaker'
> ii  pacemaker   1.0.8+hg15494-2ubuntu2  HA
> cluster resource manager
> 
> 
> and have follow problem with pacemaker, with follow configration:
> r...@storage0:/var/log# crm configure show
> node storage0
> node storage1
> primitive drbd_web ocf:linbit:drbd \
> params drbd_resource="web" \
> op monitor interval="10s" timeout="60s"
> primitive iscsi_ip ocf:heartbeat:IPaddr2 \
> params ip="192.168.17.19" nic="eth1:1" cidr_netmask="24" \
> op monitor interval="10s" \
> meta target-role="Started"
> primitive iscsi_web_target ocf:heartbeat:iSCSITarget \
> params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet"
> \
> op monitor interval="10s" timeout="30s" depth="0" \
> meta target-role="Started"
> primitive iscsi_web_target_lun1 ocf:heartbeat:iSCSILogicalUnit \
> params lun="1" path="/dev/drbd1"
> target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \
> op monitor interval="10s" timeout="30s"
> group iscsi iscsi_ip iscsi_web_target iscsi_web_target_lun1
> ms ms_drbd_web drbd_web \
> meta master-max="1" master-node-max="1" clone-max="2"
> clone-node-max="1" notify="true"
> colocation iscsi_on_drbd inf: ms_drbd_web:Master iscsi
> order iscsi_target_after_drbd inf: ms_drbd_web:promote iscsi_web_target
> order iscsi_target_lun_after_iscsi_target inf: iscsi_web_target
> iscsi_web_target_lun1
> property $id="cib-bootstrap-options" \
> dc-version="1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd" \
> cluster-infrastructure="openais" \
> expected-quorum-votes="2" \
> stonith-enabled="false" \
> no-quorum-policy="ignore"
> rsc_defaults $id="rsc-options" \
> resource-stickiness="100"
> 
> 
> When i shutdown node storage1, node storage0 doesn't  accept Master drbd
> role, so output from crm_mon -1 lokks like this:
> 
> Last updated: Mon Dec  6 15:04:18 2010
> Stack: openais
> Current DC: storage0 - partition WITHOUT quorum
> Version: 1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd
> 2 Nodes configured, 2 expected votes
> 2 Resources configured.
> 
> 
> Online: [ storage0 ]
> OFFLINE: [ storage1 ]
> 
>  Master/Slave Set: ms_drbd_web
>  Slaves: [ storage0 ]
>  Stopped: [ drbd_web:1 ]
>  Resource Group: iscsi
>  iscsi_ip   (ocf::heartbeat:IPaddr2):   Started storage0
>  iscsi_web_target   (ocf::heartbeat:iSCSITarget):   Started storage0
>  iscsi_web_target_lun1  (ocf::heartbeat:iSCSILogicalUnit):
> Started storage0 FAILED
> 
> Failed actions:
> iscsi_web_target_lun1_start_0 (node=storage0, call=91, rc=1,
> status=complete): unknown error
> 
> 
> and when try to promote node got folow error:
> crm(live)resource# promote ms_drbd_web
> Error performing operation: Remote node did not respond
> 
> 
> and periodicaly in /var/log/messages, i see folow error:
> Dec  6 14:49:35 storage0 kernel: [ 5048.618562] pengine[8584]: segfault at 8
> ip b76ad094 sp bf8261d0 error 4 in libpengine.so.3.0.0[b76a2000+32000]
> Dec  6 14:50:37 storage0 kernel: [ 5111.505491] pengine[8681]: segfault at 0
> ip b7831ef3 sp bfd28b30 error 4 in libpengine.so.3.0.0[b7821000+32000]
> Dec  6 14:51:41 storage0 kernel: [ 5174.746349] pengine[8770]: segfault at 8
> ip b7751094 sp bfe1ccb0 error 4 in libpengine.so.3.0.0[b7746000+32000]
> 
> 
> 
> Why pacemacker doesn't switch role of live node to master? And why segfault
> happens?

Looks like you ran into problems because of segfaults. I suspect
that the segfault has been fixed in the meantime, but hard to
say unless you show the backtrace. Best to open a bugzilla with
your vendor.

Thanks,

Dejan


> Please help

> ___
> Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: 
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


[Pacemaker] pacemaker segfault

2010-12-06 Thread ruslan usifov
hello

I run pacemaker on ubuntu (Ubuntu 10.04.1 LTS) with corosync, i installed it
from apt, and my pacemaker version is:

r...@storage0:/var/log# dpkg -l | grep 'pacemaker'
ii  pacemaker   1.0.8+hg15494-2ubuntu2  HA
cluster resource manager


and have follow problem with pacemaker, with follow configration:
r...@storage0:/var/log# crm configure show
node storage0
node storage1
primitive drbd_web ocf:linbit:drbd \
params drbd_resource="web" \
op monitor interval="10s" timeout="60s"
primitive iscsi_ip ocf:heartbeat:IPaddr2 \
params ip="192.168.17.19" nic="eth1:1" cidr_netmask="24" \
op monitor interval="10s" \
meta target-role="Started"
primitive iscsi_web_target ocf:heartbeat:iSCSITarget \
params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet"
\
op monitor interval="10s" timeout="30s" depth="0" \
meta target-role="Started"
primitive iscsi_web_target_lun1 ocf:heartbeat:iSCSILogicalUnit \
params lun="1" path="/dev/drbd1"
target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \
op monitor interval="10s" timeout="30s"
group iscsi iscsi_ip iscsi_web_target iscsi_web_target_lun1
ms ms_drbd_web drbd_web \
meta master-max="1" master-node-max="1" clone-max="2"
clone-node-max="1" notify="true"
colocation iscsi_on_drbd inf: ms_drbd_web:Master iscsi
order iscsi_target_after_drbd inf: ms_drbd_web:promote iscsi_web_target
order iscsi_target_lun_after_iscsi_target inf: iscsi_web_target
iscsi_web_target_lun1
property $id="cib-bootstrap-options" \
dc-version="1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd" \
cluster-infrastructure="openais" \
expected-quorum-votes="2" \
stonith-enabled="false" \
no-quorum-policy="ignore"
rsc_defaults $id="rsc-options" \
resource-stickiness="100"


When i shutdown node storage1, node storage0 doesn't  accept Master drbd
role, so output from crm_mon -1 lokks like this:

Last updated: Mon Dec  6 15:04:18 2010
Stack: openais
Current DC: storage0 - partition WITHOUT quorum
Version: 1.0.8-042548a451fce8400660f6031f4da6f0223dd5dd
2 Nodes configured, 2 expected votes
2 Resources configured.


Online: [ storage0 ]
OFFLINE: [ storage1 ]

 Master/Slave Set: ms_drbd_web
 Slaves: [ storage0 ]
 Stopped: [ drbd_web:1 ]
 Resource Group: iscsi
 iscsi_ip   (ocf::heartbeat:IPaddr2):   Started storage0
 iscsi_web_target   (ocf::heartbeat:iSCSITarget):   Started storage0
 iscsi_web_target_lun1  (ocf::heartbeat:iSCSILogicalUnit):
Started storage0 FAILED

Failed actions:
iscsi_web_target_lun1_start_0 (node=storage0, call=91, rc=1,
status=complete): unknown error


and when try to promote node got folow error:
crm(live)resource# promote ms_drbd_web
Error performing operation: Remote node did not respond


and periodicaly in /var/log/messages, i see folow error:
Dec  6 14:49:35 storage0 kernel: [ 5048.618562] pengine[8584]: segfault at 8
ip b76ad094 sp bf8261d0 error 4 in libpengine.so.3.0.0[b76a2000+32000]
Dec  6 14:50:37 storage0 kernel: [ 5111.505491] pengine[8681]: segfault at 0
ip b7831ef3 sp bfd28b30 error 4 in libpengine.so.3.0.0[b7821000+32000]
Dec  6 14:51:41 storage0 kernel: [ 5174.746349] pengine[8770]: segfault at 8
ip b7751094 sp bfe1ccb0 error 4 in libpengine.so.3.0.0[b7746000+32000]



Why pacemacker doesn't switch role of live node to master? And why segfault
happens?
Please help
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] [lvs-users] is it possible to have ldirector and real cluster server on same physical machine?

2010-12-06 Thread Darren.Mansell
Check the /var/log/ldirectord.log file for errors and check you can manually 
start it yourself: 

rcldirectord restart

I've had to compile a Perl module myself for ldirector in SLES 11 HAE: 
http://www.clusterlabs.org/wiki/Load_Balanced_MySQL_Replicated_Cluster#Missing_Perl_Socket6
 

You also need lvs_support="true" in your ldirectord resource.

I've added this to the pacemaker list as it may be more suited for support 
there.

Darren Mansell


-Original Message-
From: lvs-users-boun...@linuxvirtualserver.org 
[mailto:lvs-users-boun...@linuxvirtualserver.org] On Behalf Of Mrvka Andreas
Sent: 06 December 2010 08:44
To: LinuxVirtualServer.org users mailing list.
Subject: Re: [lvs-users] is it possible to have ldirector and real cluster 
server on same physical machine?

Hello list,

sorrily I didn't succeed last week in deploying the cluster.
Please can anybody show me the error? It has to be somewhere very deep inside.

I only want to have a two node cluster with apache load balanced and 
fail-overing.
 It shouldn't be that complex - but where is the error?

Maby the solution or this configs will help others.

Here my ldirectord.cf (with TABs)
autoreload = yes
checkinterval = 10
checktimeout = 3
logfile = "/var/log/ldirectord.log"
quiescent = yes
virtual = 10.10.11.60:80
checktype = negotiate
fallback = 127.0.0.1:80
protocol = tcp
real = 10.10.11.61:80 gate
real = 10.10.11.62:80 gate
receive = "Still alive"
request = "test.html"
scheduler = wlc
service = http

My crm configure:

node linlbtemp01
node linlbtemp02
primitive ClusterIP ocf:heartbeat:IPaddr2 \
operations $id="ClusterIP-operations" \
op monitor interval="5s" timeout="20s" \
params ip="10.10.11.60" nic="lo" cidr_netmask="16" lvs_support="true"
primitive Virtual-IP-Apache ocf:heartbeat:IPaddr2 \
params lvs_support="true" ip="10.10.11.60" cidr_netmask="16" 
broadcast="255.255.255.255" \
op monitor interval="1m" timeout="10s" \
meta migration-threshold="10"
primitive apache ocf:heartbeat:apache \
op monitor interval="30s" timeout="10s" \
meta migration-threshold="10" target-role="Started" \
params configfile="/etc/apache2/httpd.conf" httpd="/usr/sbin/httpd" 
testurl="/test.html"
primitive ldirectord ocf:heartbeat:ldirectord \
params configfile="/etc/ha.d/ldirectord.cf" \
op monitor interval="2m" timeout="20s" \
meta migration-threshold="10" target-role="Started"
group Load-Balancing Virtual-IP-Apache ldirectord clone cl-apache apache 
location Prefer-Node1 ldirectord \
rule $id="prefer-node1-rule" 100: #uname eq linlbtemp01 property 
$id="cib-bootstrap-options" \
dc-version="1.1.2-ecb1e2ea172ba2551f0bd763e557fccde68c849b" \
cluster-infrastructure="openais" \
expected-quorum-votes="2"

My /etc/sysctl:
# Disable response to broadcasts.
# You don't want yourself becoming a Smurf amplifier.
net.ipv4.icmp_echo_ignore_broadcasts = 1 # enable route verification on all 
interfaces net.ipv4.conf.all.rp_filter = 1 # enable ipV6 forwarding 
#net.ipv6.conf.all.forwarding = 1 # increase the number of possible inotify(7) 
watches fs.inotify.max_user_watches = 65536 # avoid deleting secondary IPs on 
deleting the primary IP #net.ipv4.conf.default.promote_secondaries = 1 
#net.ipv4.conf.all.promote_secondaries = 1 #net.ipv4.conf.lo.arp_ignore = 1 
#net.ipv4.conf.lo.arp_announce = 2 #net.ipv4.conf.all.arp_ignore = 1 
#net.ipv4.conf.all.arp_announce = 2 net.ipv4.conf.all.arp_ignore = 1 
net.ipv4.conf.eth0.arp_ignore = 1 net.ipv4.conf.all.arp_announce = 2 
net.ipv4.conf.eth0.arp_announce = 2 net.ipv4.ip_forward = 1


My ifcfg-lo:

IPADDR=127.0.0.1
NETMASK=255.0.0.0
NETWORK=127.0.0.0
BROADCAST=127.255.255.255
IPADDR_2=127.0.0.2/8
STARTMODE=onboot
USERCONTROL=no
FIREWALL=no
IPADDR_0=10.10.11.60   #VIP
NETMASK_0=255.255.255.255
NETWORK_0=10.10.11.0
BROADCAST_0=10.10.11.255
LABEL_0=0


Actually it seems, that my ldirectord out of openais does not start.

Can anybody point me to the error?

Thanks a lot in advance.
Andrew


-Original Message-
From: lvs-users-boun...@linuxvirtualserver.org 
[mailto:lvs-users-boun...@linuxvirtualserver.org] On Behalf Of 
darren.mans...@opengi.co.uk
Sent: Freitag, 3. Dezember 2010 14:53
To: lvs-us...@linuxvirtualserver.org
Subject: Re: [lvs-users] is it possible to have ldirectorand realcluster server 
on same physical machine?

Glad it helped. This is my original howto for this kind of setup:

http://www.clusterlabs.org/wiki/Load_Balanced_MySQL_Replicated_Cluster 

darren


-Original Message-
From: lvs-users-boun...@linuxvirtualserver.org 
[mailto:lvs-users-boun...@linuxvirtualserver.org] On Behalf Of Mrvka Andreas
Sent: 03 December 2010 13:46
To: 'LinuxVirtualServer.org users mailing list.'
Subject: Re: [lvs-users] is it possible to have ldirectorand realcluster server 
on same physical machine?

Hi Darren,

thank 

Re: [Pacemaker] lampp support

2010-12-06 Thread Dan Frincu

Hi,

By lampp you mean Linux, Apache, MySQL PHP/Perl/Python?

And what's stopping you from using the apache and mysql RA's? You want 
some kind of all-in-wonder? Try a resource group.


HTH

jiaju liu wrote:


Hi all
I want to add lampp into my HA cluster, unfortunately, I have not
find lampp RA, is this mean I should write RA by myself or my
resource-agent version is too old?

my packages version are

 


cluster-glue-libs-devel-1.0.5-1.el5
cluster-glue-1.0.5-1.el5
cluster-glue-libs-1.0.5-1.el5

 


heartbeat-libs-3.0.3-2.el5
heartbeat-devel-3.0.3-2.el5
heartbeat-3.0.3-2.el5 


openais-1.1.0-1.el5
openaislib-1.1.0-1.el5
openaislib-devel-1.1.0-1.el5

 


corosynclib-devel-1.2.2-1.1.el5
corosynclib-1.2.2-1.1.el5
corosync-1.2.2-1.1.el5

 


pacemaker-libs-1.0.8-6.1.el5
pacemaker-1.0.8-6.1.el5
pacemaker-libs-devel-1.0.8-6.1.el5


 



___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
  


--
Dan FRINCU
Systems Engineer
CCNA, RHCE
Streamwide Romania

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker