Re: [Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.

2011-04-26 Thread Andrew Beekhof
On Wed, Apr 20, 2011 at 1:32 PM, Rakesh K rakirocker4...@gmail.com wrote:
 Jelle de Jong jelledejong@... writes:
 Hi Jelle de Jong


 On 20-04-11 11:44, rakesh k wrote:
  How can we detect network failure in pacemaker configuration.

 http://www.clusterlabs.org/wiki/Pingd_with_resources_on_different_networks
 http://www.woodwose.net/thatremindsme/2011/04/the-pacemaker-ping-resource-agent/
 http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre

 crm configure help location
 crm ra info ocf:ping

 That should give you a jump start.

 You may need to increase the corosync token.

 Kind regards,

 Jelle de Jong

 ___
 Pacemaker mailing list: Pacemaker@...
 http://oss.clusterlabs.org/mailman/listinfo/pacemaker

 Project Home: http://www.clusterlabs.org
 Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
 Bugs: 
 http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


 Thanks for the help.

 My question is this:

 I went through the scripts and found that the ping_update method has a
 variable called ACTIVE, the number of nodes (from host_list) that are
 active. Based on this value, for our scenario, can we stop the
 heartbeat/pacemaker process when the host node cannot ping any other node
 in the cluster framework?

No. host_list should never contain the addresses of cluster nodes.
The ping RA is intended to test _external_ connectivity.
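
A minimal crm shell sketch of that advice, for illustration only. The
gateway address 192.0.2.1 (a documentation address), the resource name
my_service, and the IDs p_ping, cl_ping and loc_needs_net are all
placeholders and do not come from this thread. The attribute name pingd is
the ping agent's default; adjust it if the "name" parameter is set.

```
# Ping an external host (placeholder address), never another cluster node.
primitive p_ping ocf:pacemaker:ping \
    params host_list="192.0.2.1" multiplier="100" dampen="5s" \
    op monitor interval="10s" timeout="60s"
# Run the ping agent on every node so each node records its own connectivity.
clone cl_ping p_ping
# Keep the hypothetical resource my_service off nodes with no connectivity.
location loc_needs_net my_service \
    rule -inf: not_defined pingd or pingd lte 0
```

A constraint like this only moves resources away from a node that has lost
external connectivity. It does not stop heartbeat or pacemaker, and by
itself it does not resolve a split brain between the nodes.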

 Please give me your suggestions so that they help us in
 our context.

 Regards
 Rakesh






[Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.

2011-04-20 Thread rakesh k
Hello Everybody


How can we detect a network failure in a Pacemaker configuration? My two
nodes in the cluster framework are set up as follows:

two network routers, connected via a switch that acts as the mediator for
communication.

How can we detect a network failure and stop the heartbeat processes? When I
shut down the interface, I see a split-brain issue: heartbeat is running on
both nodes, and each node acts as a separate heartbeat instance.

I have configured the pingd resource that comes with Pacemaker as a clone.
When there is a network failure, I see a split-brain issue where the
heartbeat processes start separately on both nodes. My question is: is there
any way to stop the heartbeat process when pingd on a particular node
reports that there is no communication between the interface and the node
where HA is running?

Regards
rakesh


Re: [Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.

2011-04-20 Thread Jelle de Jong
On 20-04-11 11:44, rakesh k wrote:
 How can we detect network failure in pacemaker configuration.

http://www.clusterlabs.org/wiki/Pingd_with_resources_on_different_networks
http://www.woodwose.net/thatremindsme/2011/04/the-pacemaker-ping-resource-agent/
http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre

crm configure help location
crm ra info ocf:ping

That should give you a jump start.

You may need to increase the corosync token.
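
If the cluster runs on corosync rather than heartbeat, the token timeout is
set in the totem section of corosync.conf. A sketch, where 10000 is an
arbitrary illustrative value and not a recommendation from this thread:

```
totem {
    version: 2
    # Token timeout in milliseconds; 10000 is an assumed example value.
    token: 10000
}
```

A larger token makes the cluster slower to declare a node lost, which
tolerates short network hiccups at the cost of slower failover.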

Kind regards,

Jelle de Jong



Re: [Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.

2011-04-20 Thread Rakesh K
Jelle de Jong jelledejong@... writes:
Hi Jelle de Jong

 
 On 20-04-11 11:44, rakesh k wrote:
  How can we detect network failure in pacemaker configuration.
 
 http://www.clusterlabs.org/wiki/Pingd_with_resources_on_different_networks
 http://www.woodwose.net/thatremindsme/2011/04/the-pacemaker-ping-resource-agent/
 http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre
 
 crm configure help location
 crm ra info ocf:ping
 
 That should give you a jump start.
 
 You may need to increase the corosync token.
 
 Kind regards,
 
 Jelle de Jong
 
 
Thanks for the help.

My question is this:

I went through the scripts and found that the ping_update method has a
variable called ACTIVE, the number of nodes (from host_list) that are
active. Based on this value, for our scenario, can we stop the
heartbeat/pacemaker process when the host node cannot ping any other node
in the cluster framework? Please give me your suggestions so that they help
us in our context.

Regards
Rakesh



