Re: [Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.
On Wed, Apr 20, 2011 at 1:32 PM, Rakesh K rakirocker4...@gmail.com wrote: Jelle de Jong jelledejong@... writes: Hi Jelle de Jong On 20-04-11 11:44, rakesh k wrote: How can we detect network failure in pacemaker configuration. http://www.clusterlabs.org/wiki/Pingd_with_resources_on_different_networks http://www.woodwose.net/thatremindsme/2011/04/the-pacemaker-ping-resource-agent/ http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre crm configure help location crm ra info ocf:ping That should give you a jup start. You may need to increase the corosync token. Kind regards, Jelle de Jong ___ Pacemaker mailing list: Pacemaker@... http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker Thanks for the help my question is I had gone through the scripts, where i found in ping_update method there is a variable called ACTIVE no.of nodes(host_list) active based on this value, for our scenario, can we stop the heartbeat/pacemaker process, when the host node cannot ping any other nodes in the cluster frame work. no. host_list should never contain the addresses of cluster nodes. the ping RA is intended to test _external_ connectivity. provide me your suggestion so that it will help us in our context. Regards Rakesh ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
[Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.
Hello Everybody How can we detect network failure in pacemaker configuration. where my to nodes in cluster frame work are as follows two network routers connected via switch as mediator for communication. how can we detect network failure and stop the heartbeat processes, when i shutdown the interface. i am seeing a split brain issue.where heart beat is started on two nodes and each acting as two different heartbeat process. I had configured pingd resource which comes with pacemaker, as clone processes when there is a network failure, i see there is a split brain issue where the heart beat processes are starting separately on both nodes, my question is here is, is there any way to stop the heart beat process while pingd on a particular node gives that there is communication between the interface and node where HA is running. Regards rakesh ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
Re: [Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.
On 20-04-11 11:44, rakesh k wrote: How can we detect network failure in pacemaker configuration. http://www.clusterlabs.org/wiki/Pingd_with_resources_on_different_networks http://www.woodwose.net/thatremindsme/2011/04/the-pacemaker-ping-resource-agent/ http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre crm configure help location crm ra info ocf:ping That should give you a jup start. You may need to increase the corosync token. Kind regards, Jelle de Jong ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
Re: [Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.
Jelle de Jong jelledejong@... writes: Hi Jelle de Jong On 20-04-11 11:44, rakesh k wrote: How can we detect network failure in pacemaker configuration. http://www.clusterlabs.org/wiki/Pingd_with_resources_on_different_networks http://www.woodwose.net/thatremindsme/2011/04/the-pacemaker-ping-resource-agent/ http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre crm configure help location crm ra info ocf:ping That should give you a jup start. You may need to increase the corosync token. Kind regards, Jelle de Jong ___ Pacemaker mailing list: Pacemaker@... http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker Thanks for the help my question is I had gone through the scripts, where i found in ping_update method there is a variable called ACTIVE no.of nodes(host_list) active based on this value, for our scenario, can we stop the heartbeat/pacemaker process, when the host node cannot ping any other nodes in the cluster frame work. provide me your suggestion so that it will help us in our context. Regards Rakesh ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker