Hi, Note that FMS_PROMOTE_ACTIVE_TIMER and opensaf_reboot scripts are two platform adaptation attributes in OpenSAF w.r.t failover and fencing. An OpenSAF user can customize these in their deployments.
Upon receiving connection loss indication with the active controller, the STANDBY controller starts this promote active timer (see FMS_PROMOTE_ACTIVE_TIMER in /etc/opensaf/fmd.conf). This timer acts as a tolerance mechanism to handle or differentiate temporary link-flaps and false-positives in your network. Upon expiry of this timer, the STANDBY invokes opensaf_reboot script (with the intention to reboot the ACTIVE node) and subsequently promotes itself to ACTIVE. The opensaf_reboot script is an integration point for the OpenSAF user. So, during failover when this opensaf_reboot script is invoked the node information (node_id, PLM ee name) of the peer ACTIVE node is passed as input to this script. Inside this script, the user can modify so as to invoke 'commands' that will perform remote reboots of the old ACTIVE node. The 'commands' here could be an IPMI command or any STONITH agent/command. Cheers, Mathi. ----- [email protected] wrote: > When a system gets into split-brain scenario, both controllers assume > active role. How does a payload node distinguish which controller it > is associated to? Is there a way that we find out which payload nodes > connect to which controller? > > Our cluster needs to provide service 24x7. So restarting the cluster > is not possible when this situation occurs. What is the best way to > correct a split-brain situation? If we stop and restart one of the > controller nodes to allow it to rejoin the other controller, should we > also restart the payload nodes associated to that controller? Those > payload nodes should be stopped before stopping their associated > controller node, correct? > > Shu Wang > > > > > ________________________________ > The information transmitted herein is intended only for the person or > entity to which it is addressed and may contain confidential, > proprietary and/or privileged material. Any review, retransmission, > dissemination or other use of, or taking of any action in reliance > upon, this information by persons or entities other than the intended > recipient is prohibited. If you received this in error, please contact > the sender and delete the material from any computer. > ------------------------------------------------------------------------------ > Monitor Your Dynamic Infrastructure at Any Scale With Datadog! > Get real-time metrics from all of your servers, apps and tools > in one place. > SourceForge users - Click here to start your Free Trial of Datadog > now! > http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140 > _______________________________________________ > Opensaf-users mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/opensaf-users ------------------------------------------------------------------------------ Monitor Your Dynamic Infrastructure at Any Scale With Datadog! Get real-time metrics from all of your servers, apps and tools in one place. SourceForge users - Click here to start your Free Trial of Datadog now! http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140 _______________________________________________ Opensaf-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/opensaf-users
