[ClusterLabs] corosync totem.token too long may cause pacemaker(cluster) unstable?

2017-03-07 Thread cys
Hi, We changed totem.token from 3s to 60s. Then some strange behavior was observed, such as nodes unexpectedly going offline. I read the corosync.conf manpage, but still don't understand the reason. Can anyone explain this? Or maybe our conf is broken? Our corosync.conf: compatibility: whitetank quorum {

[ClusterLabs] pending actions

2017-03-07 Thread Jehan-Guillaume de Rorthais
Hi, Occasionally, I find my cluster with one pending action not being executed for some minutes (I guess until the "PEngine Recheck Timer" elapses). Running "crm_simulate -SL" shows the pending actions. I'm still confused about how it can happen, why it happens and how to avoid this. Earlier

Re: [ClusterLabs] resource was disabled automatically

2017-03-07 Thread Ken Gaillot
On 03/06/2017 08:29 PM, cys wrote:
> At 2017-03-07 05:47:19, "Ken Gaillot" wrote:
>> To figure out why a resource was stopped, you want to check the logs on
>> the DC (which will be the node with the most "pengine:" messages around
>> that time). When the PE decides a

Re: [ClusterLabs] FenceAgentAPI

2017-03-07 Thread Digimer
On 07/03/17 05:09 AM, Jan Pokorný wrote:
> On 06/03/17 17:12 -0500, Digimer wrote:
>> The old FenceAgentAPI document on fedorahosted is gone now that fedora
>> hosted is closed. So I created a copy on the clusterlabs wiki:
>>
>> http://wiki.clusterlabs.org/wiki/FenceAgentAPI
>
> Note that just

Re: [ClusterLabs] Antw: Expected recovery behavior of remote-node guest when corosync ring0 is lost in a passive mode RRP config?

2017-03-07 Thread Scott Greenlese
Ulrich, Thank you very much for your feedback. You wrote, "Could it be you forgot "allow-migrate=true" at the resource level or some migration IP address at the node level? I only have SLES11 here..." I know for sure that the pacemaker remote node (zs95kjg110102) I mentioned below is

Re: [ClusterLabs] FenceAgentAPI

2017-03-07 Thread Jan Pokorný
On 06/03/17 17:12 -0500, Digimer wrote:
> The old FenceAgentAPI document on fedorahosted is gone now that fedora
> hosted is closed. So I created a copy on the clusterlabs wiki:
>
> http://wiki.clusterlabs.org/wiki/FenceAgentAPI
Note that just a few days ago I announced that the page has