Hi,
We changed totem.token from 3s to 60s. Then something strange were observed,
such as unexpected node offline.
I read corosync.conf manpage, but still don't understand the reason.
Can anyone explain this? or maybe our conf is broken?
Our corosync.conf:
compatibility: whitetank
quorum {
Hi,
Occasionally, I find my cluster with one pending action not being executed for
some minutes (I guess until the "PEngine Recheck Timer" elapse).
Running "crm_simulate -SL" shows the pending actions.
I'm still confused about how it can happens, why it happens and how to avoid
this.
Earlier
On 03/06/2017 08:29 PM, cys wrote:
> At 2017-03-07 05:47:19, "Ken Gaillot" wrote:
>> To figure out why a resource was stopped, you want to check the logs on
>> the DC (which will be the node with the most "pengine:" messages around
>> that time). When the PE decides a
On 07/03/17 05:09 AM, Jan Pokorný wrote:
> On 06/03/17 17:12 -0500, Digimer wrote:
>> The old FenceAgentAPI document on fedorahosted is gone now that fedora
>> hosted is closed. So I created a copy on the clusterlabs wiki:
>>
>> http://wiki.clusterlabs.org/wiki/FenceAgentAPI
>
> Note that just
Ulrich,
Thank you very much for your feedback.
You wrote, "Could it be you forgot "allow-migrate=true" at the resource
level or some migration IP address at the node level?
I only have SLES11 here..."
I know for sure that the pacemaker remote node (zs95kjg110102) I mentioned
below is
On 06/03/17 17:12 -0500, Digimer wrote:
> The old FenceAgentAPI document on fedorahosted is gone now that fedora
> hosted is closed. So I created a copy on the clusterlabs wiki:
>
> http://wiki.clusterlabs.org/wiki/FenceAgentAPI
Note that just few days ago I've announced that the page has