Ken, very much appreciate your help on this - I am wondering what you might
have gleaned from the logs.
Thanks!
On Mon, Feb 8, 2021 at 2:43 PM Stuart Massey wrote:
> Wonderful, thank you for looking at this!
> I have posted uncompressed "saving inputs" files at the links below - 3241
> is the im
Wonderful, thank you for looking at this!
I have posted uncompressed "saving inputs" files at the links below - 3241
is the immediately preceding one that exists, and 3242 is the one created
upon encountering the problem state. In both cases, it looks to me like
node02 is DC. There are none of thes
On Mon, 2021-02-08 at 12:01 -0500, Stuart Massey wrote:
> I'm wondering if anyone can advise us on next steps here and/or
> correct our understanding. This seems like a race condition that
> causes resources to be stopped unnecessarily. Is there a way to
> prevent a node from processing cib updates
I'm wondering if anyone can advise us on next steps here and/or correct our
understanding. This seems like a race condition that causes resources to be
stopped unnecessarily. Is there a way to prevent a node from processing cib
updates from a peer while DC negotiations are underway? Our "node02" is
Sequence seems to be:
- node02 is DC and master/primary, node01 is maintenance mode and
slave/secondary
- comms go down
- node01 elects itself master, and deletes node01 status from its cib
- comms come up
- cluster starts reforming
- node01 sends cib updates to node02
- DC
On Mon, 2021-02-01 at 11:09 -0500, Stuart Massey wrote:
> Hi Ken,
> Thanks. In this case, transient_attributes for node02 in the cib on
> node02 which never lost quorum seem to be deleted by a request from
> node01 when node01 rejoins the cluster - IF I understand the
> pacemaker.log correctly. Thi
On Mon, 2021-02-01 at 11:16 -0500, Stuart Massey wrote:
> Andrei,
> You are right, thank you. I have an earlier thread on which I posted
> a pacemaker.log for this issue, and didn't think to point to it here.
> The link is
> http://project.ibss.net/samples/deidPacemakerLog.2021-01-25.txtxt .
> So,
Andrei,
You are right, thank you. I have an earlier thread on which I posted a
pacemaker.log for this issue, and didn't think to point to it here.
The link is http://project.ibss.net/samples/deidPacemakerLog.2021-01-25.txt
.
So, node01 is in maintenance mode, and constraints prevent any resources
Hi Ken,
Thanks. In this case, transient_attributes for node02 in the cib on node02
which never lost quorum seem to be deleted by a request from node01 when
node01 rejoins the cluster - IF I understand the pacemaker.log correctly.
This causes node02 to stop resources, which will not be restarted unt
On Mon, 2021-02-01 at 09:58 -0600, Ken Gaillot wrote:
> On Fri, 2021-01-29 at 12:37 -0500, Stuart Massey wrote:
> > Can someone help me with this?
> > Background:
> > > "node01" is failing, and has been placed in "maintenance" mode.
> > > It
> > > occasionally loses connectivity.
> > > "node02" is
On Fri, 2021-01-29 at 12:37 -0500, Stuart Massey wrote:
> Can someone help me with this?
> Background:
> > "node01" is failing, and has been placed in "maintenance" mode. It
> > occasionally loses connectivity.
> > "node02" is able to run our resources
>
> Consider the following messages from pace
29.01.2021 20:37, Stuart Massey пишет:
> Can someone help me with this?
> Background:
>
> "node01" is failing, and has been placed in "maintenance" mode. It
> occasionally loses connectivity.
>
> "node02" is able to run our resources
>
> Consider the following messages from pacemaker.log on "nod
Can someone help me with this?
Background:
"node01" is failing, and has been placed in "maintenance" mode. It
occasionally loses connectivity.
"node02" is able to run our resources
Consider the following messages from pacemaker.log on "node02", just after
"node01" has rejoined the cluster (per "
13 matches
Mail list logo