Re: [ClusterLabs] pcsd processes using 100% CPU

2018-05-22 Thread Shobe, Casey
> Not really enough info to debug... And I don't think I encountered this myself.
Is there something more I can do to gather more information when this happens? I'm not very familiar with strace, just ran it on the PID and saw the screen fill up with sched_yield() lines... This happens acro

[ClusterLabs] pcsd processes using 100% CPU

2018-05-18 Thread Shobe, Casey
On a couple clusters that have been running for a little while (without fencing), I'm seeing runaway server.rb processes using 100% of a single CPU core each. When I look at ps, I can see that these have something to do with pcsd: USER PID %CPU %MEM VSZ RSS TTY STAT START TIME

Re: [ClusterLabs] Frequent PAF log messages - Forbidding promotion on in state "startup"

2018-05-15 Thread Shobe, Casey
, copy in the recovery.conf, and `pcs cluster start` for each standby node needing rebuilt?
> On May 13, 2018, at 5:58 AM, Jehan-Guillaume de Rorthais wrote:
>
>> On Fri, 11 May 2018 16:25:18 +00

[ClusterLabs] Frequent PAF log messages - Forbidding promotion on in state "startup"

2018-05-11 Thread Shobe, Casey
I'm using PAF and my corosync log ends up filled with messages like this (about 3 times per minute for each standby node): pgsqlms(postgresql-10-main)[26822]: 2018/05/11_06:47:08 INFO: Forbidding promotion on "d-gp2-dbp63-1" in state "startup" pgsqlms(postgresql-10-main)[26822]: 2018/05

[ClusterLabs] PAF log messages - Forbidding promotion on in state "startup"

2018-05-11 Thread Shobe, Casey
I'm using PAF and my corosync log ends up filled with messages like this (about 3 times per minute for each standby node): pgsqlms(postgresql-10-main)[26822]: 2018/05/11_06:47:08 INFO: Forbidding promotion on "d-gp2-dbp63-1" in state "startup" pgsqlms(postgresql-10-main)[26822]: 2018/05