Re: [ClusterLabs] pcsd processes using 100% CPU

2018-05-22 Thread Shobe, Casey
> Not really enough info to debug... And I don't think I encountered this > myself. Is there something more I can do to gather more information when this happens? I'm not very familiar with strace, just ran it on the PID and saw the screen fill up with sched_yield() lines... This happens

[ClusterLabs] Antw: Re: pcsd processes using 100% CPU

2018-05-22 Thread Ulrich Windl
>>> Jan Pokorný schrieb am 22.05.2018 um 19:09 in Nachricht <20180522170924.gc2...@redhat.com>: > On 18/05/18 20:04 +, Shobe, Casey wrote: >> On a couple clusters that have been running for a little while >> (without fencing), I'm seeing runaway server.rb processes using

Re: [ClusterLabs] How to set up fencing/stonith

2018-05-22 Thread Casey & Gina
> It does exactly what you told it to do. If you want to power-on VM on > reset instead, remove RESETPOWERON parameter. Sorry, that was a part of the command that I found in /usr/share/doc/cluster-glue/stonith/README.vcenter, as well as on

Re: [ClusterLabs] How to set up fencing/stonith

2018-05-22 Thread Casey & Gina
In the meantime, I thought I'd try running the fence_vmware_soap command, but it doesn't seem to be working, despite me using the same credentials that worked with the external/vcenter plugin. Is there a way to get more debugging information about why it says unable to connect/login? The

Re: [ClusterLabs] How to set up fencing/stonith

2018-05-22 Thread Casey & Gina
> On May 18, 2018, at 1:29 PM, Ken Gaillot wrote: >> Perhaps there is a bug in the packaging? > > It sounds like it, or perhaps a portability issue in the agent itself. There were missing dependencies. I've resolved that, so now am coming back to trying this...

Re: [ClusterLabs] How to set up fencing/stonith

2018-05-22 Thread Casey & Gina
> There are missing dependencies in Ubuntu 16.04, see > https://github.com/ClusterLabs/pcs/issues/168 > for details. Thank you! > It may be worth filing a bug against Ubuntu. I did that already when I sent this E-mail, to which they suggested the same fix. I have shared the above link in that

Re: [ClusterLabs] pcsd processes using 100% CPU

2018-05-22 Thread Casey & Gina
> Can you share some HW specs with us, at least the architecture > to start with -- x86_64=amd64, arm (gen/mode?), something else? It's x86_64, running Ubuntu 16.04; the latest package versions available from Ubuntu repositories. They are vmWare ESX nodes with 16 CPU cores and 64GB of memory

Re: [ClusterLabs] pcsd processes using 100% CPU

2018-05-22 Thread Jan Pokorný
On 18/05/18 20:04 +, Shobe, Casey wrote: > On a couple clusters that have been running for a little while > (without fencing), I'm seeing runaway server.rb processes using 100% > of a single CPU core each. > > When I look at ps, I can see that these have something to do with > pcsd: > > USER

Re: [ClusterLabs] ethmonitor RA agent error. How can I fix this? (RHEL)

2018-05-22 Thread Ken Gaillot
On Tue, 2018-05-22 at 09:15 +0800, Confidential Company wrote: > I have two Virtual machines with two network interfaces.  > > See configuration below: > > *eth0 - service network > *eth1 - heartbeat network > > *vi /etc/hosts - RhelA(ip of eth1) / RhelB(ip of eth1)  > *service firewalld stop >