On Tue, 2018-05-22 at 09:15 +0800, Confidential Company wrote:
> I have two Virtual machines with two network interfaces.
>
> See configuration below:
>
> *eth0 - service network
> *eth1 - heartbeat network
>
> *vi /etc/hosts - RhelA(ip of eth1) / RhelB(ip of eth1)
> *service firewalld stop
>
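A minimal sketch of how the /etc/hosts entries described above could be sanity-checked on each node. The addresses below are placeholders from the documentation range (the original message does not give the eth1 addresses), and `parse_hosts` / `check_nodes` are hypothetical helper names, not part of any cluster tooling.

```python
def parse_hosts(text):
    """Map each hostname or alias found in hosts-file text to its IP address."""
    mapping = {}
    for line in text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        ip, *names = line.split()
        for name in names:
            # Keep the first address seen for a name.
            mapping.setdefault(name, ip)
    return mapping


def check_nodes(text, expected):
    """Return the node names whose entry is missing or differs from `expected`."""
    mapping = parse_hosts(text)
    return [name for name, ip in expected.items() if mapping.get(name) != ip]


# Placeholder addresses; substitute the real eth1 (heartbeat) addresses.
SAMPLE = """\
127.0.0.1   localhost
192.0.2.11  RhelA   # assumed eth1 address of the first node
192.0.2.12  RhelB   # assumed eth1 address of the second node
"""

if __name__ == "__main__":
    wanted = {"RhelA": "192.0.2.11", "RhelB": "192.0.2.12"}
    print(check_nodes(SAMPLE, wanted))
```

On a real node, the text would come from reading /etc/hosts instead of `SAMPLE`; an empty list means both node names map to the expected heartbeat addresses.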
On 18/05/18 20:04 +, Shobe, Casey wrote:
> On a couple clusters that have been running for a little while
> (without fencing), I'm seeing runaway server.rb processes using 100%
> of a single CPU core each.
>
> When I look at ps, I can see that these have something to do with
> pcsd:
>
> USER
> Can you share some HW specs with us, at least the architecture
> to start with -- x86_64=amd64, arm (gen/mode?), something else?
It's x86_64, running Ubuntu 16.04 with the latest package versions available
from the Ubuntu repositories. They are VMware ESX nodes with 16 CPU cores and
64 GB of memory co
> There are missing dependencies in Ubuntu 16.04, see
> https://github.com/ClusterLabs/pcs/issues/168
> for details.
Thank you!
> It may be worth filing a bug against Ubuntu.
I had already done that when I sent this e-mail, and they suggested the same
fix. I have shared the above link in that
> On May 18, 2018, at 1:29 PM, Ken Gaillot wrote:
>> Perhaps there is a bug in the packaging?
>
> It sounds like it, or perhaps a portability issue in the agent itself.
There were missing dependencies. I've resolved that, so I'm now coming back to
trying this...
pcmk_host_list="" - not su
In the meantime, I thought I'd try running the fence_vmware_soap command, but
it doesn't seem to be working, despite me using the same credentials that
worked with the external/vcenter plugin. Is there a way to get more debugging
information about why it says unable to connect/login? The error
> It does exactly what you told it to do. If you want to power on the VM on
> reset instead, remove the RESETPOWERON parameter.
Sorry, that was a part of the command that I found in
/usr/share/doc/cluster-glue/stonith/README.vcenter, as well as on
https://www.hastexo.com/resources/hints-and-kinks/fencin
>>> Jan Pokorný wrote on 22.05.2018 at 19:09 in message
<20180522170924.gc2...@redhat.com>:
> On 18/05/18 20:04 +, Shobe, Casey wrote:
>> On a couple clusters that have been running for a little while
>> (without fencing), I'm seeing runaway server.rb processes using 100%
>> of a single CP
> Not really enough info to debug... And I don't think I encountered this
> myself.
Is there something more I can do to gather more information when this happens?
I'm not very familiar with strace, just ran it on the PID and saw the screen
fill up with sched_yield() lines... This happens acro
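
One way to gather more than a screen full of sched_yield() lines is to save the strace output to a file and tally the calls. The sketch below assumes the default strace line format (an optional "[pid N]" or PID prefix, then the syscall name and an opening parenthesis); `tally_syscalls` is a hypothetical helper and the sample lines are invented for illustration. strace can also produce a per-syscall summary itself with its `-c` option.

```python
import re
from collections import Counter

# Syscall name at the start of an strace line, optionally preceded by a
# "[pid N]" prefix or a bare PID column.
_SYSCALL = re.compile(r"^(?:\[pid\s+\d+\]\s+|\d+\s+)?([a-z_][a-z0-9_]*)\(")


def tally_syscalls(lines):
    """Count how often each syscall name appears in strace output lines."""
    counts = Counter()
    for line in lines:
        match = _SYSCALL.match(line.strip())
        if match:
            counts[match.group(1)] += 1
    return counts


# Invented sample in strace's usual format, for illustration only.
SAMPLE = [
    "sched_yield()                           = 0",
    "sched_yield()                           = 0",
    "[pid  1234] futex(0x7f00, FUTEX_WAKE_PRIVATE, 1) = 0",
    "sched_yield()                           = 0",
]

if __name__ == "__main__":
    for name, count in tally_syscalls(SAMPLE).most_common():
        print(f"{name}: {count}")
```

If nearly every line is sched_yield(), the process is most likely spinning in a busy-wait loop, which fits the 100% CPU usage and is a useful detail to include in a bug report.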