Re: [Pacemaker] On recovery of failed node, pengine fails to correctly monitor 'dirty' resources

2014-08-12 Thread Jarred Griggles
This seems to happen in every resource agent we've been using -- including ClusterMon, anything, etc. These resource agents also aren't being called when these errors appear -- though the logs in /var/log/messages would indicate as if they were called. The calls may differ that are being display

[Pacemaker] On recovery of failed node, pengine fails to correctly monitor 'dirty' resources

2014-08-11 Thread Jarred Griggles
Greetings, We are using pacemaker and cman in a two-node cluster with no-quorum-policy: ignore and stonith-enabled: false on a Centos 6 system (pacemaker related RPM versions are listed below).  We are seeing some bizarre (to us) behavior when a node is fully lost (e.g. reboot -nf ).  Here's t