Re: [Pacemaker] Strange DRBD error in cluster operation

2011-09-06 Thread Michael Schwartzkopff
 On Thu, Sep 01, 2011 at 02:59:56PM +0200, Michael Schwartzkopff wrote:
  Hi,
  
  from time to time we see the DRBD M/S resource failing on one of our
  clusters.
  
  From the logs we see that the monitoring fail with rc=5 (not_installed)
  and the log entry:
  
  lrmd: [2454]: info: RA output: (resDRBD:1:monitor:stderr)
  /etc/drbd.conf:3: Failed to open include file
  'drbd.d/global_common.conf'.
  
  This happens about once per week and causes constant trouble.
  
  Any ideas what might be the reason for this behavior?
 
 You periodically re-create that file from some recipe,
 and it so happens that at the time of the monitor,
 it is not there?

Of course, this also was my first thought.

The file is managed by cfengine, but the guys in charge for cfengine swear that 
it does not interfere with the monitoring.

So I wanted to ask if there is another known reason for this behavior, besides 
the obvious.

Thanks.

-- 
Dr. Michael Schwartzkopff
Guardinistr. 63
81375 München

Tel: (0163) 172 50 98
Fax: (089) 620 304 13


signature.asc
Description: This is a digitally signed message part.
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] Strange DRBD error in cluster operation

2011-09-05 Thread Lars Ellenberg
On Thu, Sep 01, 2011 at 02:59:56PM +0200, Michael Schwartzkopff wrote:
 Hi,
 
 from time to time we see the DRBD M/S resource failing on one of our clusters.
 
 From the logs we see that the monitoring fail with rc=5 (not_installed) and 
 the log entry:
 
 lrmd: [2454]: info: RA output: (resDRBD:1:monitor:stderr) /etc/drbd.conf:3: 
 Failed to open include file 'drbd.d/global_common.conf'.
 
 This happens about once per week and causes constant trouble.
 
 Any ideas what might be the reason for this behavior?

You periodically re-create that file from some recipe,
and it so happens that at the time of the monitor,
it is not there?


-- 
: Lars Ellenberg
: LINBIT | Your Way to High Availability
: DRBD/HA support and consulting http://www.linbit.com

___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker


Re: [Pacemaker] Strange DRBD error in cluster operation

2011-09-05 Thread Michael Schwartzkopff
 On Thu, Sep 01, 2011 at 02:59:56PM +0200, Michael Schwartzkopff wrote:
  Hi,
  
  from time to time we see the DRBD M/S resource failing on one of our
  clusters.
  
  From the logs we see that the monitoring fail with rc=5 (not_installed)
  and the log entry:
  
  lrmd: [2454]: info: RA output: (resDRBD:1:monitor:stderr)
  /etc/drbd.conf:3: Failed to open include file
  'drbd.d/global_common.conf'.
  
  This happens about once per week and causes constant trouble.
  
  Any ideas what might be the reason for this behavior?
 
 You periodically re-create that file from some recipe,
 and it so happens that at the time of the monitor,
 it is not there?

Of course, this also was my first thought.

The file is managed by cfengine, but the guys in charge for cfengine swear that 
it does not interfere with the monitoring.

So I wanted to ask if there is another known reason for this behavior, besides 
the obvious.

Thanks.

-- 
Dr. Michael Schwartzkopff
Guardinistr. 63
81375 München

Tel: (0163) 172 50 98


signature.asc
Description: This is a digitally signed message part.
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker