Hi,

On Tue, Apr 08, 2008 at 08:18:14PM +0200, Bernd Schubert wrote:
> On Tuesday 08 April 2008 19:32:58 Bernd Schubert wrote:
> > Hello,
> >
> > I need to set a rather huge dead time of 1200s, but the initial dead time
> > is supposed to be of 120s or less. However, heartbeat tries to be
> > schoolmasterly and doesn't want to accept my settings:
> >
> > deadtime 1200 # time to declare a node dead
> > initdead 120  # time to declare a node dead on heartbeat startup
> > keepalive 120 # how often to send keepalive packets
> >
> >
> > heartbeat[6523]: 2008/04/08_19:23:16 ERROR: Initial dead time [120000] is
> > smaller than deadtime [1200000]
> > eartbeat[6523]: 2008/04/08_19:23:16 ERROR: Configuration error, heartbeat
> > not started.
> >
> >
> > Well, heartbeat is not startup up automatically here and even the nodes are
> > not powered on automatically after a hard reset. So when I start heartbeat
> > I'm activeley monitoring everything and there is absolutely no need to let
> > me wait at least 20min on start up. I'm even not convinced a deadtime of
> > 20min is sufficient, since this is for a Lustre cluster and Lustre
> > sometimes manages to create such a high load that nothing else than the
> > Lustre and related kernel threads do work on the system...
> >
> > So pretty please, is there a setting allowing to override this ridiculous
> > initdead  time checking?

This looks like a valid use case. The patch to issue just a
warning instead of stopping with a configuration error you can
find here:

http://hg.linux-ha.org/dev/rev/01ef2fcb540e

Note that you'd have to adjust all timeouts to something larger
than the deadtime. It's reasonable to assume that everything else
is going to be delayed even more than heartbeats.

Thanks,

Dejan

> Doesn't look like the error can be overriden
> 
>         /* Check deadtime parameters */
>         if (config->initial_deadtime_ms < config->deadtime_ms) {
>                 ha_log(LOG_ERR
>                 ,       "Initial dead time [%ld] is smaller than"
>                 " deadtime [%ld]"
>                 ,       config->initial_deadtime_ms, config->deadtime_ms);
>                 ++errcount;
>         }else if (config->initial_deadtime_ms < 10000) {
> 
> 
> *grumble*
> 
> 
> 
> -- 
> Bernd Schubert
> Q-Leap Networks GmbH
> _______________________________________________
> Linux-HA mailing list
> Linux-HA@lists.linux-ha.org
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to