________________________________

From: [email protected] on behalf of Igor Chudov
Sent: Thu 8/5/2010 9:47 PM
To: General Linux-HA mailing list
Subject: Re: [Linux-HA] Heartbeat does not take over if BOTH machines 
arebootedat the same time



On Thu, Aug 5, 2010 at 6:32 PM, Pushkar Pradhan <[email protected]> wrote:
> I set up two Ubuntu Lucid machines to serve as a two-node Heartbeat
> cluster without Corosync.
>
> They support a DRBD service, IP address, NFS and Samba services.
>
> Things mostly work, and if I reboot one server, the other takes over.
>
> What does NOT work is that if I reboot both, then *neither* takes
> over. When they are in this state -- both running and none active --
> if I reboot one of them, then the other begins to work.
>
> This is becoming a real embarrassment for me at work and I would love
> to get some help.
>
> haresources:
> pfs-srv3 drbddisk::r0 Filesystem::/dev/drbd0::/pfs::ext3 10.1.8.45/24
> nfs-kernel-server smbd
> pfs-srv4
>
> ha.cf:
> use_logd on
> udpport 12694
> keepalive 1
> warntime 15
> deadtime 20
> debug 1
> initdead 60
> bcast eth1
> node pfs-srv3
> node pfs-srv4
> auto_failback on
> crm off
>
>
> Can you experiment with a really large initdead time like 2 or 5 minutes? 
> Also see if it helps to do unicast messaging?

Larger initdead does not help. I will try unicast tomorrow but I doubt
it will help.

Pushkar, could someone or someone else suggest some tools to trouble
shoot this issue?

Right now I am poking in the dark.


Igor,

Sorry to hear that. Any luck with unicast messaging? I am interested in helping 
you, if you want we can take this discussion offline, i.e. off the HA mailing 
list.

pushkar

 

<<winmail.dat>>

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to