Hello, I am new to HA2 and am having some configuration issues. I installed
HA2 (2.0.8-1) on two SUSE 10 (SLES10) machines, following Alan's Education
Project Screencast
(http://www.linux-ha.org/Education/Newbie/InstallHeartbeatScreencast)

I think I have a node configuration issue, even though both nodes are listed
in ha.cf. I am very familiar with Novell Cluster Services. The problem I
outline below makes me think that both of the nodes are trying to be the
"Master", but I don't know how to fix this. I've spent a week on this and am
feeling very stupid! Here goes...

My ha.cf file for the 2 servers shows

use_logd yes
bcast eth1
node it-mgatedom it-mgatedomc
crm on
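
As far as I understand, every name on the node line has to match what
`uname -n` prints on that machine, so that is one thing I can verify on each
box. A minimal Python sketch of that check follows; the function names are
hypothetical (my own, not part of Heartbeat), and the case-insensitive
comparison is an assumption.

```python
import platform


def nodes_from_hacf(text):
    """Collect every name listed on 'node' directives in ha.cf-style text."""
    nodes = []
    for raw in text.splitlines():
        # Drop '#' comments and surrounding whitespace before splitting.
        fields = raw.split("#", 1)[0].split()
        if fields and fields[0] == "node":
            nodes.extend(fields[1:])
    return nodes


def local_node_listed(text, hostname=None):
    """True if hostname (default: this machine's node name) is on a node line."""
    if hostname is None:
        # Normally the same value that `uname -n` reports.
        hostname = platform.node()
    # Assumption: node names are compared without regard to case.
    wanted = hostname.lower()
    return wanted in (name.lower() for name in nodes_from_hacf(text))
```

Reading /etc/ha.d/ha.cf (assuming the default location) and passing its
contents to local_node_listed() should return True on both machines.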


The logd.cf shows

logfacility     daemon


The authkeys show

auth 1
1 sha1 cluster1
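
My reading of this file is that "auth 1" selects key number 1, which uses sha1
with the secret "cluster1", and that both nodes need the same active key. A
rough Python sketch of that reading is below; active_auth is a hypothetical
helper of mine, not a Heartbeat tool, and it assumes '#' starts a comment.

```python
def active_auth(text):
    """Return (method, secret) for the key selected by the 'auth N' line."""
    selected = None
    keys = {}
    for raw in text.splitlines():
        fields = raw.split("#", 1)[0].split()
        if not fields:
            continue
        if fields[0] == "auth" and len(fields) == 2:
            # "auth <index>" names the key that is currently in use.
            selected = fields[1]
        elif len(fields) >= 2:
            # "<index> <method> [<secret>]"; the secret may be absent.
            secret = fields[2] if len(fields) > 2 else None
            keys[fields[0]] = (fields[1], secret)
    if selected is None or selected not in keys:
        raise ValueError("auth line does not select a defined key")
    return keys[selected]
```

Comparing the result from each node's authkeys file would show whether the two
machines agree on the method and secret.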


Now, when I start Heartbeat on IT-MGATEDOM, the init script reports "done".

crm_mon shows only 1 node configured, and after a couple of minutes the
"Current DC: NONE" becomes "Current DC: it-mgatedom" with 0 resources
configured. It still shows 1 node, not 2.

Then I go to IT-MGATEDOMC to start it up. It says "done", and when I do a
tail /var/log/messages I see this:



it-mgatedomc:~ # /etc/init.d/heartbeat start
Starting High-Availability services:
                                                                     done

it-mgatedomc:~ # tail /var/log/messages
May  8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler:  Added signal manual handler
May  8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler:  Added signal manual handler
May  8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbeat/rsctmp failed, recreating.
May  8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1
May  8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1
May  8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17
May  8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: 'up'
May  8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up.
May  8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it-mgatedom: status active
May  8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up.
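
The lines that seem to matter in that output are the "Link ... up" and "Status
update for node" entries. To pull just those out of a longer log, something
like the following Python sketch could be used. It is only an illustration:
membership_events is a hypothetical name, the patterns are based solely on the
messages shown above, and it assumes each log entry sits on a single line, as
in the file itself.

```python
import re

# Patterns modelled only on the heartbeat messages quoted in this post.
LINK_UP = re.compile(r"info: Link (\S+):(\S+) up\.")
NODE_STATUS = re.compile(r"info: Status update for node (\S+): status (\S+)")


def membership_events(lines):
    """Yield ('link', node, iface) and ('status', node, state) tuples."""
    for line in lines:
        match = LINK_UP.search(line)
        if match:
            yield ("link", match.group(1), match.group(2))
            continue
        match = NODE_STATUS.search(line)
        if match:
            yield ("status", match.group(1), match.group(2))
```

Feeding it the lines of /var/log/messages on each node would show which peers
that node has actually heard from.
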
it-mgatedomc:~ # tail /var/log/messages
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hiseq=103
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedom
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug:
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hiseq=104
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedom
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0
May  8 12:07:06 it-mgatedomc heartbeat: [4514]: debug:



The line that says "expecting from it-mgatedom" confuses me.

crm_mon shows "Not Connected".

netstat -n -l | grep 694 shows that udp 694 is there.

The strange thing is that if I stop both of them and start Heartbeat on
IT-MGATEDOMC first, it comes up just fine there, and then IT-MGATEDOM shows
the above issue when I start it.

Any ideas?

Thank you, 
Eric...
