its almost always a firewall. try stopping the firewall completely and see if the problem persists.
On 5/8/07, Eric Marcus <[EMAIL PROTECTED]> wrote:
Hello, I am new to HA2 and am having some configuration issues. I installed HA2 (2.0.8-1) on two Suse 10 (SLES10) machines using Alan's Education Project Screencast (http://www.linux-ha.org/Education/Newbie/InstallHeartbeatScreencast) I think I have a node configuration issue even though it is in ha.cf. I am very familiar with Novell Cluster Services. The problem I outline below makes me think that both of the nodes are trying to be the "Master" but I don't how to fix this. I've spent a week on this and am feeling very stupid! Here goes..... My ha.cf file for the 2 servers shows use_logd yes bcast eth1 node it-mgatedom it-mgatedomc crm on The logd.cf shows logfacility daemon The authkeys show auth 1 1 sha1 cluster1 Now, when I start it up on IT-MGATEDOM, it shows "done" crm_mon shows only 1 node configured and after a couple minutes the "Current DC: NONE" becomes "Current DC: it-mgatedom" with 0 resources configured. It still shows 1 node, not 2. Then I go to IT-MGATEDOMC to start it up...... It says "done" and when I do a tail /var/log/message I see this it-mgatedomc:~ # /etc/init.d/heartbeat start Starting High-Availability services: done it-mgatedomc:~ # tail /var/log/messages May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating. May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up' May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up. May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up. it-mgatedomc:~ # tail /var/log/messages May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating. May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up' May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up. May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up. it-mgatedomc:~ # tail /var/log/messages May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating. May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up' May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up. May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up. it-mgatedomc:~ # tail /var/log/messages May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating. May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up' May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up. May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up. it-mgatedomc:~ # tail /var/log/messages May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp failed, recreating. May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat started on port 694 (694) interface eth1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartb eat closed on port 694 interface eth1 - Status: 1 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17 May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: ' up' May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up. May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it- mgatedom: status active May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up. it-mgatedomc:~ # tail /var/log/messages May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0 May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hi seq=103 May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedo m May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0 May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0 May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hi seq=104 May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedo m May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0 May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: The line that says "expecting from it-mgatedom" confuses me. crm_mon shows "Not Connected". netstat -n -l | grep 694 shows that udp 694 is there. The strange thing is if I stop both of them and start it on IT-MGATEDOMC first, then it will come up just fine and then when I start it on IT-MGATEDOM, it has the above issue. Any ideas? Thank you, Eric... _______________________________________________ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
_______________________________________________ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems