Hello, We have several clusters with the Fedora FC10, drbd 8.2.7, heartbeat 2.1.3, that works perfectly, I am preparing an upgrade of the clusters.
I upgraded the test cluster to the Fedora FC12, drbd 8.3.7 (custom build, FC12 comes with drbd 8.3.6 without the kernel module...) and hearbeat 3.0.0 (from FC12 heartbeat-3.0.0-0.7.0daab7da36a8.hg.fc12.i686 ). When I move manually (hb_stanby) packages between the nodes I have no problem, but when I crash one node the other node do not take the packages owned by the crashed node, I have no idea what append. Here the log of the up node. .... Jun 02 09:06:28 noeud1.apec.fr heartbeat: [3295]: info: Link noeud2.apec.fr:bond0 dead. Jun 02 09:06:28 noeud1.apec.fr heartbeat: [3295]: info: Link noeud2.apec.fr:eth2 dead. Jun 02 09:07:19 noeud1.apec.fr heartbeat: [3295]: WARN: node noeud2.apec.fr: is dead Jun 02 09:07:19 noeud1.apec.fr heartbeat: [3295]: WARN: No STONITH device configured. Jun 02 09:07:19 noeud1.apec.fr heartbeat: [3295]: WARN: Shared disks are not protected. Jun 02 09:07:19 noeud1.apec.fr heartbeat: [3295]: info: Resources being acquired from noeud2.apec.fr. Jun 02 09:07:19 noeud1.apec.fr heartbeat: [15868]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL Jun 02 09:07:19 noeud1.apec.fr harc[15868]: [15879]: info: Running /etc/ha.d//rc.d/status status Jun 02 09:07:19 noeud1.apec.fr heartbeat: [3295]: debug: StartNextRemoteRscReq(): child count 1 Jun 02 09:07:19 noeud1.apec.fr heartbeat: [15869]: info: Local Resource acquisition completed. Jun 02 09:07:19 noeud1.apec.fr heartbeat: [3295]: debug: StartNextRemoteRscReq(): child count 1 Jun 02 09:07:19 noeud1.apec.fr heartbeat: [3295]: debug: StartNextRemoteRscReq(): child count 1 Jun 02 09:07:19 noeud1.apec.fr heartbeat: [15914]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL Jun 02 09:07:19 noeud1.apec.fr harc[15914]: [15920]: info: Running /etc/ha.d//rc.d/ip-request-resp ip-request-resp Jun 02 09:07:19 noeud1.apec.fr ip-request-resp[15914]: [15926]: received ip-request-resp IPaddr::192.168.10.250 OK yes Jun 02 09:07:19 noeud1.apec.fr heartbeat: [15928]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL Jun 02 09:07:19 noeud1.apec.fr harc[15928]: [15934]: info: Running /etc/ha.d//rc.d/ip-request-resp ip-request-resp Jun 02 09:07:19 noeud1.apec.fr ip-request-resp[15928]: [15940]: received ip-request-resp IPaddr::192.168.10.252 OK yes [Nothing after] Here the file ha.cf keepalive 2 deadtime 60 warntime 10 initdead 120 udpport 694 ucast bond0 10.1.1.2 ucast eth2 192.168.10.254 auto_failback off node noeud1.apec.fr node noeud2.apec.fr ping_group network10 192.168.10.1 deadping 10 hbgenmethod time realtime on apiauth ipfail uid=hacluster apiauth ccm uid=hacluster apiauth cms uid=hacluster apiauth ping gid=haclient uid=root apiauth default gid=haclient msgfmt classic use_logd yes conn_logd_time 60 compression bz2 The logd.cf debugfile /var/log/ha-debug logfile /var/log/ha-log logfacility local1 The haresources noeud1.apec.fr IPaddr::192.168.10.250 drbddisk::named Filesystem::/dev/drbd0::/named::ext3 noeud1.apec.fr IPaddr::192.168.10.252 drbddisk::dhcpd Filesystem::/dev/drbd1::/dhcpd::ext3 noeud2.apec.fr IPaddr::192.168.10.251 drbddisk::named2 Filesystem::/dev/drbd2::/named2::ext3 noeud2.apec.fr IPaddr::192.168.10.249 drbddisk::samba Filesystem::/dev/drbd3::/samba::ext3 smb Best regards. Francis _______________________________________________ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems