On 4/29/2011 at 03:36 AM, "Stallmann, Andreas" <astallm...@conet.de> wrote: > Hi! > > In one of my clusters I disconnect one of the nodes (say app01) from the > network. App02 takes of the resources as it should. Nice. > When I reconnect app01 to the network, crm_mon on app01 continues to report > app02 as "offline" and crm_mon on app02 does the same for app01. Still, no > errors are reported for TOTEM in the logs, and corosync-cfgtool -s reports > both > rings as "active with no faults". > > When sniffing for multicast-packets, I see packets originating from app01 but > > not from app02.
Just on a punt... There's not a (partial) firewall running on app02 is there? Regards, Tim > Pinging the nodes (using ips or names) works for all interfaces. > > I'm at a loss. Any ideas? How can I debug what's happening between the two > nodes? And how can I bring an "offline" node online again without rebooting > or restarting corosync? > > Thanks in advance, > > Andreas - breaking any record in this mailing list in asking questions... > PS: corosync.conf below: > > compatibility: whitetank > aisexec { > user: root > group: root > } > service { > ver: 0 > name: pacemaker > use_mgmtd: yes > use_logd: yes > } > totem { > version: 2 > token: 5000 > token_retransmits_before_loss_const: 10 > join: 60 > consensus: 6000 > vsftype: none > max_messages: 20 > clear_node_high_bit: yes > secauth: off > threads: 0 > interface { > ringnumber: 0 > bindnetaddr: 10.10.10.0 > mcastaddr: 239.192.200.51 > mcastport: 5405 > } > interface { > ringnumber: 1 > bindnetaddr: 192.168.1.0 > mcastaddr: 239.192.200.52 > mcastport: 5405 > } > rrp_mode: active > } > logging { > fileline: off > to_stderr: no > to_logfile: no > to_syslog: yes > syslog_facility: daemon > debug: off > timestamp: off > } > amf { > mode: disabled > } > > ------------------------ > CONET Solutions GmbH, Theodor-Heuss-Allee 19, 53773 Hennef. > Registergericht/Registration Court: Amtsgericht Siegburg (HRB Nr. 9136) > Gesch?ftsf?hrer/Managing Directors: J?rgen Zender (Sprecher/Chairman), Anke > H?fer > _______________________________________________ > Linux-HA mailing list > Linux-HA@lists.linux-ha.org > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > -- Tim Serong <tser...@novell.com> Senior Clustering Engineer, OPS Engineering, Novell Inc. _______________________________________________ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems