On 4/29/2011 at 03:36 AM, "Stallmann, Andreas" <astallm...@conet.de> wrote: 
> Hi! 
>  
> In one of my clusters I disconnect one of the nodes (say app01) from the  
> network. App02 takes of the resources as it should. Nice. 
> When I reconnect app01 to the network, crm_mon on app01 continues to report  
> app02 as "offline" and crm_mon on app02 does the same for app01. Still, no  
> errors are reported for TOTEM in the logs, and corosync-cfgtool -s reports 
> both  
> rings as "active with no faults". 
>  
> When sniffing for multicast-packets, I see packets originating from app01 but 
>  
> not from app02. 

Just on a punt...  There's not a (partial) firewall running on app02 is there?

Regards,

Tim

> Pinging the nodes (using ips or names) works for all interfaces. 
>  
> I'm at a loss. Any ideas? How can I debug what's happening between the two  
> nodes? And how can I bring an "offline" node online again without rebooting  
> or restarting corosync? 
>  
> Thanks in advance, 
>  
> Andreas - breaking any record in this mailing list in asking questions... 
> PS: corosync.conf below: 
>  
> compatibility: whitetank 
> aisexec { 
>         user:           root 
>         group:          root 
> } 
> service { 
>         ver:            0 
>         name:           pacemaker 
>         use_mgmtd:      yes 
>         use_logd:       yes 
> } 
> totem { 
>         version:        2 
>         token:          5000 
>         token_retransmits_before_loss_const: 10 
>         join:           60 
>         consensus:      6000 
>         vsftype:        none 
>         max_messages:   20 
>         clear_node_high_bit: yes 
>         secauth:        off 
>         threads:        0 
>         interface { 
>                 ringnumber:     0 
>                 bindnetaddr:    10.10.10.0 
>                 mcastaddr:      239.192.200.51 
>                 mcastport:      5405 
>         } 
>         interface { 
>                 ringnumber:     1 
>                 bindnetaddr:    192.168.1.0 
>                 mcastaddr:      239.192.200.52 
>                 mcastport:      5405 
>         } 
>         rrp_mode:       active 
> } 
> logging { 
>         fileline:       off 
>         to_stderr:      no 
>         to_logfile:     no 
>         to_syslog:      yes 
>         syslog_facility: daemon 
>         debug:          off 
>         timestamp:      off 
> } 
> amf { 
>         mode: disabled 
> } 
>  
> ------------------------ 
> CONET Solutions GmbH, Theodor-Heuss-Allee 19, 53773 Hennef. 
> Registergericht/Registration Court: Amtsgericht Siegburg (HRB Nr. 9136) 
> Gesch?ftsf?hrer/Managing Directors: J?rgen Zender (Sprecher/Chairman), Anke  
> H?fer 
> _______________________________________________ 
> Linux-HA mailing list 
> Linux-HA@lists.linux-ha.org 
> http://lists.linux-ha.org/mailman/listinfo/linux-ha 
> See also: http://linux-ha.org/ReportingProblems 
>  




-- 
Tim Serong <tser...@novell.com>
Senior Clustering Engineer, OPS Engineering, Novell Inc.



_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to