Hi,

I have a 4 nodes SFCFSRAC cluster, running on Linux (RHEL 5 x86_64), with 
SFCFSRAC version 5MP3RP2.

As part of my ATP, I've tried disconnecting node1 from the storage (by 
shutting down it's FC ports at the FC switch). The node paniced, and the 
cluster did recognize the node failure, evicted and reconfigured.
After the node paniced, I've restored it's FC connection and booted the node 
back into the cluster.

To my suprise, while node1 rejoined the cluster, node4 paniced with the 
following message:
"GAB: port b is halting the system due to network failure".

There are no network issues, and this is consistent - each time node1 rejoins 
the cluster after a failure, node4 will panic with the same message.


During the test node3 was the master, and the last events logged on node4 just 
before the crash are these:

Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20036 Port h gen   
6e7c0c membership ;123
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20038 Port h gen   
6e7c0c k_jeopardy 0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20040 Port h gen   
6e7c0c    visible 0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20036 Port w gen   
6e7c11 membership ;123
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20038 Port w gen   
6e7c11 k_jeopardy 0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20040 Port w gen   
6e7c11    visible 0
Mar 15 13:38:01 crmdb-rac-node4 Had[10552]: VCS INFO V-16-1-10077 Received new 
cluster membership
Mar 15 13:38:01 crmdb-rac-node4 Had[10552]: VCS ERROR V-16-1-10113 System 
crmdb-rac-node1 (Node '0') is in DDNA Membership - Membership: 0xe, Visible: 
0x0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20036 Port d gen   
6e7c0d membership ;123
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20038 Port d gen   
6e7c0d k_jeopardy 0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20040 Port d gen   
6e7c0d    visible 0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20036 Port f gen   
6e7c14 membership ;123
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20038 Port f gen   
6e7c14 k_jeopardy 0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20040 Port f gen   
6e7c14    visible 0
Mar 15 13:38:01 crmdb-rac-node4 kernel: GAB INFO V-15-1-20036 Port v gen   
6e7c0f membership ;123
Mar 15 13:38:02 crmdb-rac-node4 kernel: GAB INFO V-15-1-20038 Port v gen   
6e7c0f k_jeopardy 0
Mar 15 13:38:02 crmdb-rac-node4 kernel: GAB INFO V-15-1-20040 Port v gen   
6e7c0f    visible 0
Mar 15 13:38:02 crmdb-rac-node4 kernel: GAB INFO V-15-1-20032 Port a closed


Any ideas?


_______________________________________________
Veritas-ha maillist  -  Veritas-ha@mailman.eng.auburn.edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-ha

Reply via email to