pacemaker 1.1.12-11.12
openais 1.1.4-5.24.5
corosync 1.4.7-0.23.5

It's a two-node active/passive cluster. We just upgraded from SLES 11
SP 3 to SLES 11 SP 4 (nothing else), but when we try to start the cluster
service we get the following error:

"Totem is unable to form a cluster because of an operating system or
network fault."

Firewall is stopped and disabled on both the nodes. Both nodes can
ping/ssh/vnc each other.

It's hard to help. First of all, I would recommend asking SUSE support, because I don't have access to the source code of the corosync 1.4.7-0.23.5 package and so don't know which patches it carries.



corosync.conf:
aisexec {
     group:    root
     user:    root
}
service {
     use_mgmtd:    yes
     use_logd:    yes
     ver:    0
     name:    pacemaker
}
totem {
     rrp_mode:    none
     join:    60
     max_messages:    20
     vsftype:    none
     token:    5000
     consensus:    6000

     interface {
         bindnetaddr:    192.168.150.0

         member {
             memberaddr:     192.168.150.12
         }
         member {
             memberaddr:      192.168.150.13
         }
         mcastport:    5405

         ringnumber:    0

     }
     secauth:    off
     version:    2
     transport:    udpu
     token_retransmits_before_loss_const:    10
     clear_node_high_bit:    new
}
logging {
     to_logfile:    no
     to_syslog:    yes
     debug:    off
     timestamp:    off
     to_stderr:    no
     fileline:    off
     syslog_facility:    daemon
}
amf {
     mode:    disable
}

/var/log/messages:
Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Corosync Cluster Engine ('1.4.7'): started and ready to provide service.
Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Corosync built-in features: nss
Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Successfully configured openais services to load
Apr  6 17:51:49 prd1 corosync[8672]:  [MAIN  ] Successfully read main configuration file '/etc/corosync/corosync.conf'.
Apr  6 17:51:49 prd1 corosync[8672]:  [TOTEM ] Initializing transport (UDP/IP Unicast).
Apr  6 17:51:49 prd1 corosync[8672]:  [TOTEM ] Initializing transmit/receive security: libtomcrypt SOBER128/SHA1HMAC (mode 0).
Apr  6 17:51:49 prd1 corosync[8672]:  [TOTEM ] The network interface is down.

^^^ This is the important line. It means corosync was unable to find an interface for bindnetaddr 192.168.150.0. Make sure an interface with this network address exists.
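
One way to check this on each node is to compare the configured bindnetaddr against the IPv4 addresses the node actually has. The following is only a rough sketch in Python, not corosync's own logic: it assumes the iproute2 `ip` command is available, and it approximates the match as "bindnetaddr falls inside the interface's network". The function names are made up for this illustration.

```python
import ipaddress
import re
import subprocess

BINDNETADDR = "192.168.150.0"  # value taken from the corosync.conf above


def read_local_addresses():
    """Return the raw output of `ip -o -4 addr show` (assumes iproute2 is installed)."""
    result = subprocess.run(
        ["ip", "-o", "-4", "addr", "show"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def parse_ip_addr(output):
    """Extract (interface name, IPv4Interface) pairs from `ip -o -4 addr show` output."""
    pairs = []
    for line in output.splitlines():
        # One-line format looks like: "2: eth0    inet 192.168.150.12/24 brd ..."
        m = re.match(r"\d+:\s+(\S+)\s+inet\s+(\S+)", line)
        if m:
            pairs.append((m.group(1), ipaddress.ip_interface(m.group(2))))
    return pairs


def matching_interfaces(bindnetaddr, pairs):
    """Return the pairs whose network contains bindnetaddr.

    Approximation only: this is an assumption about how the match works,
    not a copy of corosync's code.
    """
    bind = ipaddress.ip_address(bindnetaddr)
    return [(name, iface) for name, iface in pairs if bind in iface.network]
```

Calling `matching_interfaces(BINDNETADDR, parse_ip_addr(read_local_addresses()))` on a node returns an empty list when no configured address falls in the 192.168.150.0 network. That would be consistent with the "network interface is down" message, for example if the interface was renamed or lost its address during the upgrade.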

Regards,
  Honza


