Hi Steve,
I've run more systematic tests on this issue, and here are my
conclusions:
In my network configuration, one heartbeat is on eth1 and the other is
on br0 (a bridge). I tested starting corosync several times with:
at first/ 1st ringnumber bound to the eth1 IF and 2nd ringnumber bound to
the br0 IF
and then/ 1st ringnumber bound to the br0 IF and 2nd ringnumber bound to
the eth1 IF
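For reference, the first ordering corresponds roughly to a totem section
like the sketch below. This is only an illustration: the bindnetaddr
values are the ones from my earlier mail, but which one belongs to eth1
and which to br0 is assumed here, and the mcastaddr and mcastport values
are placeholders, not my real settings. The second ordering is the same
with the two bindnetaddr values swapped.

```
totem {
    version: 2
    rrp_mode: active

    # 1st ring (assumed here to be the eth1 network)
    interface {
        ringnumber: 0
        bindnetaddr: 12.1.0.0
        mcastaddr: 226.94.1.1    # placeholder
        mcastport: 5405          # placeholder
    }

    # 2nd ring (assumed here to be the br0 network)
    interface {
        ringnumber: 1
        bindnetaddr: 12.0.0.0
        mcastaddr: 226.94.1.2    # placeholder
        mcastport: 5407          # placeholder
    }
}
```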
It appears that just after changing the order of the networks in the
rings, the first start always fails: crm_mon displays both nodes as
UNCLEAN indefinitely. Moreover, corosync cannot be stopped with
/etc/init.d/corosync stop, which hangs indefinitely, so we have to kill
the process and rm -f the subsys lock file.
After that, the second start always works: crm_mon displays both nodes
"On" after 60s.
From then on, we can stop and start corosync many times without any
problem.
As soon as I change the order of the networks in the rings again, the
first start fails again, and I have to repeat the procedure described
above so that further stops and starts work fine again.
So my conclusion is that if one of the two heartbeat rings is on a
bridge IF, there is a systematic problem at the first start only.
I hope these tests help.
Thanks
Regards
Alain Moullé
Hi
>
> is it supported to have one of the ringnumbers with a bindnetaddr linked to
> an Ethernet bridge interface (br0)?
>
> because on one config, I have :
> rrp_mode : active
> and
> for 1st ringnumber :
> bindnetaddr: 12.1.0.0
> for 2nd ringnumber :
> bindnetaddr: 12.0.0.0
>
I have never tried this, i.e. using a machine connected between two LANs
to act as a software network switch for one cluster. I can't say whether
it works or not.
Please post the output of ifconfig. It may be that your netmask is not
set properly, or that br0 is not in an up state (the ipaddr output below
says it is in an UNKNOWN state).
Regards
-steve
_______________________________________________
Openais mailing list
Openais@lists.linux-foundation.org
https://lists.linux-foundation.org/mailman/listinfo/openais