Hi all,

I'm seeing lots of nodes being "lost" in my logs :

Oct 22 16:47:46 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db3 419632812
Oct 22 18:18:48 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db3 419632812
Oct 22 18:47:14 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db5 402855596
Oct 22 21:17:42 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db5 402855596
Oct 23 04:27:48 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db5 402855596
Oct 23 07:18:47 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db5 402855596
Oct 23 07:18:50 [10452] s3db1       crmd:     info: do_election_count_vote:  
Election 11 (owner: s3db5) lost: vote from s3db5 (Uptime)
Oct 23 07:18:50 [10452] s3db1       crmd:     info: do_election_count_vote:  
Election 12 (owner: s3db5) lost: vote from s3db5 (Uptime)
Oct 23 08:10:42 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db5 402855596
Oct 23 08:10:42 corosync [pcmk  ] info: pcmk_peer_update: lost: s3db3 419632812
Oct 23 08:10:42 corosync [pcmk  ] info: pcmk_peer_update: lost: s3quorum1 
1141053100

This is occurring on a systems running nearly 100% idle on a very quiet 10Gb/s 
network with no other activity, and no packet loss. Any idea what else could be 
causing this ? I notice on the ClusterLabs wiki, there appear to be tweaked 
values in the "Initial Configuration" page 
(http://clusterlabs.org/wiki/Initial_Configuration) which seem to deal with 
timeouts (see below). I have kept everything to the stock defaults, except I'm 
using udpu. Are these a likely candidate ? Is it recommended that these values 
should be applied in a production environment ?

Recommended values from Wiki :
            # How long before declaring a token lost (ms)
           token:          5000
            # How many token retransmits before forming a new configuration
           token_retransmits_before_loss_const: 20
            # How long to wait for join messages in the membership protocol (ms)
           join:           1000
            # How long to wait for consensus to be achieved before starting a 
new round of membership configuration (ms)
           consensus:      7500
            # Turn off the virtual synchrony filter
           vsftype:        none
            # Number of messages that may be sent by one processor on receipt 
of the token
           max_messages:   20
            # Disable encryption
           secauth:            off
            # How many threads to use for encryption/decryption
           threads:            0

My configuration :

compatibility: whitetank
totem {
    version: 2
    secauth: off
    interface {
        member {
            memberaddr: 172.22.3.26
        }
        member {
            memberaddr: 172.22.3.25
        }

.... And so on...

        ringnumber: 0
        # different on all hosts
        bindnetaddr: 172.22.3.26
        mcastport: 5405
    }
    transport: udpu
}



Many thanks,

-Mark
________________________________
Mark Round
Senior Systems Administrator
NCC Group
Kings Court Kingston Road
Leatherhead, KT22 7SL

Telephone: +44 1372 383815
Mobile: +44 7790 770413
Fax:
Website: www.nccgroup.com<http://www.nccgroup.com>
Email:  [email protected]<mailto:[email protected]>
        [http://www.nccgroup.com/media/192418/nccgrouplogo.jpg] 
<http://www.nccgroup.com/>
________________________________

This email is sent for and on behalf of NCC Group. NCC Group is the trading 
name of NCC Group Performance Testing Limited (Registered in England CRN: 
4069379). Registered Office: Manchester Technology Centre, Oxford Road, 
Manchester, M1 7EF. The ultimate holding company is NCC Group plc (Registered 
in England CRN: 4627044).

Confidentiality: This e-mail contains proprietary information, some or all of 
which may be confidential and/or legally privileged. It is for the intended 
recipient only. If an addressing or transmission error has misdirected this 
e-mail, please notify the author by replying to this e-mail and then delete the 
original. If you are not the intended recipient you may not use, disclose, 
distribute, copy, print or rely on any information contained in this e-mail. 
You must not inform any other person other than NCC Group or the sender of its 
existence.

For more information about NCC Group please visit 
www.nccgroup.com<http://www.nccgroup.com>

P Before you print think about the ENVIRONMENT


For more information please visit <a 
href="http://www.mimecast.com";>http://www.mimecast.com<br>
This email message has been delivered safely and archived online by Mimecast.
</a>
_______________________________________________
discuss mailing list
[email protected]
http://lists.corosync.org/mailman/listinfo/discuss

Reply via email to