[ https://issues.apache.org/jira/browse/CASSANDRA-15141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16980297#comment-16980297 ]
Alex Liu edited comment on CASSANDRA-15141 at 11/22/19 4:28 PM: ---------------------------------------------------------------- How does {{[getAddressReplicas()|https://github.com/apache/cassandra/blob/7df67eff2d66dba4bed2b4f6aeabf05144d9b057/src/java/org/apache/cassandra/service/StorageService.java#L3002] block heartbeat propagation?}} was (Author: alexliu68): How does {{[getAddressReplicas()|https://github.com/apache/cassandra/blob/7df67eff2d66dba4bed2b4f6aeabf05144d9b057/src/java/org/apache/cassandra/service/StorageService.java#L3002] blocks heartbeat propagation?}} > Faster token ownership calculation for NetworkTopologyStrategy > -------------------------------------------------------------- > > Key: CASSANDRA-15141 > URL: https://issues.apache.org/jira/browse/CASSANDRA-15141 > Project: Cassandra > Issue Type: Improvement > Components: Cluster/Gossip, Cluster/Membership > Reporter: Jay Zhuang > Assignee: Jay Zhuang > Priority: Normal > > This function > [{{getAddressReplicas()}}|https://github.com/apache/cassandra/blob/7df67eff2d66dba4bed2b4f6aeabf05144d9b057/src/java/org/apache/cassandra/service/StorageService.java#L3002] > during removenode and decommission is slow for large vnode cluster with > NetworkTopologyStrategy. As it needs to build whole replications map for > every token range. > In one of our cluster (> 1k nodes), it takes about 20 seconds for each > NetworkTopologyStrategy keyspace, so the total time to process a removenode > message takes at least 80 seconds (20 * 4: 3 system keyspaces, 1 user > keyspace). It blocks the heartbeat propagation and causes false down node. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org