[ https://issues.apache.org/jira/browse/CASSANDRA-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12881228#action_12881228 ]
Jonathan Ellis commented on CASSANDRA-1037: ------------------------------------------- i'm skeptical that you'd be able to balance fast enough to deal w/ hot spots significantly better than "balance by disk space", but i suppose it's worth a try > Improve load balancing to take into account load in terms of operations > ----------------------------------------------------------------------- > > Key: CASSANDRA-1037 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1037 > Project: Cassandra > Issue Type: Improvement > Reporter: Jeremy Hanna > Priority: Minor > Fix For: 0.7 > > > Currently in cassandra, the load balancing takes into account disk space. > When using an order-preserving partitioner, there can be hot spots in the > various ranges of tokens in terms of operations. We would like to propose > improving the load balancing so that it takes that the number of operations > into account. > There are two places where this can be handled: > 1. when the cluster decides on which nodes need to be balanced out. > 2. how to balance an individual node - where to split > For number 1, the number of operations that a node performed could be > factored in to how important it is to balance that node. > For number 2, we are already using a midpoint in the node when trying to load > balance with respect to space. We would propose adding a weight to the > midpoint to lean towards splitting so that the operational load could be > better handled, not just space. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.