Ryan Berdeen created KAFKA-1677:
-----------------------------------
Summary: Governor on concurrent replica reassignments
Key: KAFKA-1677
URL: https://issues.apache.org/jira/browse/KAFKA-1677
Project: Kafka
Issue Type: New Feature
Components: controller
Reporter: Ryan Berdeen
Assignee: Neha Narkhede
We have seen a cluster be killed via too many concurrent partition transfers.
An ideal solution is a configuration setting to limit the number of concurrent
transfers per host (dynamically tunable). (eg: transfer_limit defined in
http://docs.basho.com/riak/latest/ops/advanced/configs/configuration-files/#Ring).
To work around this, we generate our assignments, then use a tool to feed the
reassignments in small batches.
The size of the batch is based on either
* *the number partitions*, e.g., reassign all replicas for the first 2
partitions that have any moves
* *the number of individual replica moves*, e.g. when reassigning \[1,2,3,4] to
\[5,6,7,8], first reassign to \[5,6,3,4] then reassign to \[5,6,7,8]
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)