[ 
https://issues.apache.org/jira/browse/KAFKA-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13986026#comment-13986026
 ] 

Joel Koshy commented on KAFKA-1300:
-----------------------------------

Understood, but the primary use case would be to proceed to do a controlled
shutdown of the next broker in the shutdown plan. However, with retries and
a large enough retry interval that is not needed. (E.g., you can set a very
large number of retries.)

The documentation recommends closely monitoring under-replicated-partition
counts across the cluster (and alert if it is anything other than zero).
i.e., ensuring brokers are in a fully replicated state is a "best-practice"
for operations and should be 24/7 (not just during bounces).



> Added WaitForReplaction admin tool.
> -----------------------------------
>
>                 Key: KAFKA-1300
>                 URL: https://issues.apache.org/jira/browse/KAFKA-1300
>             Project: Kafka
>          Issue Type: New Feature
>          Components: tools
>    Affects Versions: 0.8.0
>         Environment: Ubuntu 12.04
>            Reporter: Brenden Matthews
>              Labels: patch
>             Fix For: 0.8.1
>
>         Attachments: 0001-Added-WaitForReplaction-admin-tool.patch
>
>
> I have created a tool similar to the broker shutdown tool for doing rolling 
> restarts of Kafka clusters.
> The tool watches the max replica lag of the specified broker, and waits until 
> the lag drops to 0 before exiting.
> To do a rolling restart, here's the process we use:
> for (broker <- brokers) {
>   run shutdown tool for broker
>   terminate broker
>   start new broker
>   run wait for replication tool on new broker
> }
> Here's an example command line use:
> ./kafka-run-class.sh kafka.admin.WaitForReplication --zookeeper 
> zk.host.com:2181 --num.retries 100 --retry.interval.ms 60000 --broker 0



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to