[ 
https://issues.apache.org/jira/browse/KAFKA-1070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14702030#comment-14702030
 ] 

Alex Etling commented on KAFKA-1070:
------------------------------------

Hello Kafka Geniuses,
     I have a question regarding this PR.  Some context: 
    We have started playing around with kafka 0.8.2.1 to build a data pipeline 
at the company I work at.  Initially, to get autoscaling with AWS working, we 
were mapping the IP of our boxes to our broker id.  For a while everything was 
good.  Today I realized (please correct me if I am wrong) that kafka assigns 
the replicas to a topic at topic creation time. These replicas are not modified 
later unless you specifically rebalance the cluster(this is different than ISR 
which can go from 0 servers to the set of replicas).  This leads to an 
interesting question on how to cycle in new boxes.  The easiest way seems to be 
to copy all data from one box to another, kill the old box and start the new 
box with the same broker.id.  This is not really easy when you do a direct 
mapping of IP -> broker.id.  
     So now we come to this Jira ticket.  I was wondering if you could 
enumerate for me how this auto-assign node id would deal with the cycling of a 
box.  If a bring down a box that was auto-assigned a broker.id of X and bring 
back up a new box, what will happen.  Will that new box have broker.id X as 
well?  What if I bring down two boxes with broker.id X and broker.id Y, what is 
the broker.id of the new box i spin up. 

Thanks for the help,
Alex
     

> Auto-assign node id
> -------------------
>
>                 Key: KAFKA-1070
>                 URL: https://issues.apache.org/jira/browse/KAFKA-1070
>             Project: Kafka
>          Issue Type: Bug
>            Reporter: Jay Kreps
>            Assignee: Sriharsha Chintalapani
>              Labels: usability
>             Fix For: 0.8.3
>
>         Attachments: KAFKA-1070.patch, KAFKA-1070_2014-07-19_16:06:13.patch, 
> KAFKA-1070_2014-07-22_11:34:18.patch, KAFKA-1070_2014-07-24_20:58:17.patch, 
> KAFKA-1070_2014-07-24_21:05:33.patch, KAFKA-1070_2014-08-21_10:26:20.patch, 
> KAFKA-1070_2014-11-20_10:50:04.patch, KAFKA-1070_2014-11-25_20:29:37.patch, 
> KAFKA-1070_2015-01-01_17:39:30.patch, KAFKA-1070_2015-01-12_10:46:54.patch, 
> KAFKA-1070_2015-01-12_18:30:17.patch
>
>
> It would be nice to have Kafka brokers auto-assign node ids rather than 
> having that be a configuration. Having a configuration is irritating because 
> (1) you have to generate a custom config for each broker and (2) even though 
> it is in configuration, changing the node id can cause all kinds of bad 
> things to happen.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to