[
https://issues.apache.org/jira/browse/KAFKA-340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13482523#comment-13482523
]
Neha Narkhede commented on KAFKA-340:
-------------------------------------
Thanks for patch v2, looks good overall. Couple minor questions -
3. KafkaController
3.1 In maybeRemoveReplicaFromIsr, let's add a warn statement that says why the
replica was not removed from the isr. There are 3 conditions when that happens,
lets state which condition was satisfied. This will be very useful for
debugging.
3.2 If removing the replica from isr is unnecessary, why do we still have to
send the leader and isr request to the leader ?
3.3 In the OfflineReplica state change, since the leader doesn't change, we
probably don't need to update the leader and isr cache too. Realize that this
is not introduced in this patch, but will be nice to fix it anyway.
3.4 What is the point of sending leader and isr request at the end of
shutdownBroker, since the OfflineReplica state change would've taken care of
that anyway. It seems like you just need to send the stop replica request with
the delete partitions flag turned off, no ?
4. PartitionLeaderSelector
Minor logging correction - So far we have been using [%s,%d] convention for
logging topic-partition. Let's change the following logging statement to do the
same -
debug("Partition %s-%d : current leader = %d, new leader = %d"
5. StopReplicaRequest
Probably makes sense to change warn to error or even throw exception to alert
on an invalid byte in the stop replica request. This is pretty serious and
probably points to corruption somewhere in the code
> Implement clean shutdown in 0.8
> -------------------------------
>
> Key: KAFKA-340
> URL: https://issues.apache.org/jira/browse/KAFKA-340
> Project: Kafka
> Issue Type: Sub-task
> Components: core
> Affects Versions: 0.8
> Reporter: Jun Rao
> Assignee: Joel Koshy
> Priority: Blocker
> Labels: bugs
> Attachments: KAFKA-340-v1.patch, KAFKA-340-v2.patch
>
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> If we are shutting down a broker when the ISR of a partition includes only
> that broker, we could lose some messages that have been previously committed.
> For clean shutdown, we need to guarantee that there is at least 1 other
> broker in ISR after the broker is shut down.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira