[
https://issues.apache.org/jira/browse/KAFKA-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15005170#comment-15005170
]
Guozhang Wang commented on KAFKA-2841:
--------------------------------------
[~hachikuji] Is this solvable in KAFKA-2721?
> Group metadata cache loading is not safe when reloading a partition
> -------------------------------------------------------------------
>
> Key: KAFKA-2841
> URL: https://issues.apache.org/jira/browse/KAFKA-2841
> Project: Kafka
> Issue Type: Bug
> Affects Versions: 0.9.0.0
> Reporter: Jason Gustafson
> Assignee: Jason Gustafson
> Priority: Blocker
>
> If the coordinator receives a leaderAndIsr request which includes a higher
> leader epoch for one of the partitions that it owns, then it will reload the
> offset/metadata for that partition again. This can happen because the leader
> epoch is incremented for ISR changes which do not result in a new leader for
> the partition. Currently, the coordinator replaces cached metadata values
> blindly on reloading, which can result in weird behavior such as unexpected
> session timeouts or request timeouts while rebalancing.
> To fix this, we need to check that the group being loaded has a higher
> generation than the cached value before replacing it. Also, if we have to
> replace a cached value (which shouldn't happen except when loading), we need
> to be very careful to ensure that any active delayed operations won't affect
> the group.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)