Shawn Wang created KAFKA-14016:
----------------------------------
Summary: Revoke more partitions than expected in Cooperative
rebalance
Key: KAFKA-14016
URL: https://issues.apache.org/jira/browse/KAFKA-14016
Project: Kafka
Issue Type: Bug
Components: clients
Affects Versions: 3.3.0
Reporter: Shawn Wang
In https://issues.apache.org/jira/browse/KAFKA-13419 we found that some
consumer didn't reset generation and state after sync group fail with
REABALANCE_IN_PROGRESS error.
So we fixed it by reset generationId (no memberId) when sync group fail with
REABALANCE_IN_PROGRESS error.
But this change missed the reset part, so another change made in
https://issues.apache.org/jira/browse/KAFKA-13891 make this works.
After apply this change, we found that: sometimes consumer will revoker almost
1/3 of the partitions with cooperative enabled. Because if a consumer did a
very quick re-join, other consumers will get REABALANCE_IN_PROGRESS in
syncGroup and revoked their partition before re-jion.
I think the whole history is : in Cooperative rebalance, we found a duplicate
assign bug https://issues.apache.org/jira/browse/KAFKA-12984 and have several
fixes:
#
https://issues.apache.org/jira/browse/KAFKA-12983
*
**
--
This message was sent by Atlassian Jira
(v8.20.7#820007)