[
https://issues.apache.org/jira/browse/KAFKA-18469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17912635#comment-17912635
]
Lianet Magrans commented on KAFKA-18469:
----------------------------------------
Merged to trunk and cherry-picked to 4.0
[https://github.com/apache/kafka/commit/5f46ff855475c23f8dfba9ec77df4de9dd184ebb]
> AsyncConsumer fails to retry ListOffsetRequest on ReplicaNotAvailable error
> without metadata update
> ---------------------------------------------------------------------------------------------------
>
> Key: KAFKA-18469
> URL: https://issues.apache.org/jira/browse/KAFKA-18469
> Project: Kafka
> Issue Type: Bug
> Components: clients, consumer
> Reporter: Yu-Lin Chen
> Assignee: Yu-Lin Chen
> Priority: Blocker
> Labels: consumer-threading-refactor
> Fix For: 4.0.0
>
>
> In AsyncConsumer, the ListOffsetRequest is only retried after the metadata
> update[1]. However, not every retriable errors are followed by a metadata
> update, such as the ReplicaNotAvailable error from remote storage. This
> errors leads to Consumer#offsetsForTimes failing after api timeout(60
> seconds).
> This issue does not occur with ClassicConsumer, which always triggers a
> metadata update before retrying. [2]
>
> This issue is the root cause of the flaky test KAFKA-18036, where
> consumer#offsetsForTimes is called before remote metadata cache initialized.
>
> [1]
> [https://github.com/apache/kafka/blob/5684fc7a2ee1a4f29cb6d69d713233ed3c297882/clients/src/main/java/org/apache/kafka/clients/consumer/internals/OffsetsRequestManager.java#L529-L534]
> [2]
> https://github.com/apache/kafka/blob/5684fc7a2ee1a4f29cb6d69d713233ed3c297882/clients/src/main/java/org/apache/kafka/clients/consumer/internals/OffsetFetcher.java#L180
--
This message was sent by Atlassian Jira
(v8.20.10#820010)