Chia-Ping Tsai created KAFKA-19500: -------------------------------------- Summary: kafka-consumer-groups.sh should fail quickly if the partition leader is unavailable Key: KAFKA-19500 URL: https://issues.apache.org/jira/browse/KAFKA-19500 Project: Kafka Issue Type: Improvement Reporter: Chia-Ping Tsai Assignee: Chia-Ping Tsai
{code:java} Error: Executing consumer group command failed due to java.util.concurrent.ExecutionException: org.apache.kafka.common.errors.TimeoutException: Call(callName=listOffsets(api=METADATA), deadlineMs=1752487911886, tries=492805, nextAllowedTryMs=1752487912888) timed out at 1752487911888 after 492805 attempt(s) java.lang.RuntimeException: java.util.concurrent.ExecutionException: org.apache.kafka.common.errors.TimeoutException: Call(callName=listOffsets(api=METADATA), deadlineMs=1752487911886, tries=492805, nextAllowedTryMs=1752487912888) timed out at 1752487911888 after 492805 attempt(s) at org.apache.kafka.tools.OffsetsUtils.getLogTimestampOffsets(OffsetsUtils.java:190) at org.apache.kafka.tools.OffsetsUtils.resetByDuration(OffsetsUtils.java:352) at org.apache.kafka.tools.consumer.group.ConsumerGroupCommand$ConsumerGroupService.prepareOffsetsToReset(ConsumerGroupCommand.java:1015) at org.apache.kafka.tools.consumer.group.ConsumerGroupCommand$ConsumerGroupService.resetOffsetsForInactiveGroup(ConsumerGroupCommand.java:704) at org.apache.kafka.tools.consumer.group.ConsumerGroupCommand$ConsumerGroupService.lambda$resetOffsets$24(ConsumerGroupCommand.java:681) at java.base/java.util.HashMap.forEach(HashMap.java:1429) at org.apache.kafka.tools.consumer.group.ConsumerGroupCommand$ConsumerGroupService.resetOffsets(ConsumerGroupCommand.java:675) at org.apache.kafka.tools.consumer.group.ConsumerGroupCommand.run(ConsumerGroupCommand.java:130) at org.apache.kafka.tools.consumer.group.ConsumerGroupCommand.main(ConsumerGroupCommand.java:110) Caused by: java.util.concurrent.ExecutionException: org.apache.kafka.common.errors.TimeoutException: Call(callName=listOffsets(api=METADATA), deadlineMs=1752487911886, tries=492805, nextAllowedTryMs=1752487912888) timed out at 1752487911888 after 492805 attempt(s) at java.base/java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:396) at java.base/java.util.concurrent.CompletableFuture.get(CompletableFuture.java:2073) at org.apache.kafka.common.internals.KafkaFutureImpl.get(KafkaFutureImpl.java:155) at org.apache.kafka.tools.OffsetsUtils.getLogTimestampOffsets(OffsetsUtils.java:167) ... 8 more {code} `Admin#listOffsets` needs to communicate to the partition leader to get latest information. Hence, the call hangs if the node hosting the leader is unavailable. It should fail quickly by using `describeTopics` to check the leaders for all input partitions -- This message was sent by Atlassian Jira (v8.20.10#820010)