Gakhramanzode commented on PR #16767:
URL: https://github.com/apache/kafka/pull/16767#issuecomment-2470591629
@C0urante @frankvicky
Hello,
I'm encountering an issue that appears to be related to this pull request.
**Context:**
- After upgrading from Kafka 3.6.0 to 3.9.0, our MirrorCheckpointConnector
consistently fails with the error:
```text
[2024-11-12 16:41:53,084] ERROR [Worker clientId=app-dev->telemetry-dev, groupId=app-dev-mm2] Failed to reconfigure connector's tasks (MirrorCheckpointConnector), retrying after backoff. (org.apache.kafka.connect.runtime.distributed.DistributedHerder:2195)
org.apache.kafka.connect.errors.RetriableException: Timeout while loading consumer groups.
    at org.apache.kafka.connect.mirror.MirrorCheckpointConnector.taskConfigs(MirrorCheckpointConnector.java:138)
    at org.apache.kafka.connect.runtime.Worker.connectorTaskConfigs(Worker.java:398)
    at org.apache.kafka.connect.runtime.distributed.DistributedHerder.reconfigureConnector(DistributedHerder.java:2243)
    at org.apache.kafka.connect.runtime.distributed.DistributedHerder.reconfigureConnectorTasksWithExponentialBackoffRetries(DistributedHerder.java:2183)
    at org.apache.kafka.connect.runtime.distributed.DistributedHerder.lambda$null$47(DistributedHerder.java:2199)
    at org.apache.kafka.connect.runtime.distributed.DistributedHerder.runRequest(DistributedHerder.java:2402)
    at org.apache.kafka.connect.runtime.distributed.DistributedHerder.tick(DistributedHerder.java:498)
    at org.apache.kafka.connect.runtime.distributed.DistributedHerder.run(DistributedHerder.java:383)
    at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:539)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
    at java.base/java.lang.Thread.run(Thread.java:840)
```
- Increasing `admin.timeout.ms` and related timeout settings did not resolve
the issue.
- The problem seems to stem from `knownConsumerGroups` remaining `null`,
causing `taskConfigs()` to throw a `RetriableException`.
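
To illustrate the failure mode I suspect, here is a minimal, self-contained sketch. It is not the connector's actual source: `CheckpointConnectorSketch`, `RetriableSketchException`, and `onGroupsLoaded` are hypothetical names, and the task-config logic is deliberately simplified. It only models the guard I believe is involved, where a `null` group set is treated as "initial load not finished" and surfaces as a retriable error.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Simplified, hypothetical model of the suspected failure mode; not Kafka source code.
class CheckpointConnectorSketch {

    // Hypothetical stand-in for org.apache.kafka.connect.errors.RetriableException.
    static class RetriableSketchException extends RuntimeException {
        RetriableSketchException(String message) {
            super(message);
        }
    }

    // null means "the initial consumer group load has not completed yet".
    private volatile Set<String> knownConsumerGroups = null;

    // Hypothetical callback invoked once the background load finishes.
    void onGroupsLoaded(Set<String> groups) {
        knownConsumerGroups = groups;
    }

    List<Map<String, String>> taskConfigs(int maxTasks) {
        Set<String> groups = knownConsumerGroups;
        if (groups == null) {
            // In the log above, the herder reports this and retries after a backoff.
            throw new RetriableSketchException("Timeout while loading consumer groups.");
        }
        List<Map<String, String>> configs = new ArrayList<>();
        if (groups.isEmpty() || maxTasks < 1) {
            return configs;
        }
        // Simplification: one task config carrying all groups, sorted for stable output.
        configs.add(Map.of("groups", String.join(",", new TreeSet<>(groups))));
        return configs;
    }

    public static void main(String[] args) {
        CheckpointConnectorSketch connector = new CheckpointConnectorSketch();
        try {
            connector.taskConfigs(1);
        } catch (RetriableSketchException e) {
            System.out.println("retriable: " + e.getMessage());
        }
        connector.onGroupsLoaded(Set.of("group-a", "group-b"));
        System.out.println(connector.taskConfigs(1));
    }
}
```

If the load never completes in this model, every retry hits the same guard. That would be consistent with the error repeating no matter how high `admin.timeout.ms` is set, though I have not confirmed this is what happens in 3.9.0.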
**Questions:**
- Are there any suggested workarounds to mitigate this issue in the
meantime?
Thank you for your efforts in resolving this problem. Any assistance would
be greatly appreciated.
Best regards