Hi. Which server did you shut down during testing? If it was 192.168.20.223, it is expected that the kafka-consumer-groups script fails, because you passed only 192.168.20.223 to the bootstrap-server argument.
In an HA setup, you have to pass multiple brokers (as a comma-separated string) to bootstrap-server so that the client can fetch its initial metadata from another server even when one fails.

On Sat, Jan 20, 2024 at 0:30, Yavuz Sert <yavuz.s...@netsia.com> wrote:

> Hi all,
>
> I'm trying to do some tests about high availability on kafka v2.8.2
> I have 3 kafka brokers and 3 zookeeper instances.
> when i shutdown one of the kafka service only in one server i got this
> error:
>
> [root@node-223 ~]# /root/kafka_2.12-2.8.2/bin/kafka-consumer-groups.sh --bootstrap-server 192.168.20.223:9092 --group app2 --describe
>
> Error: Executing consumer group command failed due to org.apache.kafka.common.errors.TimeoutException: Call(callName=findCoordinator, deadlineMs=1705677946526, tries=47, nextAllowedTryMs=1705677946627) timed out at 1705677946527 after 47 attempt(s)
> java.util.concurrent.ExecutionException: org.apache.kafka.common.errors.TimeoutException: Call(callName=findCoordinator, deadlineMs=1705677946526, tries=47, nextAllowedTryMs=1705677946627) timed out at 1705677946527 after 47 attempt(s)
>     at org.apache.kafka.common.internals.KafkaFutureImpl.wrapAndThrow(KafkaFutureImpl.java:45)
>     at org.apache.kafka.common.internals.KafkaFutureImpl.access$000(KafkaFutureImpl.java:32)
>     at org.apache.kafka.common.internals.KafkaFutureImpl$SingleWaiter.await(KafkaFutureImpl.java:89)
>     at org.apache.kafka.common.internals.KafkaFutureImpl.get(KafkaFutureImpl.java:260)
>     at kafka.admin.ConsumerGroupCommand$ConsumerGroupService.$anonfun$describeConsumerGroups$1(ConsumerGroupCommand.scala:550)
>     at scala.collection.TraversableLike.$anonfun$map$1(TraversableLike.scala:286)
>     at scala.collection.Iterator.foreach(Iterator.scala:943)
>     at scala.collection.Iterator.foreach$(Iterator.scala:943)
>     at scala.collection.AbstractIterator.foreach(Iterator.scala:1431)
>     at scala.collection.IterableLike.foreach(IterableLike.scala:74)
>     at scala.collection.IterableLike.foreach$(IterableLike.scala:73)
>     at scala.collection.AbstractIterable.foreach(Iterable.scala:56)
>     at scala.collection.TraversableLike.map(TraversableLike.scala:286)
>     at scala.collection.TraversableLike.map$(TraversableLike.scala:279)
>     at scala.collection.AbstractTraversable.map(Traversable.scala:108)
>     at kafka.admin.ConsumerGroupCommand$ConsumerGroupService.describeConsumerGroups(ConsumerGroupCommand.scala:549)
>     at kafka.admin.ConsumerGroupCommand$ConsumerGroupService.collectGroupsOffsets(ConsumerGroupCommand.scala:565)
>     at kafka.admin.ConsumerGroupCommand$ConsumerGroupService.describeGroups(ConsumerGroupCommand.scala:368)
>     at kafka.admin.ConsumerGroupCommand$.run(ConsumerGroupCommand.scala:73)
>     at kafka.admin.ConsumerGroupCommand$.main(ConsumerGroupCommand.scala:60)
>     at kafka.admin.ConsumerGroupCommand.main(ConsumerGroupCommand.scala)
> Caused by: org.apache.kafka.common.errors.TimeoutException: Call(callName=findCoordinator, deadlineMs=1705677946526, tries=47, nextAllowedTryMs=1705677946627) timed out at 1705677946527 after 47 attempt(s)
> *Caused by: org.apache.kafka.common.errors.TimeoutException: Timed out waiting for a node assignment. Call: findCoordinator*
>
> kafka conf (for 1 server)
> broker.id=0
> listeners=PLAINTEXT://0.0.0.0:9092
> advertised.listeners=PLAINTEXT://192.168.20.223:9092
> num.network.threads=3
> num.io.threads=8
> socket.send.buffer.bytes=102400
> socket.receive.buffer.bytes=102400
> socket.request.max.bytes=104857600
> log.dirs=/root/kafkadir
> num.partitions=1
> num.recovery.threads.per.data.dir=1
> offsets.topic.replication.factor=1
> transaction.state.log.replication.factor=1
> transaction.state.log.min.isr=1
> log.retention.hours=1
> log.segment.bytes=104857600
> log.retention.check.interval.ms=300000
> delete.topic.enable=true
> zookeeper.connection.timeout.ms=18000
> zookeeper.connect=192.168.20.223:2181,192.168.20.224:2181,192.168.20.225:2181
> group.initial.rebalance.delay.ms=0
> max.request.size=104857600
> message.max.bytes=104857600
>
> How can i fix or troubleshoot the error?
>
> Thanks
>
> Yavuz
>

--
========================
Okada Haruki
ocadar...@gmail.com
========================