[ https://issues.apache.org/jira/browse/KAFKA-15793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17835653#comment-17835653 ]
Igor Soarez edited comment on KAFKA-15793 at 4/10/24 8:56 AM: -------------------------------------------------------------- This has come up again in [https://ci-builds.apache.org/blue/organizations/jenkins/Kafka%2Fkafka-pr/detail/PR-15522/5/tests/] {code:java} [2024-04-09T21:06:17.307Z] Gradle Test Run :core:test > Gradle Test Executor 97 > DeleteTopicsRequestTest > testTopicDeletionClusterHasOfflinePartitions(String) > "testTopicDeletionClusterHasOfflinePartitions(String).quorum=zk" STARTED [2024-04-09T21:06:17.307Z] kafka.zk.ZkMigrationIntegrationTest.testMigrateTopicDeletions(ClusterInstance)[7] failed, log available in /home/jenkins/jenkins-agent/workspace/Kafka_kafka-pr_PR-15522/core/build/reports/testOutput/kafka.zk.ZkMigrationIntegrationTest.testMigrateTopicDeletions(ClusterInstance)[7].test.stdout [2024-04-09T21:06:17.307Z] [2024-04-09T21:06:17.307Z] Gradle Test Run :core:test > Gradle Test Executor 96 > ZkMigrationIntegrationTest > testMigrateTopicDeletions(ClusterInstance) > testMigrateTopicDeletions [7] Type=ZK, MetadataVersion=3.7-IV4, Security=PLAINTEXT FAILED [2024-04-09T21:06:17.307Z] org.apache.kafka.server.fault.FaultHandlerException: nonFatalFaultHandler: Unhandled error in MetadataChangeEvent: Check op on KRaft Migration ZNode failed. Expected zkVersion = 5. This indicates that another KRaft controller is making writes to ZooKeeper. [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.handleUnwrappedMigrationResult$1(KafkaZkClient.scala:2001) [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.unwrapMigrationResponse$1(KafkaZkClient.scala:2027) [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.$anonfun$retryMigrationRequestsUntilConnected$2(KafkaZkClient.scala:2052) [2024-04-09T21:06:17.307Z] at app//scala.collection.StrictOptimizedIterableOps.map(StrictOptimizedIterableOps.scala:100) [2024-04-09T21:06:17.307Z] at app//scala.collection.StrictOptimizedIterableOps.map$(StrictOptimizedIterableOps.scala:87) [2024-04-09T21:06:17.307Z] at app//scala.collection.mutable.ArrayBuffer.map(ArrayBuffer.scala:43) [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.retryMigrationRequestsUntilConnected(KafkaZkClient.scala:2052) [2024-04-09T21:06:17.307Z] at app//kafka.zk.migration.ZkTopicMigrationClient.$anonfun$updateTopicPartitions$1(ZkTopicMigrationClient.scala:265) [2024-04-09T21:06:17.307Z] at app//kafka.zk.migration.ZkTopicMigrationClient.updateTopicPartitions(ZkTopicMigrationClient.scala:255) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.lambda$handleTopicsDelta$20(KRaftMigrationZkWriter.java:334) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver.applyMigrationOperation(KRaftMigrationDriver.java:248) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver.access$300(KRaftMigrationDriver.java:62) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver$MetadataChangeEvent.lambda$run$1(KRaftMigrationDriver.java:532) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver.lambda$countingOperationConsumer$6(KRaftMigrationDriver.java:845) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.lambda$handleTopicsDelta$21(KRaftMigrationZkWriter.java:331) [2024-04-09T21:06:17.307Z] at java.base@17.0.7/java.util.HashMap.forEach(HashMap.java:1421) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.handleTopicsDelta(KRaftMigrationZkWriter.java:297) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.handleDelta(KRaftMigrationZkWriter.java:112) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver$MetadataChangeEvent.run(KRaftMigrationDriver.java:531) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.queue.KafkaEventQueue$EventContext.run(KafkaEventQueue.java:128) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.queue.KafkaEventQueue$EventHandler.handleEvents(KafkaEventQueue.java:211) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.queue.KafkaEventQueue$EventHandler.run(KafkaEventQueue.java:182) [2024-04-09T21:06:17.307Z] at java.base@17.0.7/java.lang.Thread.run(Thread.java:833) [2024-04-09T21:06:17.307Z] [2024-04-09T21:06:17.307Z] Caused by: [2024-04-09T21:06:17.307Z] java.lang.RuntimeException: Check op on KRaft Migration ZNode failed. Expected zkVersion = 5. This indicates that another KRaft controller is making writes to ZooKeeper. [2024-04-09T21:06:17.307Z] at kafka.zk.KafkaZkClient.handleUnwrappedMigrationResult$1(KafkaZkClient.scala:2001) [2024-04-09T21:06:17.307Z] ... 22 more [2024-04-09T21:06:17.307Z] {code} was (Author: soarez): This has come up again: {code:java} [2024-04-09T21:06:17.307Z] Gradle Test Run :core:test > Gradle Test Executor 97 > DeleteTopicsRequestTest > testTopicDeletionClusterHasOfflinePartitions(String) > "testTopicDeletionClusterHasOfflinePartitions(String).quorum=zk" STARTED [2024-04-09T21:06:17.307Z] kafka.zk.ZkMigrationIntegrationTest.testMigrateTopicDeletions(ClusterInstance)[7] failed, log available in /home/jenkins/jenkins-agent/workspace/Kafka_kafka-pr_PR-15522/core/build/reports/testOutput/kafka.zk.ZkMigrationIntegrationTest.testMigrateTopicDeletions(ClusterInstance)[7].test.stdout [2024-04-09T21:06:17.307Z] [2024-04-09T21:06:17.307Z] Gradle Test Run :core:test > Gradle Test Executor 96 > ZkMigrationIntegrationTest > testMigrateTopicDeletions(ClusterInstance) > testMigrateTopicDeletions [7] Type=ZK, MetadataVersion=3.7-IV4, Security=PLAINTEXT FAILED [2024-04-09T21:06:17.307Z] org.apache.kafka.server.fault.FaultHandlerException: nonFatalFaultHandler: Unhandled error in MetadataChangeEvent: Check op on KRaft Migration ZNode failed. Expected zkVersion = 5. This indicates that another KRaft controller is making writes to ZooKeeper. [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.handleUnwrappedMigrationResult$1(KafkaZkClient.scala:2001) [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.unwrapMigrationResponse$1(KafkaZkClient.scala:2027) [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.$anonfun$retryMigrationRequestsUntilConnected$2(KafkaZkClient.scala:2052) [2024-04-09T21:06:17.307Z] at app//scala.collection.StrictOptimizedIterableOps.map(StrictOptimizedIterableOps.scala:100) [2024-04-09T21:06:17.307Z] at app//scala.collection.StrictOptimizedIterableOps.map$(StrictOptimizedIterableOps.scala:87) [2024-04-09T21:06:17.307Z] at app//scala.collection.mutable.ArrayBuffer.map(ArrayBuffer.scala:43) [2024-04-09T21:06:17.307Z] at app//kafka.zk.KafkaZkClient.retryMigrationRequestsUntilConnected(KafkaZkClient.scala:2052) [2024-04-09T21:06:17.307Z] at app//kafka.zk.migration.ZkTopicMigrationClient.$anonfun$updateTopicPartitions$1(ZkTopicMigrationClient.scala:265) [2024-04-09T21:06:17.307Z] at app//kafka.zk.migration.ZkTopicMigrationClient.updateTopicPartitions(ZkTopicMigrationClient.scala:255) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.lambda$handleTopicsDelta$20(KRaftMigrationZkWriter.java:334) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver.applyMigrationOperation(KRaftMigrationDriver.java:248) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver.access$300(KRaftMigrationDriver.java:62) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver$MetadataChangeEvent.lambda$run$1(KRaftMigrationDriver.java:532) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver.lambda$countingOperationConsumer$6(KRaftMigrationDriver.java:845) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.lambda$handleTopicsDelta$21(KRaftMigrationZkWriter.java:331) [2024-04-09T21:06:17.307Z] at java.base@17.0.7/java.util.HashMap.forEach(HashMap.java:1421) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.handleTopicsDelta(KRaftMigrationZkWriter.java:297) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationZkWriter.handleDelta(KRaftMigrationZkWriter.java:112) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.metadata.migration.KRaftMigrationDriver$MetadataChangeEvent.run(KRaftMigrationDriver.java:531) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.queue.KafkaEventQueue$EventContext.run(KafkaEventQueue.java:128) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.queue.KafkaEventQueue$EventHandler.handleEvents(KafkaEventQueue.java:211) [2024-04-09T21:06:17.307Z] at app//org.apache.kafka.queue.KafkaEventQueue$EventHandler.run(KafkaEventQueue.java:182) [2024-04-09T21:06:17.307Z] at java.base@17.0.7/java.lang.Thread.run(Thread.java:833) [2024-04-09T21:06:17.307Z] [2024-04-09T21:06:17.307Z] Caused by: [2024-04-09T21:06:17.307Z] java.lang.RuntimeException: Check op on KRaft Migration ZNode failed. Expected zkVersion = 5. This indicates that another KRaft controller is making writes to ZooKeeper. [2024-04-09T21:06:17.307Z] at kafka.zk.KafkaZkClient.handleUnwrappedMigrationResult$1(KafkaZkClient.scala:2001) [2024-04-09T21:06:17.307Z] ... 22 more [2024-04-09T21:06:17.307Z] {code} > Flaky test ZkMigrationIntegrationTest.testMigrateTopicDeletions > --------------------------------------------------------------- > > Key: KAFKA-15793 > URL: https://issues.apache.org/jira/browse/KAFKA-15793 > Project: Kafka > Issue Type: Test > Affects Versions: 3.7.0, 3.6.1 > Reporter: Divij Vaidya > Assignee: David Arthur > Priority: Major > Labels: flaky-test > Fix For: 3.7.0, 3.6.1 > > Attachments: Screenshot 2023-11-06 at 11.30.06.png > > > The tests have been flaky since they were introduced in > [https://github.com/apache/kafka/pull/14545] (see picture attached). > The stack traces for the flakiness can be found at > [https://ge.apache.org/scans/tests?search.relativeStartTime=P28D&search.rootProjectNames=kafka&search.tags=trunk&search.timeZoneId=Europe%2FBerlin&tests.container=kafka.zk.ZkMigrationIntegrationTest] > -- This message was sent by Atlassian Jira (v8.20.10#820010)