Apologies, I duplicated KAFKA-16157 twice in my previous message. I intended to 
mention KAFKA-16195
with the PR at https://github.com/apache/kafka/pull/15262 as the second JIRA.

Thanks,
Gaurav

> On 26 Jan 2024, at 15:34, ka...@gnarula.com wrote:
> 
> Hi Stan,
> 
> I wanted to share some updates about the bugs you shared earlier.
> 
> - KAFKA-14616: I've reviewed and tested the PR from Colin and have observed
> the fix works as intended.
> - KAFKA-16162: I reviewed Proven's PR and found some gaps in the proposed 
> fix. I've
> therefore raised https://github.com/apache/kafka/pull/15270 following a 
> discussion with Luke in JIRA.
> - KAFKA-16082: I don't think this is marked as a blocker anymore. I'm awaiting
> feedback/reviews at https://github.com/apache/kafka/pull/15136
> 
> In addition to the above, there are 2 JIRAs I'd like to bring everyone's 
> attention to:
> 
> - KAFKA-16157: This is similar to KAFKA-14616 and is marked as a blocker. 
> I've raised
> https://github.com/apache/kafka/pull/15263 and am awaiting reviews on it.
> - KAFKA-16157: I raised this yesterday and have addressed feedback from Luke. 
> This should
> hopefully get merged soon.
> 
> Regards,
> Gaurav
> 
> 
>> On 24 Jan 2024, at 11:51, ka...@gnarula.com wrote:
>> 
>> Hi Stanislav,
>> 
>> Thanks for bringing these JIRAs/PRs up.
>> 
>> I'll be testing the open PRs for KAFKA-14616 and KAFKA-16162 this week and I 
>> hope to have some feedback
>> by Friday. I gather the latter JIRA is marked as a WIP by Proven and he's 
>> away. I'll try to build on his work in the meantime.
>> 
>> As for KAFKA-16082, we haven't been able to deduce a data loss scenario. 
>> There's a PR open
>> by me for promoting an abandoned future replica with approvals from Omnia 
>> and Proven,
>> so I'd appreciate a committer reviewing it.
>> 
>> Regards,
>> Gaurav
>> 
>> On 23 Jan 2024, at 20:17, Stanislav Kozlovski 
>> <stanis...@confluent.io.INVALID> wrote:
>>> 
>>> Hey all, I figured I'd give an update about what known blockers we have
>>> right now:
>>> 
>>> - KAFKA-16101: KRaft migration rollback documentation is incorrect -
>>> https://github.com/apache/kafka/pull/15193; This need not block RC
>>> creation, but we need the docs updated so that people can test properly
>>> - KAFKA-14616: Topic recreation with offline broker causes permanent URPs -
>>> https://github.com/apache/kafka/pull/15230 ; I am of the understanding that
>>> this is blocking JBOD for 3.7
>>> - KAFKA-16162: New created topics are unavailable after upgrading to 3.7 -
>>> a strict blocker with an open PR https://github.com/apache/kafka/pull/15232
>>> - although I understand Proveen is out of office
>>> - KAFKA-16082: JBOD: Possible dataloss when moving leader partition - I am
>>> hearing mixed opinions on whether this is a blocker (
>>> https://github.com/apache/kafka/pull/15136)
>>> 
>>> Given that there are 3 JBOD blocker bugs, and I am not confident they will
>>> all be merged this week - I am on the edge of voting to revert JBOD from
>>> this release, or mark it early access.
>>> 
>>> By all accounts, it seems that if we keep with JBOD the release will have
>>> to spill into February, which is a month extra from the time-based release
>>> plan we had of start of January.
>>> 
>>> Can I ask others for an opinion?
>>> 
>>> Best,
>>> Stan
>>> 
>>> On Thu, Jan 18, 2024 at 1:21 PM Luke Chen <show...@gmail.com> wrote:
>>> 
>>>> Hi all,
>>>> 
>>>> I think I've found another blocker issue: KAFKA-16162
>>>> <https://issues.apache.org/jira/browse/KAFKA-16162> .
>>>> The impact is after upgrading to 3.7.0, any new created topics/partitions
>>>> will be unavailable.
>>>> I've put my findings in the JIRA.
>>>> 
>>>> Thanks.
>>>> Luke
>>>> 
>>>> On Thu, Jan 18, 2024 at 9:50 AM Matthias J. Sax <mj...@apache.org> wrote:
>>>> 
>>>>> Stan, thanks for driving this all forward! Excellent job.
>>>>> 
>>>>> About
>>>>> 
>>>>>> StreamsStandbyTask - https://issues.apache.org/jira/browse/KAFKA-16141
>>>>>> StreamsUpgradeTest - https://issues.apache.org/jira/browse/KAFKA-16139
>>>>> 
>>>>> For `StreamsUpgradeTest` it was a test setup issue and should be fixed
>>>>> now in trunk and 3.7 (and actually also in 3.6...)
>>>>> 
>>>>> For `StreamsStandbyTask` the failing test exposes a regression bug, so
>>>>> it's a blocker. I updated the ticket accordingly. We already have an
>>>>> open PR that reverts the code introducing the regression.
>>>>> 
>>>>> 
>>>>> -Matthias
>>>>> 
>>>>> On 1/17/24 9:44 AM, Proven Provenzano wrote:
>>>>>> We have another blocking issue for the RC :
>>>>>> https://issues.apache.org/jira/browse/KAFKA-16157. This bug is similar
>>>>> to
>>>>>> https://issues.apache.org/jira/browse/KAFKA-14616. The new issue
>>>> however
>>>>>> can lead to the new topic having partitions that a producer cannot
>>>> write
>>>>> to.
>>>>>> 
>>>>>> --Proven
>>>>>> 
>>>>>> On Tue, Jan 16, 2024 at 12:04 PM Proven Provenzano <
>>>>> pprovenz...@confluent.io>
>>>>>> wrote:
>>>>>> 
>>>>>>> 
>>>>>>> I have a PR https://github.com/apache/kafka/pull/15197 for
>>>>>>> https://issues.apache.org/jira/browse/KAFKA-16131 that is building
>>>> now.
>>>>>>> --Proven
>>>>>>> 
>>>>>>> On Mon, Jan 15, 2024 at 5:03 AM Jakub Scholz <ja...@scholz.cz> wrote:
>>>>>>> 
>>>>>>>> *> Hi Jakub,> > Thanks for trying the RC. I think what you found is a
>>>>>>>> blocker bug because it *
>>>>>>>> *> will generate huge amount of logspam. I guess we didn't find it in
>>>>>>>> junit
>>>>>>>> tests *
>>>>>>>> *> since logspam doesn't fail the automated tests. But certainly it's
>>>>> not
>>>>>>>> suitable *
>>>>>>>> *> for production. Did you file a JIRA yet?*
>>>>>>>> 
>>>>>>>> Hi Colin,
>>>>>>>> 
>>>>>>>> I opened https://issues.apache.org/jira/browse/KAFKA-16131.
>>>>>>>> 
>>>>>>>> Thanks & Regards
>>>>>>>> Jakub
>>>>>>>> 
>>>>>>>> On Mon, Jan 15, 2024 at 8:57 AM Colin McCabe <cmcc...@apache.org>
>>>>> wrote:
>>>>>>>> 
>>>>>>>>> Hi Stanislav,
>>>>>>>>> 
>>>>>>>>> Thanks for making the first RC. The fact that it's titled RC2 is
>>>>> messing
>>>>>>>>> with my mind a bit. I hope this doesn't make people think that we're
>>>>>>>>> farther along than we are, heh.
>>>>>>>>> 
>>>>>>>>> On Sun, Jan 14, 2024, at 13:54, Jakub Scholz wrote:
>>>>>>>>>> *> Nice catch! It does seem like we should have gated this behind
>>>> the
>>>>>>>>>> metadata> version as KIP-858 implies. Is the cluster configured
>>>> with
>>>>>>>>>> multiple log> dirs? What is the impact of the error messages?*
>>>>>>>>>> 
>>>>>>>>>> I did not observe any obvious impact. I was able to send and
>>>> receive
>>>>>>>>>> messages as normally. But to be honest, I have no idea what else
>>>>>>>>>> this might impact, so I did not try anything special.
>>>>>>>>>> 
>>>>>>>>>> I think everyone upgrading an existing KRaft cluster will go
>>>> through
>>>>>>>> this
>>>>>>>>>> stage (running Kafka 3.7 with an older metadata version for at
>>>> least
>>>>> a
>>>>>>>>>> while). So even if it is just a logged exception without any other
>>>>>>>>> impact I
>>>>>>>>>> wonder if it might scare users from upgrading. But I leave it to
>>>>>>>> others
>>>>>>>>> to
>>>>>>>>>> decide if this is a blocker or not.
>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> Hi Jakub,
>>>>>>>>> 
>>>>>>>>> Thanks for trying the RC. I think what you found is a blocker bug
>>>>>>>> because
>>>>>>>>> it will generate huge amount of logspam. I guess we didn't find it
>>>> in
>>>>>>>> junit
>>>>>>>>> tests since logspam doesn't fail the automated tests. But certainly
>>>>> it's
>>>>>>>>> not suitable for production. Did you file a JIRA yet?
>>>>>>>>> 
>>>>>>>>>> On Sun, Jan 14, 2024 at 10:17 PM Stanislav Kozlovski
>>>>>>>>>> <stanis...@confluent.io.invalid> wrote:
>>>>>>>>>> 
>>>>>>>>>>> Hey Luke,
>>>>>>>>>>> 
>>>>>>>>>>> This is an interesting problem. Given the fact that the KIP for
>>>>>>>> having a
>>>>>>>>>>> 3.8 release passed, I think it weights the scale towards not
>>>> calling
>>>>>>>>> this a
>>>>>>>>>>> blocker and expecting it to be solved in 3.7.1.
>>>>>>>>>>> 
>>>>>>>>>>> It is unfortunate that it would not seem safe to migrate to KRaft
>>>> in
>>>>>>>>> 3.7.0
>>>>>>>>>>> (given the inability to rollback safely), but if that's true - the
>>>>>>>> same
>>>>>>>>>>> case would apply for 3.6.0. So in any case users w\ould be
>>>> expected
>>>>>>>> to
>>>>>>>>> use a
>>>>>>>>>>> patch release for this.
>>>>>>>>> 
>>>>>>>>> Hi Luke,
>>>>>>>>> 
>>>>>>>>> Thanks for testing rollback. I think this is a case where the
>>>>>>>>> documentation is wrong. The intention was to for the steps to
>>>>> basically
>>>>>>>> be:
>>>>>>>>> 
>>>>>>>>> 1. roll all the brokers into zk mode, but with migration enabled
>>>>>>>>> 2. take down the kraft quorum
>>>>>>>>> 3. rmr /controller, allowing a hybrid broker to take over.
>>>>>>>>> 4. roll all the brokers into zk mode without migration enabled (if
>>>>>>>> desired)
>>>>>>>>> 
>>>>>>>>> With these steps, there isn't really unavailability since a ZK
>>>>>>>> controller
>>>>>>>>> can be elected quickly after the kraft quorum is gone.
>>>>>>>>> 
>>>>>>>>>>> Further, since we will have a 3.8 release - it is
>>>>>>>>>>> likely we will ultimately recommend users upgrade from that
>>>> version
>>>>>>>>> given
>>>>>>>>>>> its aim is to have strategic KRaft feature parity with ZK.
>>>>>>>>>>> That being said, I am not 100% on this. Let me know whether you
>>>>> think
>>>>>>>>> this
>>>>>>>>>>> should block the release, Luke. I am also tagging Colin and David
>>>> to
>>>>>>>>> weigh
>>>>>>>>>>> in with their opinions, as they worked on the migration logic.
>>>>>>>>> 
>>>>>>>>> The rollback docs are new in 3.7 so the fact that they're wrong is a
>>>>>>>> clear
>>>>>>>>> blocker, I think. But easy to fix, I believe. I will create a PR.
>>>>>>>>> 
>>>>>>>>> best,
>>>>>>>>> Colin
>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> Hey Kirk and Chris,
>>>>>>>>>>> 
>>>>>>>>>>> Unless I'm missing something - KAFKALESS-16029 is simply a bad log
>>>>>>>> due
>>>>>>>>> to
>>>>>>>>>>> improper closing. And the PR description implies this has been
>>>>>>>> present
>>>>>>>>>>> since 3.5. While annoying, I don't see a strong reason for this to
>>>>>>>> block
>>>>>>>>>>> the release.
>>>>>>>>>>> 
>>>>>>>>>>> Hey Jakub,
>>>>>>>>>>> 
>>>>>>>>>>> Nice catch! It does seem like we should have gated this behind the
>>>>>>>>> metadata
>>>>>>>>>>> version as KIP-858 implies. Is the cluster configured with
>>>> multiple
>>>>>>>> log
>>>>>>>>>>> dirs? What is the impact of the error messages?
>>>>>>>>>>> 
>>>>>>>>>>> Tagging Igor (the author of the KIP) to weigh in.
>>>>>>>>>>> 
>>>>>>>>>>> Best,
>>>>>>>>>>> Stanislav
>>>>>>>>>>> 
>>>>>>>>>>> On Sat, Jan 13, 2024 at 7:22 PM Jakub Scholz <ja...@scholz.cz>
>>>>>>>> wrote:
>>>>>>>>>>> 
>>>>>>>>>>>> Hi,
>>>>>>>>>>>> 
>>>>>>>>>>>> I was trying the RC2 and run into the following issue ... when I
>>>>>>>> run
>>>>>>>>>>>> 3.7.0-RC2 KRaft cluster with metadata version set to 3.6-IV2
>>>>>>>> metadata
>>>>>>>>>>>> version, I seem to be getting repeated errors like this in the
>>>>>>>>> controller
>>>>>>>>>>>> logs:
>>>>>>>>>>>> 
>>>>>>>>>>>> 2024-01-13 16:58:01,197 INFO [QuorumController id=0]
>>>>>>>>>>> assignReplicasToDirs:
>>>>>>>>>>>> event failed with UnsupportedVersionException in 15 microseconds.
>>>>>>>>>>>> (org.apache.kafka.controller.QuorumController)
>>>>>>>>>>>> [quorum-controller-0-event-handler]
>>>>>>>>>>>> 2024-01-13 16:58:01,197 ERROR [ControllerApis nodeId=0]
>>>> Unexpected
>>>>>>>>> error
>>>>>>>>>>>> handling request RequestHeader(apiKey=ASSIGN_REPLICAS_TO_DIRS,
>>>>>>>>>>>> apiVersion=0, clientId=1000, correlationId=14, headerVersion=2)
>>>> --
>>>>>>>>>>>> AssignReplicasToDirsRequestData(brokerId=1000, brokerEpoch=5,
>>>>>>>>>>>> directories=[DirectoryData(id=w_uxN7pwQ6eXSMrOKceYIQ,
>>>>>>>>>>>> topics=[TopicData(topicId=bvAKLSwmR7iJoKv2yZgygQ,
>>>>>>>>>>>> partitions=[PartitionData(partitionIndex=2),
>>>>>>>>>>>> PartitionData(partitionIndex=1)]),
>>>>>>>>>>>> TopicData(topicId=uNe7f5VrQgO0zST6yH1jDQ,
>>>>>>>>>>>> partitions=[PartitionData(partitionIndex=0)])])]) with context
>>>>>>>>>>>> 
>>>> RequestContext(header=RequestHeader(apiKey=ASSIGN_REPLICAS_TO_DIRS,
>>>>>>>>>>>> apiVersion=0, clientId=1000, correlationId=14, headerVersion=2),
>>>>>>>>>>>> connectionId='172.16.14.219:9090-172.16.14.217:53590-7',
>>>>>>>>> clientAddress=/
>>>>>>>>>>>> 172.16.14.217, principal=User:CN=my-cluster-kafka,O=io.strimzi,
>>>>>>>>>>>> listenerName=ListenerName(CONTROLPLANE-9090),
>>>> securityProtocol=SSL,
>>>>>>>>>>>> 
>>>> clientInformation=ClientInformation(softwareName=apache-kafka-java,
>>>>>>>>>>>> softwareVersion=3.7.0), fromPrivilegedListener=false,
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> principalSerde=Optional[org.apache.kafka.common.security.authenticator.DefaultKafkaPrincipalBuilder@71004ad2
>>>>>>>>>>>> ])
>>>>>>>>>>>> (kafka.server.ControllerApis) [quorum-controller-0-event-handler]
>>>>>>>>>>>> java.util.concurrent.CompletionException:
>>>>>>>>>>>> org.apache.kafka.common.errors.UnsupportedVersionException:
>>>>>>>> Directory
>>>>>>>>>>>> assignment is not supported yet.
>>>>>>>>>>>> 
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:332)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:347)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:636)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2162)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> org.apache.kafka.controller.QuorumController$ControllerWriteEvent.complete(QuorumController.java:880)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> org.apache.kafka.controller.QuorumController$ControllerWriteEvent.handleException(QuorumController.java:871)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> org.apache.kafka.queue.KafkaEventQueue$EventContext.completeWithException(KafkaEventQueue.java:148)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> org.apache.kafka.queue.KafkaEventQueue$EventContext.run(KafkaEventQueue.java:137)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> org.apache.kafka.queue.KafkaEventQueue$EventHandler.handleEvents(KafkaEventQueue.java:210)
>>>>>>>>>>>> at
>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> org.apache.kafka.queue.KafkaEventQueue$EventHandler.run(KafkaEventQueue.java:181)
>>>>>>>>>>>> at java.base/java.lang.Thread.run(Thread.java:840)
>>>>>>>>>>>> 
>>>>>>>>>>>> Caused by:
>>>>>>>> org.apache.kafka.common.errors.UnsupportedVersionException:
>>>>>>>>>>>> Directory assignment is not supported yet.
>>>>>>>>>>>> 
>>>>>>>>>>>> Is that expected? I guess with the metadata version set to
>>>>>>>> 3.6-IV2, it
>>>>>>>>>>>> makes sense that the request is not supported. But shouldn't then
>>>>>>>> the
>>>>>>>>>>>> request not be sent at all by the brokers? (I did not opened a
>>>> JIRA
>>>>>>>>> for
>>>>>>>>>>> it,
>>>>>>>>>>>> but I can open one if you agree this is not expected)
>>>>>>>>>>>> 
>>>>>>>>>>>> Thanks & Regards
>>>>>>>>>>>> Jakub
>>>>>>>>>>>> 
>>>>>>>>>>>> On Sat, Jan 13, 2024 at 8:03 AM Luke Chen <show...@gmail.com>
>>>>>>>> wrote:
>>>>>>>>>>>> 
>>>>>>>>>>>>> Hi Stanislav,
>>>>>>>>>>>>> 
>>>>>>>>>>>>> I commented in the "Apache Kafka 3.7.0 Release" thread, but
>>>> maybe
>>>>>>>>> you
>>>>>>>>>>>>> missed it.
>>>>>>>>>>>>> cross-posting here:
>>>>>>>>>>>>> 
>>>>>>>>>>>>> There is a bug KAFKA-16101
>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/KAFKA-16101> reporting
>>>>>>>> that
>>>>>>>>>>>> "Kafka
>>>>>>>>>>>>> cluster will be unavailable during KRaft migration rollback".
>>>>>>>>>>>>> The impact for this issue is that if brokers try to rollback to
>>>>>>>> ZK
>>>>>>>>> mode
>>>>>>>>>>>>> during KRaft migration process, there will be a period of time
>>>>>>>> the
>>>>>>>>>>>> cluster
>>>>>>>>>>>>> is unavailable.
>>>>>>>>>>>>> Since ZK migrating to KRaft feature is a production ready
>>>>>>>> feature, I
>>>>>>>>>>>> think
>>>>>>>>>>>>> this should be addressed soon.
>>>>>>>>>>>>> Do you think this is a blocker for v3.7.0?
>>>>>>>>>>>>> 
>>>>>>>>>>>>> Thanks.
>>>>>>>>>>>>> Luke
>>>>>>>>>>>>> 
>>>>>>>>>>>>> On Sat, Jan 13, 2024 at 8:36 AM Chris Egerton <
>>>>>>>>> fearthecel...@gmail.com
>>>>>>>>>>>> 
>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>> 
>>>>>>>>>>>>>> Thanks, Kirk!
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> @Stanislav--do you believe that this warrants a new RC?
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> On Fri, Jan 12, 2024, 19:08 Kirk True <k...@kirktrue.pro>
>>>>>>>> wrote:
>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Hi Chris/Stanislav,
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> I'm working on the 'Unable to find FetchSessionHandler' log
>>>>>>>>> problem
>>>>>>>>>>>>>>> (KAFKA-16029) and have put out a draft PR (
>>>>>>>>>>>>>>> https://github.com/apache/kafka/pull/15186). I will use the
>>>>>>>>>>>> quickstart
>>>>>>>>>>>>>>> approach as a second means to reproduce/verify while I wait
>>>>>>>> for
>>>>>>>>> the
>>>>>>>>>>>>> PR's
>>>>>>>>>>>>>>> Jenkins job to finish.
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> Thanks,
>>>>>>>>>>>>>>> Kirk
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> On Fri, Jan 12, 2024, at 11:31 AM, Chris Egerton wrote:
>>>>>>>>>>>>>>>> Hi Stanislav,
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Thanks for running this release!
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> To verify, I:
>>>>>>>>>>>>>>>> - Built from source using Java 11 with both:
>>>>>>>>>>>>>>>> - - the 3.7.0-rc2 tag on GitHub
>>>>>>>>>>>>>>>> - - the kafka-3.7.0-src.tgz artifact from
>>>>>>>>>>>>>>>> 
>>>>>>>> https://home.apache.org/~stanislavkozlovski/kafka-3.7.0-rc2/
>>>>>>>>>>>>>>>> - Checked signatures and checksums
>>>>>>>>>>>>>>>> - Ran the quickstart using both:
>>>>>>>>>>>>>>>> - - The kafka_2.13-3.7.0.tgz artifact from
>>>>>>>>>>>>>>>> 
>>>>>>>> https://home.apache.org/~stanislavkozlovski/kafka-3.7.0-rc2/
>>>>>>>>>>> with
>>>>>>>>>>>>> Java
>>>>>>>>>>>>>>> 11
>>>>>>>>>>>>>>>> and Scala 13 in KRaft mode
>>>>>>>>>>>>>>>> - - Our shiny new broker Docker image,
>>>>>>>> apache/kafka:3.7.0-rc2
>>>>>>>>>>>>>>>> - Ran all unit tests
>>>>>>>>>>>>>>>> - Ran all integration tests for Connect and MM2
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> I found two minor areas for concern:
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 1. (Possibly a blocker)
>>>>>>>>>>>>>>>> When running the quickstart, I noticed this ERROR-level log
>>>>>>>>>>> message
>>>>>>>>>>>>>> being
>>>>>>>>>>>>>>>> emitted frequently (not not every time) when I killed my
>>>>>>>>> console
>>>>>>>>>>>>>> consumer
>>>>>>>>>>>>>>>> via ctrl-C:
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> [2024-01-12 11:00:31,088] ERROR [Consumer
>>>>>>>>>>>>> clientId=console-consumer,
>>>>>>>>>>>>>>>> groupId=console-consumer-74388] Unable to find
>>>>>>>>>>> FetchSessionHandler
>>>>>>>>>>>>> for
>>>>>>>>>>>>>>> node
>>>>>>>>>>>>>>>> 1. Ignoring fetch response
>>>>>>>>>>>>>>>> (org.apache.kafka.clients.consumer.internals.AbstractFetch)
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> I see that this error message is already reported in
>>>>>>>>>>>>>>>> https://issues.apache.org/jira/browse/KAFKA-16029. I
>>>>>>>> think we
>>>>>>>>>>>> should
>>>>>>>>>>>>>>>> prioritize fixing it for this release. I know it's probably
>>>>>>>>>>> benign
>>>>>>>>>>>>> but
>>>>>>>>>>>>>>> it's
>>>>>>>>>>>>>>>> really not a good look for us when basic operations log
>>>>>>>> error
>>>>>>>>>>>>> messages,
>>>>>>>>>>>>>>> and
>>>>>>>>>>>>>>>> it may give new users some headaches.
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 2. (Probably not a blocker)
>>>>>>>>>>>>>>>> The following unit tests failed the first time around, and
>>>>>>>>> all of
>>>>>>>>>>>>> them
>>>>>>>>>>>>>>>> passed the second time I ran them:
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> - (clients)
>>>>>>>>>>>>>>> 
>>>>>>>> ClientUtilsTest.testParseAndValidateAddressesWithReverseLookup()
>>>>>>>>>>>>>>>> - (clients) SelectorTest.testConnectionsByClientMetric()
>>>>>>>>>>>>>>>> - (clients)
>>>>>>>> Tls13SelectorTest.testConnectionsByClientMetric()
>>>>>>>>>>>>>>>> - (connect)
>>>>>>>>>>>>> TopicAdminTest.retryEndOffsetsShouldRetryWhenTopicNotFound
>>>>>>>>>>>>>> (I
>>>>>>>>>>>>>>>> thought I fixed this one! 🤬🤬)
>>>>>>>>>>>>>>>> - (core)
>>>>>>>>> ProducerIdManagerTest.testUnrecoverableErrors(Errors)[2]
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Thanks again for your work on this release, and
>>>>>>>>> congratulations
>>>>>>>>>>> to
>>>>>>>>>>>>>> Kafka
>>>>>>>>>>>>>>>> Streams for having zero flaky unit tests during my
>>>>>>>>>>>>> highly-experimental
>>>>>>>>>>>>>>>> single laptop run!
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Cheers,
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> Chris
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> On Thu, Jan 11, 2024 at 1:33 PM Stanislav Kozlovski
>>>>>>>>>>>>>>>> <stanis...@confluent.io.invalid> wrote:
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Hello Kafka users, developers, and client-developers,
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> This is the first candidate for release of Apache Kafka
>>>>>>>>> 3.7.0.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Note it's named "RC2" because I had a few "failed" RCs
>>>>>>>> that
>>>>>>>>> I
>>>>>>>>>>> had
>>>>>>>>>>>>>>>>> cut/uploaded but ultimately had to scrap prior to
>>>>>>>> announcing
>>>>>>>>>>> due
>>>>>>>>>>>> to
>>>>>>>>>>>>>> new
>>>>>>>>>>>>>>>>> blockers arriving before I could even announce them.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Further - I haven't yet been able to set up the system
>>>>>>>> tests
>>>>>>>>>>>>>>> successfully.
>>>>>>>>>>>>>>>>> And the integration/unit tests do have a few failures
>>>>>>>> that I
>>>>>>>>>>> have
>>>>>>>>>>>>> to
>>>>>>>>>>>>>>> spend
>>>>>>>>>>>>>>>>> time triaging. I would appreciate any help in case anyone
>>>>>>>>>>> notices
>>>>>>>>>>>>> any
>>>>>>>>>>>>>>> tests
>>>>>>>>>>>>>>>>> failing that they're subject matters experts in. Expect
>>>>>>>> me
>>>>>>>>> to
>>>>>>>>>>>>> follow
>>>>>>>>>>>>>>> up in
>>>>>>>>>>>>>>>>> a day or two with more detailed analysis.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Major changes include:
>>>>>>>>>>>>>>>>> - Early Access to KIP-848 - the next generation of the
>>>>>>>>> consumer
>>>>>>>>>>>>>>> rebalance
>>>>>>>>>>>>>>>>> protocol
>>>>>>>>>>>>>>>>> - KIP-858: Adding JBOD support to KRaft
>>>>>>>>>>>>>>>>> - KIP-714: Observability into Client metrics via a
>>>>>>>>> standardized
>>>>>>>>>>>>>>> interface
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Check more information in the WIP blog post:
>>>>>>>>>>>>>>>>> https://github.com/apache/kafka-site/pull/578
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Release notes for the 3.7.0 release:
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> https://home.apache.org/~stanislavkozlovski/kafka-3.7.0-rc2/RELEASE_NOTES.html
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> *** Please download, test and vote by Thursday, January
>>>>>>>> 18,
>>>>>>>>> 9am
>>>>>>>>>>>> PT
>>>>>>>>>>>>>> ***
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Usually these deadlines tend to be 2-3 days, but due to
>>>>>>>> this
>>>>>>>>>>>> being
>>>>>>>>>>>>>> the
>>>>>>>>>>>>>>>>> first RC and the tests not having ran yet, I am giving
>>>>>>>> it a
>>>>>>>>> bit
>>>>>>>>>>>>> more
>>>>>>>>>>>>>>> time.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Kafka's KEYS file containing PGP keys we use to sign the
>>>>>>>>>>> release:
>>>>>>>>>>>>>>>>> https://kafka.apache.org/KEYS
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Release artifacts to be voted upon (source and binary):
>>>>>>>>>>>>>>>>> 
>>>>>>>>> https://home.apache.org/~stanislavkozlovski/kafka-3.7.0-rc2/
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Docker release artifact to be voted upon:
>>>>>>>>>>>>>>>>> apache/kafka:3.7.0-rc2
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Maven artifacts to be voted upon:
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>> 
>>>> https://repository.apache.org/content/groups/staging/org/apache/kafka/
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Javadoc:
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>> 
>>>> https://home.apache.org/~stanislavkozlovski/kafka-3.7.0-rc2/javadoc/
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Tag to be voted upon (off 3.7 branch) is the 3.7.0 tag:
>>>>>>>>>>>>>>>>> https://github.com/apache/kafka/releases/tag/3.7.0-rc2
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Documentation:
>>>>>>>>>>>>>>>>> https://kafka.apache.org/37/documentation.html
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Protocol:
>>>>>>>>>>>>>>>>> https://kafka.apache.org/37/protocol.html
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Successful Jenkins builds for the 3.7 branch:
>>>>>>>>>>>>>>>>> Unit/integration tests:
>>>>>>>>>>>>>>>>> 
>>>>>>>>> https://ci-builds.apache.org/job/Kafka/job/kafka/job/3.7/58/
>>>>>>>>>>>>>>>>> There are failing tests here. I have to follow up with
>>>>>>>>> triaging
>>>>>>>>>>>>> some
>>>>>>>>>>>>>> of
>>>>>>>>>>>>>>>>> the failures and figuring out if they're actual problems
>>>>>>>> or
>>>>>>>>>>>> simply
>>>>>>>>>>>>>>> flakes.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> System tests:
>>>>>>>>>>>>>>> https://jenkins.confluent.io/job/system-test-kafka/job/3.7/
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> No successful system test runs yet. I am working on
>>>>>>>> getting
>>>>>>>>> the
>>>>>>>>>>>> job
>>>>>>>>>>>>>> to
>>>>>>>>>>>>>>> run.
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> * Successful Docker Image Github Actions Pipeline for 3.7
>>>>>>>>>>> branch:
>>>>>>>>>>>>>>>>> Attached are the scan_report and report_jvm output files
>>>>>>>>> from
>>>>>>>>>>> the
>>>>>>>>>>>>>>> Docker
>>>>>>>>>>>>>>>>> Build run:
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>> 
>>>>> https://github.com/apache/kafka/actions/runs/7486094960/job/20375761673
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> And the final docker image build job - Docker Build Test
>>>>>>>>>>>> Pipeline:
>>>>>>>>>>>>>>>>> https://github.com/apache/kafka/actions/runs/7486178277
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> The image is apache/kafka:3.7.0-rc2 -
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>> 
>>>> https://hub.docker.com/layers/apache/kafka/3.7.0-rc2/images/sha256-5b4707c08170d39549fbb6e2a3dbb83936a50f987c0c097f23cb26b4c210c226?context=explore
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> /**************************************
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>>> Thanks,
>>>>>>>>>>>>>>>>> Stanislav Kozlovski
>>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>>> 
>>>>>>>>>>>>>> 
>>>>>>>>>>>>> 
>>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> 
>>>>>>>>>>> --
>>>>>>>>>>> Best,
>>>>>>>>>>> Stanislav
>>>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> 
>>>>>>> 
>>>>>> 
>>>>> 
>>>> 
>>> 
>>> 
>>> -- 
>>> Best,
>>> Stanislav
>> 
> 

Reply via email to