The last time paxos v2 blocked us in CASSANDRA-19617 which also
affected 4.1, I didn't get a sense of strong usage from the community,
so I agree that RC shouldn't be blocked but this can get fixed before
GA.  +1 from me.

Kind Regards,
Brandon

On Tue, Jun 25, 2024 at 11:11 PM Jon Haddad <j...@jonhaddad.com> wrote:
>
> 5.0 is a massive milestone.  A huge thank you to everyone that's invested 
> their time into the release.  I've done a lot of testing, benchmarking, and 
> tire kicking and it's truly mind blowing how much has gone into 5.0 and how 
> great it is for the community.
>
> I am a bit concerned that CASSANDRA-19668, which I found in 4.1, will also 
> affect 5.0.  This is a pretty serious bug, where using Paxos v2 + off heap 
> memtables can cause a SIGSEV process crash.  I've seen this happen about a 
> dozen times with a client over the last 3 months.  Since the new trie 
> memtables rely on off heap, and both Trie memtables & Paxos V2 is so 
> compelling (esp for multi-dc users), I think there's a good chance that we'll 
> be making an already bad problem even worse, for folks that use LWT.
>
> Unfortunately, until next week I'm unable to put any time into this; I'm on 
> vacation with my family.  I wish I had been able to confirm and raise this 
> issue as a 5.0 blocker sooner, but I've deliberately tried to keep work stuff 
> out of my mind.   Since I'm not 100% sure if this affects 5.0, I'm not 
> blocking the RC, but I don't feel comfortable putting a +1 on a release that 
> I'm at least 80% certain contains a process-crashing bug.
>
> I have a simple 4.1 patch in CASSANDRA-19668, but I haven't landed a commit 
> in several years and I have zero recollection of the entire process of 
> getting it in, nor have I spent any time writing unit or dtests in the C* 
> repo.  I ran a test of 160MM LWTs over several hours with my 4.1 branch and 
> didn't hit any issues, but my client ran for weeks without hitting it so it's 
> hard to say if I've actually addressed the problem, as it's a rare race 
> condition.  Fwiw, I don't need to be the one to handle CASSANDRA-19668, so if 
> someone wants to address it before me, please feel free.  It will likely take 
> me a lot longer to deal with than someone more involved with the process, and 
> I'd want 2 sets of eyes on it anyways given what I already mentioned 
> previously about committing and testing.
>
> Jon
>
>
> On Tue, Jun 25, 2024 at 2:53 PM Mick Semb Wever <m...@apache.org> wrote:
>>
>>
>>
>> .
>>
>>> Proposing the test build of Cassandra 5.0-rc1 for release.
>>>
>>> sha1: b43f0b2e9f4cb5105764ef9cf4ece404a740539a
>>> Git: https://github.com/apache/cassandra/tree/5.0-rc1-tentative
>>> Maven Artifacts: 
>>> https://repository.apache.org/content/repositories/orgapachecassandra-1336/org/apache/cassandra/cassandra-all/5.0-rc1/
>>
>>
>>
>> The three green CI runs for this are
>> - https://app.circleci.com/pipelines/github/driftx/cassandra?branch=5.0-rc1-2
>> - https://app.circleci.com/pipelines/github/driftx/cassandra?branch=5.0-rc1-3
>> - https://app.circleci.com/pipelines/github/driftx/cassandra?branch=5.0-rc1-4
>>
>>

Reply via email to