New Failures from Build Lead Week 6:

*** CASSANDRA-18021 - Flaky org.apache.cassandra.distributed.test.ReprepareTestOldBehaviour#testReprepareMixedVersionWithoutReset
- This existing ticket has been linked in Butler to new failures on 3.11

*** CASSANDRA-17608 - Fix testMetricsWithRebuildAndStreamingToTwoNodes
- Re-opened as intermittent failure occurred in build 1445 on trunk

Several new failures had only a single occurrence; no new tickets were opened during this time.

On Fri, Feb 10, 2023 at 12:44 AM Claude Warren, Jr via dev <dev@cassandra.apache.org> wrote:

> New Failures from Build Lead Week 5
>
> *** CASSANDRA-18198 - "AttributeError: module 'py' has no attribute 'io'"
> reported in multiple tests
> - reported in 4.1, 3.11, and 3.0
> - identified as a possible class loader issue associated with
> CASSANDRA-18150
>
> *** CASSANDRA-18191 - Native Transport SSL tests failing
> - TestNativeTransportSSL.test_connect_to_ssl and
> TestNativeTransportSSL.test_connect_to_ssl (novnode)
> - TestNativeTransportSSL.test_connect_to_ssl_optional and
> TestNativeTransportSSL.test_connect_to_ssl_optional (novnode)
>
>
> On Mon, Jan 23, 2023 at 10:10 PM Caleb Rackliffe <calebrackli...@gmail.com>
> wrote:
>
>> New failures from Build Lead Week 4:
>>
>> *** CASSANDRA-18188 - Test failure in
>> upgrade_tests.cql_tests.cls.test_limit_ranges
>> - trunk
>> - AttributeError: module 'py' has no attribute 'io'
>>
>> *** CASSANDRA-18189 - Test failure in
>> cqlsh_tests.test_cqlsh_copy.TestCqlshCopy.test_bulk_round_trip_with_timeouts
>> - 4.0
>> - assert 100000 == 94764
>> - other failures currently open in this test class, but at least
>> superficially, different errors (see CASSANDRA-17322, CASSANDRA-18162)
>>
>> Timeouts continue to manifest in many places.
>>
>> On Sun, Jan 15, 2023 at 6:02 AM Mick Semb Wever <m...@apache.org> wrote:
>>
>>> *** The Butler (Build Lead)
>>>>
>>>> The introduction of Butler and the Build Lead was a wonderful
>>>> improvement to our CI efforts. It has brought a lot of hygiene in
>>>> listing out flakies as they happened. Noted that this has in turn
>>>> increased the burden in getting our major releases out, but that's to
>>>> be seen as a one-off cost.
>>>>
>>>
>>>
>>> New Failures from Build Lead Week 3.
>>>
>>>
>>> *** CASSANDRA-18156 –
>>> repair_tests.deprecated_repair_test.TestDeprecatedRepairNotifications.test_deprecated_repair_error_notification
>>> - AssertionError: Node logs don't have an error message for the failed
>>> repair
>>> - hard regression
>>> - 3.0, 3.11
>>>
>>> *** CASSANDRA-18164 – CASTest Message serializedSize(12) does not match
>>> what was written with serialize(out, 12) for verb
>>> PAXOS2_COMMIT_AND_PREPARE_RSP
>>> - serializer class org.apache.cassandra.net.Message$Serializer;
>>> expected 1077, actual 1079
>>> - 4.1, trunk
>>>
>>> *** CASSANDRA-18158 –
>>> org.apache.cassandra.distributed.upgrade.MixedModeReadTest.mixedModeReadColumnSubsetDigestCheck
>>> - Cannot achieve consistency level ALL
>>> - 3.11, trunk
>>>
>>> *** CASSANDRA-18159 – repair_tests.repair_test.TestRepair.test_*dc_repair
>>> - AssertionError: null
>>> in MemtablePool$SubPool.released(MemtablePool.java:193)
>>> - 3.11, 4.0, 4.1, trunk
>>>
>>> *** CASSANDRA-18160 –
>>> cdc_test.TestCDC.test_insertion_and_commitlog_behavior_after_reaching_cdc_total_space
>>> - Found orphaned index file in after CDC state not in former
>>> - 4.1, trunk
>>>
>>> *** CASSANDRA-18161 –
>>> org.apache.cassandra.transport.CQLConnectionTest.handleCorruptionOfLargeMessageFrame
>>> - AssertionFailedError in
>>> CQLConnectionTest.testFrameCorruption(CQLConnectionTest.java:491)
>>> - 4.0, 4.1, trunk
>>>
>>> *** CASSANDRA-18162 –
>>> cqlsh_tests.test_cqlsh_copy.TestCqlshCopy.test_bulk_round_trip_non_prepared_statements
>>> - Inet address 127.0.0.3:7000
>>> is not available: [Errno 98] Address already in use
>>> - 3.0, 3.11, 4.0, 4.1, trunk
>>>
>>> *** CASSANDRA-18163 –
>>> transient_replication_test.TestTransientReplicationRepairLegacyStreaming.test_speculative_write_repair_cycle
>>> - AssertionError Incoming stream entireSSTable
>>> - 4.0, 4.1, trunk
>>>
>>>
>>> While writing these up, some thoughts…
>>> - While Butler reports failures against multiple branches, there's no
>>> feedback/sync that the ticket needs its fixVersions updated when failures
>>> happen in other branches after the ticket is created.
>>> - In 4.0 onwards, a majority of the failures are timeouts (>900s),
>>> reinforcing that the current main problem we are facing in ci-cassandra.a.o
>>> is saturation/infra
>>>