[ https://issues.apache.org/jira/browse/CASSANDRA-17081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17437438#comment-17437438 ]
David Capwell commented on CASSANDRA-17081: ------------------------------------------- Looking into CASSANDRA-17085 more I actually believe that this patch isn't the solution. When used to allow bootstrap to fail without shutting down the JVM, but something changed where we not shutdown (no non-daemon threads up); this then causes a race condition with this test as we only pass IFF we think node1 is up before node3 finishes shutting down. I am trying to figure out the before/after threads to see if this does in fact related to this ticket; as this ticket was stable 2 months back... after I know more about CASSANDRA-17085 I can confirm if this is in fact impacting this test causing it to be flaky. > Fix test: > bootstrap_test.py::TestBootstrap::test_bootstrap_with_reset_bootstrap_state > ------------------------------------------------------------------------------------- > > Key: CASSANDRA-17081 > URL: https://issues.apache.org/jira/browse/CASSANDRA-17081 > Project: Cassandra > Issue Type: Bug > Components: Test/dtest/python > Reporter: Josh McKenzie > Assignee: David Capwell > Priority: Normal > Fix For: NA > > > Seeing in circle and locally on trunk: > Looks like it's timing out waiting for the bootstrap to complete. > {code:java} > test_bootstrap_with_reset_bootstrap_state failed (1 runs remaining out of 2). > <class 'ccmlib.node.TimeoutError'> > 28 Oct 2021 19:03:53 [node3] after 120.39/120 seconds Missing: > ['127.0.0.1:7000.* is now UP'] not found in system.log: > Head: ERROR [Stream-Deserializer-/127.0.0.1:7000-20b885c > Tail: ...b336de0e72/nb-1-big-Data.db > ERROR [Stream-Deserializer-/127.0.0.1:7000-29a7cdb5] 2021-10-28 15:01:36,578 > StorageService.java:483 - Stopping gossiper > [<TracebackEntry > /Users/jmckenzie/src/cassandra-dtest/bootstrap_test.py:483> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:895> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:664> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:588> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:56>] > test_bootstrap_with_reset_bootstrap_state failed; it passed 0 out of the > required 1 times. > <class 'ccmlib.node.TimeoutError'> > 28 Oct 2021 19:08:23 [node3] after 120.41/120 seconds Missing: > ['127.0.0.1:7000.* is now UP'] not found in system.log: > Head: > Tail: ... > [<TracebackEntry > /Users/jmckenzie/src/cassandra-dtest/bootstrap_test.py:483> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:895> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:664> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:588> > <TracebackEntry /Users/jmckenzie/src/ccm/ccmlib/node.py:56>] > {code} > -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org