Hugo Torralbo created CASSANDRA-18814:
-----------------------------------------

             Summary: Repair hangs on Cassandra 4.0.11
                 Key: CASSANDRA-18814
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-18814
             Project: Cassandra
          Issue Type: Bug
          Components: Consistency/Repair
            Reporter: Hugo Torralbo
         Attachments: wcdb0-debug.log, wcdb1-debug.log, wcdb2-debug.log, 
wcdb3-debug.log

When we run a full repair on Cassandra 4.0.11, it hangs and doesn't evolve. 
What we noticed was this message:

_{*}"{*}WARN 
[Messaging-OUT-/214.5.143.5:7001->/214.5.143.4:7001-LARGE_MESSAGES] 2023-08-18 
07:02:54,862 OutboundConnection.java:488 - /214.5. 
143.5:7001->/214.5.143.4:7001-LARGE_MESSAGES-[no-channel] *dropping message of 
type VALIDATION_RSP due to error*_
_{*}java.nio.channels.ClosedChannelException{*}: null"_ 

in one of the nodes, in the validate phase the merkle tree.

Besides that, I found some connection reset , but we do not know if there is a 
relation it.

_18 07:02:54,860 OutboundConnection.java:1056 - 
/214.5.143.5:7001->/214.5.143.4:7001-LARGE_MESSAGES-94900b4d channel closed by 
provider
io.netty.channel.unix.Errors$NativeIoException: readAddress(..) failed: 
Connection reset by peer_

 

I have uploaded logs from all nodes 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

Reply via email to