[ https://issues.apache.org/jira/browse/CASSANDRA-5393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13619597#comment-13619597 ]
Jason Brown commented on CASSANDRA-5393:
----------------------------------------

We've got an idea we're testing out here, and will hopefully post a patch in a day or so.

> Add an Ack/Retry for merkle tree sending
> ----------------------------------------
>
>                 Key: CASSANDRA-5393
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5393
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Jeremiah Jordan
>            Assignee: Jason Brown
>
> Can we add an Ack/Retry around passing merkle trees around in repair? If the
> following send fails, the repair hangs forever on the coordinating node.
> https://github.com/apache/cassandra/blob/cassandra-1.1.10/src/java/org/apache/cassandra/service/AntiEntropyService.java#L242
> {noformat}
>     Message message = TreeResponseVerbHandler.makeVerb(local, validator);
>     if (!validator.request.endpoint.equals(FBUtilities.getBroadcastAddress()))
>         logger.info(String.format("[repair #%s] Sending completed merkle tree to %s for %s", validator.request.sessionid, validator.request.endpoint, validator.request.cf));
>     ms.sendOneWay(message, validator.request.endpoint);
> {noformat}
> If the message asking for merkle trees gets lost, the coordinating node hangs
> forever as well.
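
For illustration only, here is a minimal sketch of the ack/retry idea the issue asks for. It is not the patch mentioned above and does not use Cassandra's messaging API. The class AckRetrySketch, the method sendWithRetry, and the sendOnce callback are all hypothetical names. The assumption is that each send returns a future that completes with true when the receiver acknowledges the message.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.Supplier;

// Hypothetical helper, not part of Cassandra: bounded retry around an acknowledged send.
final class AckRetrySketch {
    private AckRetrySketch() {}

    /**
     * Invokes sendOnce up to maxAttempts times and waits at most timeoutMillis
     * for each acknowledgement. Returns true as soon as one attempt is acked,
     * and false if every attempt times out or fails, so the caller can fail
     * the repair session instead of waiting forever.
     */
    static boolean sendWithRetry(Supplier<CompletableFuture<Boolean>> sendOnce,
                                 int maxAttempts,
                                 long timeoutMillis) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            // Assumption: sendOnce transmits the message and returns a future for the ack.
            CompletableFuture<Boolean> ack = sendOnce.get();
            try {
                if (Boolean.TRUE.equals(ack.get(timeoutMillis, TimeUnit.MILLISECONDS))) {
                    return true;
                }
            } catch (TimeoutException | ExecutionException e) {
                // No ack in time, or the send failed: abandon this attempt and retry.
                ack.cancel(true);
            } catch (InterruptedException e) {
                // Preserve the interrupt status and stop retrying.
                Thread.currentThread().interrupt();
                return false;
            }
        }
        return false;
    }
}
```

With something along these lines, the sender of the merkle tree (or of the tree request) would report a failure after the final attempt rather than leaving the coordinating node waiting. A real implementation would also need the receiver to tolerate duplicate messages, since a retry may follow a send whose ack was merely delayed.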