[ https://issues.apache.org/jira/browse/CASSANDRA-8437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498236#comment-14498236 ]
Sylvain Lebresne commented on CASSANDRA-8437: --------------------------------------------- What I'm saying is that with {{IN}}, a single call to {{SP.fetchRows}} ends up doing more than one internal query. What we expose currently is 1) the number of calls to {{SP.fetchRows}} and 2) the total number of mismatches, which you can have more than one per call to {{SP.fetchRows}}. And so those shouldn't be compared. That still left 2 possibilities: # compare the number of calls to {{SP.fetchRows}} to the number of time at least one mismatch happens during such call. That's what the current patch does. # compare the number of internal queries with the number of total mismatches. Both are valid solution, but I feel that's 2) is a bit more natural. And If I've suggested 1) before, I've just changed my mind. > Track digest mismatch ratio > --------------------------- > > Key: CASSANDRA-8437 > URL: https://issues.apache.org/jira/browse/CASSANDRA-8437 > Project: Cassandra > Issue Type: Improvement > Reporter: Sylvain Lebresne > Assignee: Benjamin Lerer > Priority: Minor > Fix For: 2.1.5 > > Attachments: CASSANDRA-8437-V2.txt, CASSANDRA-8437.txt > > > I don't believe we track how often read results in a digest mismatch but we > should since that could directly impact read performance in practice. > Once we have that data, it might be that some workloads (write heavy most > likely) ends up with enough mismatches that going to the data read is more > efficient in practice. What we do about it it step 2 however, but getting the > data is easy enough. -- This message was sent by Atlassian JIRA (v6.3.4#6332)