[ 
https://issues.apache.org/jira/browse/CASSANDRA-8437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498236#comment-14498236
 ] 

Sylvain Lebresne commented on CASSANDRA-8437:
---------------------------------------------

What I'm saying is that with {{IN}}, a single call to {{SP.fetchRows}} ends up 
doing more than one internal query. What we expose currently is 1) the number 
of calls to {{SP.fetchRows}} and 2) the total number of mismatches, which you 
can have more than one per call to {{SP.fetchRows}}. And so those shouldn't be 
compared. That still left 2 possibilities:
# compare the number of calls to {{SP.fetchRows}} to the number of time at 
least one mismatch happens during such call. That's what the current patch does.
# compare the number of internal queries with the number of total mismatches.

Both are valid solution, but I feel that's 2) is a bit more natural. And If 
I've suggested 1) before, I've just changed my mind.


> Track digest mismatch ratio
> ---------------------------
>
>                 Key: CASSANDRA-8437
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-8437
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Sylvain Lebresne
>            Assignee: Benjamin Lerer
>            Priority: Minor
>             Fix For: 2.1.5
>
>         Attachments: CASSANDRA-8437-V2.txt, CASSANDRA-8437.txt
>
>
> I don't believe we track how often read results in a digest mismatch but we 
> should since that could directly impact read performance in practice.
> Once we have that data, it might be that some workloads (write heavy most 
> likely) ends up with enough mismatches that going to the data read is more 
> efficient in practice. What we do about it it step 2 however, but getting the 
> data is easy enough.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to