[ 
https://issues.apache.org/jira/browse/CASSANDRA-10661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15113051#comment-15113051
 ] 

DOAN DuyHai commented on CASSANDRA-10661:
-----------------------------------------

Hello [~xedin], it's me again.

 I've had some discussion with search people and they told me that wildcard 
searches (name like "\*xxxxx\*") are very expensive. Classical data structure 
like suffix trees are adapted for suffix searching (name like "xxx\*"). For 
prefix search (name like "\*xxx") they're creating a *reversed* index. Does it 
mean that the CONTAINS mode (formerly named SUFFIX) is more expensive than the 
NORMAL search mode ? If yes, how much expensive is it (x2 ? order of magnitude 
?)

 Second question, more related to the impl, since you query the nodes following 
the token range and do not hit all nodes like normal secondary index, does it 
imply that *sorting*  (ORDER BY) is no longer relevant since you do not 
retrieve all possible results ? (I've seen in QueryPlan.MAX_ROWS that there is 
a hard-coded limit of 10 000 results)

 Sorry to annoy you with my questions but they are important so that we, 
evangelists, can give the right use-cases for users and especially deter them 
from mis-using SASI when it's not appropriate or when the search cost is 
prohibitive.

> Integrate SASI to Cassandra
> ---------------------------
>
>                 Key: CASSANDRA-10661
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-10661
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Local Write-Read Paths
>            Reporter: Pavel Yaskevich
>            Assignee: Pavel Yaskevich
>              Labels: sasi
>             Fix For: 3.x
>
>
> We have recently released new secondary index engine 
> (https://github.com/xedin/sasi) build using SecondaryIndex API, there are 
> still couple of things to work out regarding 3.x since it's currently 
> targeted on 2.0 released. I want to make this an umbrella issue to all of the 
> things related to integration of SASI, which are also tracked in 
> [sasi_issues|https://github.com/xedin/sasi/issues], into mainline Cassandra 
> 3.x release.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to