[ https://issues.apache.org/jira/browse/CASSANDRA-10661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15113051#comment-15113051 ]
DOAN DuyHai commented on CASSANDRA-10661: ----------------------------------------- Hello [~xedin], it's me again. I've had some discussion with search people and they told me that wildcard searches (name like "\*xxxxx\*") are very expensive. Classical data structure like suffix trees are adapted for suffix searching (name like "xxx\*"). For prefix search (name like "\*xxx") they're creating a *reversed* index. Does it mean that the CONTAINS mode (formerly named SUFFIX) is more expensive than the NORMAL search mode ? If yes, how much expensive is it (x2 ? order of magnitude ?) Second question, more related to the impl, since you query the nodes following the token range and do not hit all nodes like normal secondary index, does it imply that *sorting* (ORDER BY) is no longer relevant since you do not retrieve all possible results ? (I've seen in QueryPlan.MAX_ROWS that there is a hard-coded limit of 10 000 results) Sorry to annoy you with my questions but they are important so that we, evangelists, can give the right use-cases for users and especially deter them from mis-using SASI when it's not appropriate or when the search cost is prohibitive. > Integrate SASI to Cassandra > --------------------------- > > Key: CASSANDRA-10661 > URL: https://issues.apache.org/jira/browse/CASSANDRA-10661 > Project: Cassandra > Issue Type: Improvement > Components: Local Write-Read Paths > Reporter: Pavel Yaskevich > Assignee: Pavel Yaskevich > Labels: sasi > Fix For: 3.x > > > We have recently released new secondary index engine > (https://github.com/xedin/sasi) build using SecondaryIndex API, there are > still couple of things to work out regarding 3.x since it's currently > targeted on 2.0 released. I want to make this an umbrella issue to all of the > things related to integration of SASI, which are also tracked in > [sasi_issues|https://github.com/xedin/sasi/issues], into mainline Cassandra > 3.x release. -- This message was sent by Atlassian JIRA (v6.3.4#6332)