[ 
https://issues.apache.org/jira/browse/CASSANDRA-10489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14949492#comment-14949492
 ] 

Jonathan Shook commented on CASSANDRA-10489:
--------------------------------------------

So, against a non-indexed field, the processing bound will be the size of the 
partition. If you only hold a scoreboard of limit items in memory and stream 
through the rest, replacing items, the memory requirements are lower, but the 
IO requirements could be substantial. If you do this with RF>1 and CL>1, then 
you may have semantics of result merging at the coordinator, but this should 
still be bounded to the result size and not the search space.

I would like for us to consider this operation for indexed fields and 
non-indexed fields as separate features, possibly putting the non-indexed 
version behind a warning or such. I'm sure some will absolutely try to sort 
10^9 items with limit 10. At least they should know that it has a completely 
different op cost.


> arbitrary order by on partitions
> --------------------------------
>
>                 Key: CASSANDRA-10489
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-10489
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Jon Haddad
>            Priority: Minor
>
> We've got aggregations, we might as well allow sorting rows within a 
> partition on arbitrary fields.  Currently the advice is "do it client side", 
> but when combined with a LIMIT clause it makes sense do this server side.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to