[jira] [Commented] (LUCENE-7618) Hypothetical perf improvements in DocValuesRangeQuery: reducing comparisons for some queries/segments

Adrien Grand (JIRA) Wed, 04 Jan 2017 02:50:16 -0800

    [ 
https://issues.apache.org/jira/browse/LUCENE-7618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15797922#comment-15797922
 ]


Adrien Grand commented on LUCENE-7618:
--------------------------------------

I could be wrong, but it looks to me that the cut that we are trying to cut 
here is low compared to the cost of running a query, like checking live docs, 
running first the approximation and then the two-phase confirmation, calling 
the collector, etc. Adding more implementations might also make Hotspot's job 
more complicated.

I am not entirely sure what was your motivation for reviewing doc values 
ranges, but in case you are looking at improving range performance, I have an 
open semi-related issue that tries to give a way to either execute the range on 
points (or legacy numerics, it does not matter) or doc values depending on 
which would be more efficient: LUCENE-7055.

> Hypothetical perf improvements in DocValuesRangeQuery: reducing comparisons 
> for some queries/segments
> -----------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-7618
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7618
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Hoss Man
>         Attachments: LUCENE-7618.patch
>
>
> In reviewing the DocValuesRangeQuery code, it occured to me that there 
> _might_ be some potential performance optimizations possible in a few cases 
> relating queries that involve explicitly specified open ranges (ie: min or 
> max are null) or in the case of SortedSet: range queries that are 
> *effectively* open ended on particular segments, because the min/max are 
> below/above the minOrd/maxOrd for the segment.
> Since these seemed like semi-common situations (open ended range queries are 
> fairly common in my experience, i'm not sure about the secondary SortedSet 
> "ord" case, but it seemd potentially promising particularly for fields like 
> incrementing ids, or timestamps, where values are added sequentially and 
> likeley to be clustered together) I did a bit of experimenting and wanted to 
> post my findings in jira -- patch & details to follow in comments.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (LUCENE-7618) Hypothetical perf improvements in DocValuesRangeQuery: reducing comparisons for some queries/segments

Reply via email to