[ 
https://issues.apache.org/jira/browse/LUCENE-6766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15279810#comment-15279810
 ] 

Michael McCandless commented on LUCENE-6766:
--------------------------------------------

I compared indexing performance of master vs. the patch on luceneutil's 
"wikimedium10m" docs, running indexing 5 times for each.  This is just a 
"first, do no harm" test, i.e. in both cases I'm indexing without an index sort.

I use SerialMergeScheduler (SMS) and frequent flushing, so this is a very 
merge-heavy benchmark.

Master:

{noformat}
/l/logs/before0.log:Indexer: finished (675550 msec)
/l/logs/before1.log:Indexer: finished (671058 msec)
/l/logs/before2.log:Indexer: finished (683297 msec)
/l/logs/before3.log:Indexer: finished (670856 msec)
/l/logs/before4.log:Indexer: finished (671516 msec)
{noformat}

Patch:

{noformat}
/l/logs/after0.log:Indexer: finished (673302 msec)
/l/logs/after1.log:Indexer: finished (674855 msec)
/l/logs/after2.log:Indexer: finished (679655 msec)
/l/logs/after3.log:Indexer: finished (680151 msec)
/l/logs/after4.log:Indexer: finished (681921 msec)
{noformat}

Net/net, I think any performance hit is very small (the patch averages about 
0.5% slower across the 5 runs), well within measurement noise.
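
One way to sanity-check the "within noise" reading is to compare the difference in mean indexing time against the run-to-run spread. The sketch below is not part of luceneutil; it simply hard-codes the ten timings from the logs above, and the variable names are made up for illustration.

```python
from statistics import mean, stdev

# Indexer wall-clock times in msec, copied from the before*/after* logs above.
master = [675550, 671058, 683297, 670856, 671516]
patch = [673302, 674855, 679655, 680151, 681921]

master_mean = mean(master)
patch_mean = mean(patch)

# Relative slowdown of the patch versus master, in percent.
delta_pct = 100.0 * (patch_mean - master_mean) / master_mean

# Sample standard deviation of each set of runs (n - 1 in the denominator).
master_sd = stdev(master)
patch_sd = stdev(patch)

print(f"master: mean={master_mean:.0f} msec, stdev={master_sd:.0f} msec")
print(f"patch:  mean={patch_mean:.0f} msec, stdev={patch_sd:.0f} msec")
print(f"delta:  {patch_mean - master_mean:.0f} msec ({delta_pct:.2f}%)")
```

With these ten numbers, the gap between the two means comes out smaller than master's own standard deviation across its 5 runs, which is consistent with treating the difference as noise. Five runs per side is a small sample, so this is only a rough check, not a significance test.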


> Make index sorting a first-class citizen
> ----------------------------------------
>
>                 Key: LUCENE-6766
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6766
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>         Attachments: LUCENE-6766.patch, LUCENE-6766.patch, LUCENE-6766.patch
>
>
> Today index sorting is a very expert feature. You need to use a custom merge 
> policy, custom collectors, etc. I would like to explore making it a 
> first-class citizen so that:
>  - the sort order could be configured on IndexWriterConfig
>  - segments would record the sort order that was used to write them
>  - IndexSearcher could automatically early terminate when computing top docs 
> on a sort order that is a prefix of the sort order of a segment (and if the 
> user is not interested in totalHits).



