[ https://issues.apache.org/jira/browse/LUCENE-6766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15279810#comment-15279810 ]
Michael McCandless commented on LUCENE-6766:
--------------------------------------------
I tested master vs. patch indexing performance on luceneutil's "wikimedium10m"
docs, running indexing 5 times for each. This is just a "first, do no harm" test,
i.e. in both cases I'm indexing without an index sort.
I use SerialMergeScheduler (SMS) and frequent flushing, so this is a very
merge-heavy benchmark.
Master:
{noformat}
/l/logs/before0.log:Indexer: finished (675550 msec)
/l/logs/before1.log:Indexer: finished (671058 msec)
/l/logs/before2.log:Indexer: finished (683297 msec)
/l/logs/before3.log:Indexer: finished (670856 msec)
/l/logs/before4.log:Indexer: finished (671516 msec)
{noformat}
Patch:
{noformat}
/l/logs/after0.log:Indexer: finished (673302 msec)
/l/logs/after1.log:Indexer: finished (674855 msec)
/l/logs/after2.log:Indexer: finished (679655 msec)
/l/logs/after3.log:Indexer: finished (680151 msec)
/l/logs/after4.log:Indexer: finished (681921 msec)
{noformat}
Net/net I think any performance hit is very small (the mean over the 5 runs
is about 0.5% slower with the patch), well within measurement noise.
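For anyone who wants to check the "within noise" reading, the timings above can be summarized with a few lines of Python. This is only a sketch: the numbers are copied from the two log listings, while the names (`master_ms`, `patch_ms`, `summarize`) are made up for illustration and are not part of luceneutil. The means come out at roughly 674.5 s for master and 678.0 s for the patch, a difference of about 0.5%, which is smaller than the roughly 12.4 s spread between the fastest and slowest master runs.

```python
from statistics import mean, stdev

# Indexing times in msec, copied from the before*.log / after*.log lines above.
master_ms = [675550, 671058, 683297, 670856, 671516]
patch_ms = [673302, 674855, 679655, 680151, 681921]


def summarize(runs):
    """Return mean, sample standard deviation, and min-to-max spread of the runs."""
    return {
        "mean": mean(runs),
        "stdev": stdev(runs),
        "spread": max(runs) - min(runs),
    }


master = summarize(master_ms)
patch = summarize(patch_ms)

# Difference of the means, in msec and as a percentage of the master mean.
delta_ms = patch["mean"] - master["mean"]
delta_pct = 100.0 * delta_ms / master["mean"]

print(f"master: mean={master['mean']:.1f} msec, stdev={master['stdev']:.1f}, "
      f"spread={master['spread']}")
print(f"patch:  mean={patch['mean']:.1f} msec, stdev={patch['stdev']:.1f}, "
      f"spread={patch['spread']}")
print(f"delta:  {delta_ms:.1f} msec ({delta_pct:.2f}%)")
```

With only 5 runs per side this is a rough sanity check rather than a significance test, but it is enough to see that the gap between the means is smaller than the run-to-run variation on master alone.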
> Make index sorting a first-class citizen
> ----------------------------------------
>
> Key: LUCENE-6766
> URL: https://issues.apache.org/jira/browse/LUCENE-6766
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Priority: Minor
> Attachments: LUCENE-6766.patch, LUCENE-6766.patch, LUCENE-6766.patch
>
>
> Today index sorting is a very expert feature. You need to use a custom merge
> policy, custom collectors, etc. I would like to explore making it a
> first-class citizen so that:
> - the sort order could be configured on IndexWriterConfig
> - segments would record the sort order that was used to write them
> - IndexSearcher could automatically early terminate when computing top docs
> on a sort order that is a prefix of the sort order of a segment (and if the
> user is not interested in totalHits).