[ https://issues.apache.org/jira/browse/LUCENE-4752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13598252#comment-13598252 ]
David Smiley commented on LUCENE-4752:
--------------------------------------
Shai,
I don't think it's enough to have a SortingSegmentMerger. That's the simple
part, thanks to your just-finished SortingAtomicReader. You pointed out that
sorted segments alone will result in segments that are internally sorted and
thus should compress a little better, but document order across segments is
still effectively random. So if some thousand adjacent documents need to be
retrieved, the lookup will probably have to touch every segment, versus one or
two. Granted, the newly rewritten title I chose for this issue, "merge segments
to sort them", is limited to just making a SortingSegmentMerger. Should another
issue be filed with a title such as "MergePolicy to sort across segments"? The
key word being "across" -- I'm not sure how that could be clarified in a
succinct title.
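
To make the "within" versus "across" distinction concrete, here is a toy
simulation. It is only a sketch and does not use any Lucene API: segments are
modeled as plain lists of integer sort keys, and the names build_segments and
segments_touched, the segment size, and the key range are all hypothetical
choices made for illustration.

```python
import random

def build_segments(keys, segment_size, sort_across):
    """Split keys into fixed-size segments, each sorted internally.

    sort_across=False: segments are cut in arrival order, so each one is
    sorted internally but covers the whole key space.
    sort_across=True: keys are sorted globally first, so each segment
    covers one contiguous key range.
    """
    ordered = sorted(keys) if sort_across else list(keys)
    return [sorted(ordered[i:i + segment_size])
            for i in range(0, len(ordered), segment_size)]

def segments_touched(segments, lo, hi):
    """Count segments holding at least one key in the range [lo, hi]."""
    return sum(1 for seg in segments if any(lo <= k <= hi for k in seg))

# Assumption: 100,000 documents arrive in random order of their sort key.
rng = random.Random(42)
keys = list(range(100_000))
rng.shuffle(keys)

within = build_segments(keys, 10_000, sort_across=False)
across = build_segments(keys, 10_000, sort_across=True)

# Retrieve 1,000 documents that are adjacent in sort order.
lo, hi = 50_500, 51_499
print("sorted within segments only:", segments_touched(within, lo, hi))
print("sorted across segments:     ", segments_touched(across, lo, hi))
```

In this model the range lands in a single segment when the sort spans
segments (or two if it straddles a boundary), while with per-segment sorting
alone nearly every segment holds some of the requested keys.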
> Merge segments to sort them
> ---------------------------
>
> Key: LUCENE-4752
> URL: https://issues.apache.org/jira/browse/LUCENE-4752
> Project: Lucene - Core
> Issue Type: New Feature
> Components: core/index
> Reporter: David Smiley
> Assignee: Adrien Grand
>
> It would be awesome if Lucene could write the documents out in a segment
> based on a configurable order. This of course applies to merging segments
> too. The benefit is increased locality on disk of documents that are likely to
> be accessed together. This often applies to documents near each other in
> time, but also spatially.