[
https://issues.apache.org/jira/browse/LUCENE-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Toke Eskildsen updated LUCENE-2369:
-----------------------------------
Attachment: LUCENE-2369.patch
Maintenance patch bringing the code up to date with Lucene trunk@1145556
(2011-07-13). Patch and test by e.g.
{code}
svn co http://svn.apache.org/repos/asf/lucene/dev/trunk@1145556 lucene-2369
cd lucene-2369
patch -p0 < LUCENE-2369.patch
cd modules/queryparser/
ant
cd ../../lucene/contrib/exposed/
ant compile-test
cd ../..
java -cp
lib/junit-4.7.jar:build/classes/test/:build/classes/java:build/contrib/exposed/classes/java/:build/contrib/exposed/classes/test/
org.junit.runner.JUnitCore
org.apache.lucene.search.exposed.facet.TestHierarchicalFacets
{code}
> Locale-based sort by field with low memory overhead
> ---------------------------------------------------
>
> Key: LUCENE-2369
> URL: https://issues.apache.org/jira/browse/LUCENE-2369
> Project: Lucene - Java
> Issue Type: New Feature
> Components: core/search
> Reporter: Toke Eskildsen
> Priority: Minor
> Labels: memory, search, sort, time
> Attachments: LUCENE-2369.patch, LUCENE-2369.patch, LUCENE-2369.patch,
> LUCENE-2369.patch, LUCENE-2369.patch, lucene-2369-20101011.patch
>
>
> The current implementation of locale-based sort in Lucene uses the FieldCache
> which keeps all sort terms in memory. Beside the huge memory overhead,
> searching requires comparison of terms with collator.compare every time,
> making searches with millions of hits fairly expensive.
> This proposed alternative implementation is to create a packed list of
> pre-sorted ordinals for the sort terms and a map from document-IDs to entries
> in the sorted ordinals list. This results in very low memory overhead and
> faster sorted searches, at the cost of increased startup-time. As the
> ordinals can be resolved to terms after the sorting has been performed, this
> approach supports fillFields=true.
> This issue is related to https://issues.apache.org/jira/browse/LUCENE-2335
> which contain previous discussions on the subject.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]