[
https://issues.apache.org/jira/browse/LUCENE-4527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13490179#comment-13490179
]
Robert Muir commented on LUCENE-4527:
-------------------------------------
And of course for this test (and any other test of compressing stored fields),
that
"fake" wikipedia corpus being used is totally invalid because all documents are
truncated to a specific length :)
> CompressingStoredFieldsFormat: encode numStoredFields more efficiently
> ----------------------------------------------------------------------
>
> Key: LUCENE-4527
> URL: https://issues.apache.org/jira/browse/LUCENE-4527
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Assignee: Adrien Grand
> Priority: Minor
> Fix For: 4.1
>
> Attachments: LUCENE-4527.patch
>
>
> Another interesting idea from Robert: many applications have a schema and all
> documents are likely to have the same number of stored fields. We could save
> space by using packed ints and the same kind of optimization as {{ForUtil}}
> (requiring only one VInt if all values are equal).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]