[
https://issues.apache.org/jira/browse/LUCENE-7463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Adrien Grand updated LUCENE-7463:
---------------------------------
Attachment: LUCENE-7463.patch
Here is a patch that adds a Lucene70Codec and a Lucene70DocValuesFormat. The
latter is mostly the same as Lucene54DocValuesFormat, the two differences are
that it uses nextSetBit operations on the bitset representing live docs in
order to have faster iteration (instead of testing each bit sequentially), and
that it bumps the threshold for sparse encoding to 10% instead of 1%. The goal
is not really to make it the final 7.0 codec but rather to have something that
we will be able to compare next iterations with.
> Create a Lucene70DocValuesFormat
> --------------------------------
>
> Key: LUCENE-7463
> URL: https://issues.apache.org/jira/browse/LUCENE-7463
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Priority: Minor
> Fix For: 7.0
>
> Attachments: LUCENE-7463.patch
>
>
> Even if it is not optimal, I think it would help to create a
> Lucene70DocValuesFormat now by copying the current Lucene54DocValuesFormat
> and including some minor changes like making the sparse case use a true
> iterator API as described in LUCENE-7457 (which should make it to
> Lucene54DocValuesFormat a well so that merging from an old codec would be
> efficient) as well as raising the threshold to enable sparse encoding and
> using nextSetBit operations when iterating bit sets, which cannot be done
> easily in Lucene54DocValuesFormat because we'd need to add a couple trailing
> bytes to make sure we can read a long at any valid index.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]