[ https://issues.apache.org/jira/browse/LUCENE-8705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ignacio Vera updated LUCENE-8705:
---------------------------------
Issue Type: Improvement (was: Bug)
> Compress BKD trees by encoding the difference between two dimensions
> --------------------------------------------------------------------
>
> Key: LUCENE-8705
> URL: https://issues.apache.org/jira/browse/LUCENE-8705
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Priority: Minor
>
> When serializing BKD trees to disk, for each block we compute the common
> prefix of each dimension in isolation and encode that prefix only once for
> the entire block. Now that range fields and shapes store related data across
> several dimensions, we might occasionally find longer common prefixes by
> comparing a value with the corresponding values in other dimensions. For
> instance, when indexing narrow ranges in a range field, we might get better
> compression on the second dimension by encoding only the suffixes that
> differ from the first dimension. This is also an obvious win when indexing
> lines or points as shapes, since some dimensions then record exactly the
> same values.
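
To make the idea above concrete, here is a rough sketch in Python rather than Lucene's Java. It does not reflect the actual BKD writer. The function names, the 4-byte big-endian values, and the one-byte shared-length header per value are all assumptions made for illustration. It compares the byte cost of a per-dimension block prefix with the cost of encoding the second dimension as a suffix relative to the first.

```python
def common_prefix_len(a, b):
    """Number of leading bytes that a and b have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def per_dimension_cost(block, dim):
    """Bytes needed when the prefix shared by the whole block is stored once
    for this dimension and every value stores its remaining suffix."""
    values = [point[dim] for point in block]
    prefix = min(common_prefix_len(values[0], v) for v in values)
    width = len(values[0])
    return prefix + len(values) * (width - prefix)


def cross_dimension_cost(block, dim, ref_dim):
    """Bytes needed when each value stores only the suffix that differs from
    the same point's value in ref_dim.

    Assumption: one extra byte per value records the shared prefix length."""
    total = 0
    for point in block:
        shared = common_prefix_len(point[dim], point[ref_dim])
        total += 1 + (len(point[dim]) - shared)
    return total


def to_bytes(value):
    # Assumption: values are unsigned 32-bit integers, big-endian, so that
    # byte order matches numeric order.
    return value.to_bytes(4, "big")


# Narrow ranges [min, min + 3] spread widely across the value space.
mins = [10, 70_000, 5_000_000, 1_000_000_000]
block = [(to_bytes(m), to_bytes(m + 3)) for m in mins]

print(per_dimension_cost(block, 1))       # max dimension, block-wide prefix
print(cross_dimension_cost(block, 1, 0))  # max dimension, relative to min
```

With these widely spread but narrow ranges, the maximums share no block-wide prefix, while each maximum shares its first three bytes with its own minimum. When both dimensions hold identical values, as for points indexed as shapes, only the assumed length byte remains per value.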