[ 
https://issues.apache.org/jira/browse/LUCENE-7351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adrien Grand updated LUCENE-7351:
---------------------------------
    Attachment: LUCENE-7351.patch

Hmm I can remove both actually, they do not bring value now that the detection 
of whether doc ids are sorted is based on the doc ids themselves rather than 
the fact that there is a single value in a block.

> BKDWriter should compress doc ids when all values in a block are the same
> -------------------------------------------------------------------------
>
>                 Key: LUCENE-7351
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7351
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>         Attachments: LUCENE-7351.patch, LUCENE-7351.patch, LUCENE-7351.patch
>
>
> BKDWriter writes doc ids using 4 bytes per document. I think it should 
> compress similarly to postings when all docs in a block have the same packed 
> value. This can happen either when a field has a default value which is 
> common across documents or when quantization makes the number of unique 
> values so small that a large index will necessarily have blocks that all 
> contain the same value (eg. there are only 63490 unique half-float values).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to