[ 
https://issues.apache.org/jira/browse/LUCENE-9815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17292425#comment-17292425
 ] 

Robert Muir commented on LUCENE-9815:
-------------------------------------

I still think the use-case/premise here is wrong. We shouldn't even have it as 
an option to compress the sorted docvalues term dictionary, it should just be 
what happens.

And any bad code hashing on term bytes and stuff instead of using ordinals 
should just be fixed.

> PerField formats can select the format based on FieldInfo
> ---------------------------------------------------------
>
>                 Key: LUCENE-9815
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9815
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Bruno Roustant
>            Priority: Minor
>
> PerFieldDocValuesFormat and PerFieldPostingsFormat currently only select the 
> format based on the field name.
> If we improve them to also support the selection based on the FieldInfo, it 
> will be possible to select based on some FieldInfo attribute, DocValuesType, 
> etc.
> +Example use-case:+
>  It will be possible to adapt the compression mode of doc values fields 
> easily based on the DocValuesType. E.g. compressing sorted and not binary doc 
> values.
> > User creates a new custom codec which provides a custom DocValuesFormat 
> > which extends PerFieldDocValuesFormat and implements the method
>  DocValuesFormat getDocValuesFormatForField(FieldInfo fieldInfo).
>  This method provides either a standard Lucene80DocValuesFormat (no 
> compression) or another new custom DocValuesFormat extending 
> Lucene80DocValuesFormat with BEST_COMPRESSION mode.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to