[ 
https://issues.apache.org/jira/browse/SOLR-14454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17097683#comment-17097683
 ] 

Michael Gibney commented on SOLR-14454:
---------------------------------------

[PR #1478|https://github.com/apache/lucene-solr/pull/1478] adds support in 
three phases.
 # 
[aaf3bf1c|https://github.com/apache/lucene-solr/pull/1478/commits/aaf3bf1c3b98eab087991371a3a88dcd56274090]
 introduces minimal changes that extend {{useDocValuesAsStored}} and 
{{/export}} support for custom utf8 field types with {{DocValuesType.BINARY}}
 # 
[409cb811|https://github.com/apache/lucene-solr/pull/1478/commits/409cb81120066adda5ddb58abfc5b2166dc89c7e]
 adds a concrete implementation that simply writes the raw utf8 binary 
representation to binary docValues
 # 
[462a9a7e|https://github.com/apache/lucene-solr/pull/1478/commits/462a9a7ecf32132369052ae0f82075975f5f7cbd]
 extends the simple implementation to support per-value, configurable Deflate 
compression of binary docValues

> support for UTF-8 (string) types with DocValuesType.BINARY
> ----------------------------------------------------------
>
>                 Key: SOLR-14454
>                 URL: https://issues.apache.org/jira/browse/SOLR-14454
>             Project: Solr
>          Issue Type: New Feature
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: Schema and Analysis
>    Affects Versions: master (9.0)
>            Reporter: Michael Gibney
>            Priority: Minor
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The goal is to add support for string fields with arbitrarily large values in 
> the {{/export}} handler and streaming expressions.
> {{StrField}} values are currently limited to 32766 bytes for the case where 
> {{indexed=true}} or {{docValues=true}}. Exceeding this value triggers an 
> "immense field" warning, and causes indexing to fail for the associated input 
> doc.
> Configuring a {{StrField}} field as "{{indexed=false docValues=false}}" 
> removes this size limitation, so it is already possible to have large 
> _stored_ {{StrField}} values. But the "{{docValues=true}}" prerequisite for 
> the {{/export}} handler (and consequently for streaming expressions) limits 
> the size of field that can be used in conjunction with these features.
> Adding support for UTF-8/string field types with {{DocValuesType.BINARY}} 
> would address this limitation and allow considerable flexibility in the 
> implementation of custom field types. N.b.: this would address field value 
> retrieval use cases only (e.g., {{/export}} and {{useDocValuesAsStored}}); 
> neither sorting nor faceting would be supported.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to