[ https://issues.apache.org/jira/browse/SOLR-14454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17097683#comment-17097683 ]
Michael Gibney commented on SOLR-14454: --------------------------------------- [PR #1478|https://github.com/apache/lucene-solr/pull/1478] adds support in three phases. # [aaf3bf1c|https://github.com/apache/lucene-solr/pull/1478/commits/aaf3bf1c3b98eab087991371a3a88dcd56274090] introduces minimal changes that extend {{useDocValuesAsStored}} and {{/export}} support for custom utf8 field types with {{DocValuesType.BINARY}} # [409cb811|https://github.com/apache/lucene-solr/pull/1478/commits/409cb81120066adda5ddb58abfc5b2166dc89c7e] adds a concrete implementation that simply writes the raw utf8 binary representation to binary docValues # [462a9a7e|https://github.com/apache/lucene-solr/pull/1478/commits/462a9a7ecf32132369052ae0f82075975f5f7cbd] extends the simple implementation to support per-value, configurable Deflate compression of binary docValues > support for UTF-8 (string) types with DocValuesType.BINARY > ---------------------------------------------------------- > > Key: SOLR-14454 > URL: https://issues.apache.org/jira/browse/SOLR-14454 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Components: Schema and Analysis > Affects Versions: master (9.0) > Reporter: Michael Gibney > Priority: Minor > Time Spent: 10m > Remaining Estimate: 0h > > The goal is to add support for string fields with arbitrarily large values in > the {{/export}} handler and streaming expressions. > {{StrField}} values are currently limited to 32766 bytes for the case where > {{indexed=true}} or {{docValues=true}}. Exceeding this value triggers an > "immense field" warning, and causes indexing to fail for the associated input > doc. > Configuring a {{StrField}} field as "{{indexed=false docValues=false}}" > removes this size limitation, so it is already possible to have large > _stored_ {{StrField}} values. But the "{{docValues=true}}" prerequisite for > the {{/export}} handler (and consequently for streaming expressions) limits > the size of field that can be used in conjunction with these features. > Adding support for UTF-8/string field types with {{DocValuesType.BINARY}} > would address this limitation and allow considerable flexibility in the > implementation of custom field types. N.b.: this would address field value > retrieval use cases only (e.g., {{/export}} and {{useDocValuesAsStored}}); > neither sorting nor faceting would be supported. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org