[ 
https://issues.apache.org/jira/browse/OAK-10384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17770128#comment-17770128
 ] 

Julian Reschke commented on OAK-10384:
--------------------------------------

trunk: 
[1e55c01b22|https://github.com/apache/jackrabbit-oak/commit/1e55c01b22396239653549b3684bd9d71c606307]

> Fix stripping of large indexed ordered properties
> -------------------------------------------------
>
>                 Key: OAK-10384
>                 URL: https://issues.apache.org/jira/browse/OAK-10384
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: lucene
>            Reporter: Amit Jain
>            Assignee: Amit Jain
>            Priority: Major
>             Fix For: 1.58.0
>
>
> Currently, the ordered indexed properties are truncated at the max length 
> supported by lucene at 32766 in the 
> [LuceneDocumentMaker|https://github.com/apache/jackrabbit-oak/blob/trunk/oak-lucene/src/main/java/org/apache/jackrabbit/oak/plugins/index/lucene/LuceneDocumentMaker.java#L290-L294].
> The problem is lucene uses a class {{BytesRef}} to represent strings which 
> converts it to UTF-8. It then uses the length from this converted string to 
> enforce the limit. The transformation between java unicode string to utf-8 
> can cause the length to increase for non-ascii characters.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to