[ 
https://issues.apache.org/jira/browse/LUCENE-8572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16695760#comment-16695760
 ] 

Octavian Mocanu commented on LUCENE-8572:
-----------------------------------------

Hi [~romseygeek],

Trying e.g. at 
{code:java}
/org/apache/lucene/queryparser/flexible/standard/parser/EscapeQuerySyntaxImpl.java:83
 {code}
in
{code:java}
private static final CharSequence escapeTerm(CharSequence term, Locale 
locale){code}
with

 
{code:java}
term = "İpone " [304, 112, 111, 110, 101, 32]
locale = "us"
{code}

result -> *StringIndexOutOfBoundsException*

(it'll only work when having locale = "tr")

> StringIndexOutOfBoundsException in parser/EscapeQuerySyntaxImpl.java
> --------------------------------------------------------------------
>
>                 Key: LUCENE-8572
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8572
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/queryparser
>    Affects Versions: 6.3
>            Reporter: Octavian Mocanu
>            Priority: Major
>
> With "lucene-queryparser-6.3.0", specifically in
> "org/apache/lucene/queryparser/flexible/standard/parser/EscapeQuerySyntaxImpl.java"
>  
> when escaping strings containing extended unicode chars, and with a locale 
> distinct from that of the character set the string uses, the process fails, 
> with a "java.lang.StringIndexOutOfBoundsException".
>  
> The reason is that the comparison is done by previously converting all of the 
> characters of the string to lower case chars, and by doing this, the original 
> string size isn't anymore the same, but less, as of the transformed one, so 
> that executing
>  
> org/apache/lucene/queryparser/flexible/standard/parser/EscapeQuerySyntaxImpl.java:89
> fails with a java.lang.StringIndexOutOfBoundsException.
> I wonder whether the transformation to lower case is really needed when 
> treating the escape chars, since by avoiding it, the error may be avoided.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to