Steve Rowe created LUCENE-7533:
----------------------------------
Summary: Classic query parser: autoGeneratePhraseQueries=true
doesn't work when splitOnWhitespace=false
Key: LUCENE-7533
URL: https://issues.apache.org/jira/browse/LUCENE-7533
Project: Lucene - Core
Issue Type: Bug
Affects Versions: 6.2.1, 6.2, 6.3
Reporter: Steve Rowe
LUCENE-2605 introduced the classic query parser option to not split on
whitespace prior to performing analysis.
When splitOnWhitespace=false, a single analysis pass can receive text
containing multiple whitespace-separated tokens. This breaks an assumption in
the autoGeneratePhraseQueries=true code path, namely that all tokens produced
by analysis came from one whitespace-separated token. For this combination of
options, it's not appropriate to auto-quote multiple non-overlapping tokens
produced by analysis.
E.g. simple whitespace tokenization over the query "some words" will produce
the token sequence ("some", "words"), and even when
autoGeneratePhraseQueries=true, we should not be creating a phrase query here.
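The interaction can be illustrated with a small, self-contained toy model. This
is only a sketch, not Lucene code: the class AutoPhraseSketch, the methods
analyze and naiveParse, and the toy analyzer that splits on non-alphanumeric
characters are all hypothetical stand-ins. It assumes the naive rule "more than
one token from a single analysis pass means a phrase".

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Toy model only: none of these names exist in Lucene.
class AutoPhraseSketch {

    // Hypothetical stand-in for an analyzer: splits on any run of
    // non-alphanumeric characters, so whitespace and '-' both separate tokens.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.split("[^\\p{Alnum}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Naive rule (an assumption of this sketch): if one analysis pass yields
    // more than one token and autoGeneratePhraseQueries is on, quote them.
    static String naiveParse(String query, boolean splitOnWhitespace,
                             boolean autoGeneratePhraseQueries) {
        // With splitOnWhitespace=true each chunk is analyzed separately;
        // with false, the whole query text goes through analysis at once.
        List<String> chunks = splitOnWhitespace
                ? Arrays.asList(query.trim().split("\\s+"))
                : Collections.singletonList(query.trim());
        List<String> clauses = new ArrayList<>();
        for (String chunk : chunks) {
            List<String> tokens = analyze(chunk);
            if (tokens.size() > 1 && autoGeneratePhraseQueries) {
                clauses.add("\"" + String.join(" ", tokens) + "\"");
            } else {
                clauses.addAll(tokens);
            }
        }
        return String.join(" ", clauses);
    }

    public static void main(String[] args) {
        // Two separate term clauses.
        System.out.println(naiveParse("some words", true, true));
        // The whole query becomes one phrase clause: the unwanted case.
        System.out.println(naiveParse("some words", false, true));
        // A single chunk that analysis splits: auto-quoting is intended here.
        System.out.println(naiveParse("wi-fi", true, true));
    }
}
```

In this model, "some words" is auto-quoted only when splitOnWhitespace=false,
because the naive rule cannot tell tokens that came from separate
whitespace-separated words apart from tokens split out of a single word such as
"wi-fi".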