[ 
https://issues.apache.org/jira/browse/LUCENE-7533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Rowe updated LUCENE-7533:
-------------------------------
    Attachment: LUCENE-7533-disallow-option-combo.patch

Patch that disallows autoGeneratePhraseQueries=true when 
splitOnWhitespace=false.
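
To illustrate what "disallows" could look like, here is a minimal sketch of a setter-level guard that rejects the option combination. It is not the attached patch: ParserOptions is a hypothetical stand-in class, and its defaults and exception messages are assumptions made for this example only.

```java
// Hypothetical stand-in for the two parser options discussed in this issue.
// This is NOT Lucene's QueryParser and NOT the attached patch.
final class ParserOptions {
    private boolean splitOnWhitespace = true;          // assumed default for this sketch
    private boolean autoGeneratePhraseQueries = false; // assumed default for this sketch

    void setSplitOnWhitespace(boolean value) {
        // Reject turning splitting off while auto-phrase generation is on.
        if (!value && autoGeneratePhraseQueries) {
            throw new IllegalArgumentException(
                "splitOnWhitespace=false is disallowed when autoGeneratePhraseQueries=true");
        }
        splitOnWhitespace = value;
    }

    void setAutoGeneratePhraseQueries(boolean value) {
        // Reject turning auto-phrase generation on while splitting is off.
        if (value && !splitOnWhitespace) {
            throw new IllegalArgumentException(
                "autoGeneratePhraseQueries=true is disallowed when splitOnWhitespace=false");
        }
        autoGeneratePhraseQueries = value;
    }

    boolean getSplitOnWhitespace() {
        return splitOnWhitespace;
    }

    boolean getAutoGeneratePhraseQueries() {
        return autoGeneratePhraseQueries;
    }
}
```

Checking in both setters means the invalid state cannot be reached regardless of the order in which the options are set.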

This is ready to go.  I'm going to commit shortly.

> Classic query parser: autoGeneratePhraseQueries=true doesn't work when 
> splitOnWhitespace=false
> ----------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-7533
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7533
>             Project: Lucene - Core
>          Issue Type: Bug
>    Affects Versions: 6.2, 6.3, 6.2.1
>            Reporter: Steve Rowe
>         Attachments: LUCENE-7533-disallow-option-combo.patch, 
> LUCENE-7533.patch
>
>
> LUCENE-2605 introduced the classic query parser option to not split on 
> whitespace prior to performing analysis.
> From the javadocs for QueryParser.setAutoGeneratePhraseQueries():
>     "phrase queries will be automatically generated when the analyzer returns
>     more than one term from whitespace delimited text."
> When splitOnWhitespace=false, the output from analysis can now come from 
> multiple whitespace-separated tokens, which breaks code assumptions when 
> autoGeneratePhraseQueries=true: for this combination of options, it's not 
> appropriate to auto-quote multiple non-overlapping tokens produced by 
> analysis.  E.g. simple whitespace tokenization over the query "some words" 
> will produce the token sequence ("some", "words"), and even when 
> autoGeneratePhraseQueries=true, we should not be creating a phrase query here.
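
The problem described above can be reproduced with a small self-contained simulation. This is a sketch under stated assumptions, not Lucene code: AutoPhraseSketch, analyze() and naiveQuery() are hypothetical names, analysis is modelled as plain whitespace tokenization, and the "naive" rule quotes a chunk whenever its analysis yields more than one token.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical simulation of the option interaction; not Lucene's implementation.
final class AutoPhraseSketch {

    // Stand-in for analysis: simple whitespace tokenization.
    static List<String> analyze(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.trim().split("\\s+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Naive rule: auto-quote whenever one analyzed chunk yields several tokens.
    static String naiveQuery(String queryText,
                             boolean splitOnWhitespace,
                             boolean autoGeneratePhraseQueries) {
        // With splitting on, each whitespace-delimited chunk is analyzed separately;
        // with splitting off, the whole query text is analyzed as one chunk.
        List<String> chunks = splitOnWhitespace
            ? analyze(queryText)
            : Collections.singletonList(queryText);

        List<String> clauses = new ArrayList<>();
        for (String chunk : chunks) {
            List<String> tokens = analyze(chunk);
            if (autoGeneratePhraseQueries && tokens.size() > 1) {
                clauses.add("\"" + String.join(" ", tokens) + "\"");
            } else {
                clauses.addAll(tokens);
            }
        }
        return String.join(" ", clauses);
    }
}
```

In this model, naiveQuery("some words", true, true) yields the two separate terms some words, while naiveQuery("some words", false, true) yields the phrase "some words", which is the unwanted auto-quoting of tokens that came from separate whitespace-delimited words.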



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

