[ 
https://issues.apache.org/jira/browse/SOLR-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435631#comment-13435631
 ] 

Jack Krupansky commented on SOLR-3636:
--------------------------------------

I checked the code, both in 4x and 3.6, and it in fact uses the actual number 
of optional of terms generated in the top-level BooleanQuery, not the "number 
of terms found by edismax from the original query" as stated here in the issue 
description.

I suspect that this is a variation of SOLR-3589: "Edismax parser does not honor 
mm parameter if analyzer splits a token", where the low-level Lucene query 
parser code passes a single term to the field analyzer and gets multiple terms 
back and autoGeneratePhraseQueries is false, so the terms are ORed because that 
is the default operator. The Lucene level code does not know about "mm", Solr, 
or any request parameters at all.

Hmmm... maybe the solution to some of these issues is that if mm is 100%, Solr 
should implicitly set the default query operator directly to "AND", which would 
give the Lucene code the information needed to generate an AND rather than an 
OR.


                
> edismax, synonyms and mm=100%
> -----------------------------
>
>                 Key: SOLR-3636
>                 URL: https://issues.apache.org/jira/browse/SOLR-3636
>             Project: Solr
>          Issue Type: Bug
>          Components: query parsers
>            Reporter: Lance Norskog
>            Priority: Minor
>             Fix For: 4.0
>
>
> There is a problem with query-side synonyms, edismax and must-match=100%. 
> edismax interprets must-match=100% as "number of terms found by edismax from 
> the original query". These terms go through the query analyzer, and the 
> synonym filter creates more terms, *but* the must-match term count is not 
> incremented. Thus, given a synonym of
> {code}
> monkeyhouse => monkey house
> {code}
> the query {{q=big+monkeyhouse&mm=100%}} becomes (effectively) 
> {{q=big+monkey+house&mm=2}}. This query finds documents matching only two out 
> of three terms {{big+monkey, monkey+house, big+house}}.
> This might also be a problem in dismax.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to