Markus Jelsma created SOLR-13009:
------------------------------------

             Summary: ExtendedSolrQueryParser cannot handle KeywordRepeat, 
Synonyms and minShouldMatch
                 Key: SOLR-13009
                 URL: https://issues.apache.org/jira/browse/SOLR-13009
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: query parsers
    Affects Versions: 7.5
            Reporter: Markus Jelsma
             Fix For: master (8.0)


As described on the mailing list [1].

Consider this example synonym set [bier,brouw] where bier means beer and brouw 
is the stemmed version of brouwsel (brewage, concoction), and consider these 
parameters on /select:
{code}qf=content_nl&defType=edismax&mm=2<-1 5<-2 6<90%25{code}

The queries q=bier and q=brouw both give the desired results. However, for 
q=brouwsel something (partially) unexpected happens:
{code}+(((content_nl:brouwsel Synonym(content_nl:bier 
content_nl:brouw))~2)){code}

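This results in a BooleanQuery with two optional clauses: the original keyword 
brouwsel, and the synonym group of its stem brouw. Because the mm spec above 
requires every clause to match when there are two clauses or fewer (the ~2 in 
the parsed query), both clauses need to match, giving very few matches. 
Removing KeywordRepeat or setting mm=1 of course fixes the problem, but that is 
not what we want.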
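
To make the ~2 concrete, here is a rough sketch of how a conditional mm spec 
can be evaluated for a given number of optional clauses. It is a simplified 
Python re-implementation written for illustration only, not Solr's own code; 
the function names min_should_match and _simple are made up here, and 
whitespace around "<" is not handled. The %25 in the request above is the 
URL-encoded form of %, so the spec is "2<-1 5<-2 6<90%".

```python
def _simple(clause_count: int, expr: str) -> int:
    """Evaluate a non-conditional mm expression such as '3', '-1' or '90%'."""
    if expr.endswith("%"):
        calc = clause_count * int(expr[:-1]) / 100
        # Negative percentages mean "this share may be missing"; truncate toward zero.
        value = clause_count + int(calc) if calc < 0 else int(calc)
    else:
        n = int(expr)
        # Negative integers mean "all but n clauses".
        value = clause_count + n if n < 0 else n
    # Clamp to the valid range (an assumption of this sketch).
    return max(0, min(value, clause_count))


def min_should_match(clause_count: int, spec: str) -> int:
    """Return how many of clause_count optional clauses must match under spec."""
    result = clause_count  # default: every clause is required
    for part in spec.split():
        if "<" not in part:
            return _simple(clause_count, part)
        bound, expr = part.split("<", 1)
        if clause_count <= int(bound):
            # This condition does not apply; keep what was computed so far.
            return result
        result = _simple(clause_count, expr)
    return result


spec = "2<-1 5<-2 6<90%"
print(min_should_match(1, spec))  # 1: q=bier or q=brouw, a single clause
print(min_should_match(2, spec))  # 2: q=brouwsel, keyword plus synonym group
print(min_should_match(3, spec))  # 2: three clauses, all but one required
```

Under these assumptions the single-term query q=brouwsel is treated like a 
two-term query, which is why both the unstemmed keyword and the synonym group 
are required.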

I will shortly attach a patch containing a unit test that reproduces the problem.

[1] 
http://lucene.472066.n3.nabble.com/KeywordRepeat-stemming-single-term-synonyms-and-minimum-should-match-edismax-td4413690.html



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

