Robert Muir created LUCENE-4991:
-----------------------------------

             Summary: QueryParser doesnt handle synonyms correctly for chinese
                 Key: LUCENE-4991
                 URL: https://issues.apache.org/jira/browse/LUCENE-4991
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/queryparser
            Reporter: Robert Muir


As reported multiple times on the user list:
http://find.searchhub.org/document/eaf0e88a6a0d4d1f
http://find.searchhub.org/document/abf28043c52b6efc
http://find.searchhub.org/document/1313794632c90826

The logic here is not forming the right query structures and ignoring 
positionIncrementAttribute from the tokenStream.

* when default operator is AND, you can see it more clearly, as synonyms are 
wrongly inserted as additional MUST terms:
    expected:<+field:中 +(field:国 field:國)> 
    but was:<+field:中 +field:国 +field:國>
* even when default operator is OR, its still wrong, because we ignore posInc 
and this means coord computation is not correct (so scoring is wrong)

This also screws up scoring and queries for decompounding too (because they go 
thru this exact situation if they add the original compound as a synonym).


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to