Atin created SOLR-14436:
---------------------------

             Summary: When using Synonym Graph Filter, Solr does not tokenize 
query-string if it has multi-word synonym
                 Key: SOLR-14436
                 URL: https://issues.apache.org/jira/browse/SOLR-14436
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: query, Schema and Analysis
    Affects Versions: 8.3.1
            Reporter: Atin
         Attachments: Scenario1.png, Scenario2.png

 

When the Synonym Graph Filter is used and the query string contains a 
multi-word synonym, Solr treats that multi-word synonym as a single term and 
does not tokenize it further.

 

For example, *soap powder* is a search *query* that is also a _multi-word 
synonym_ in the synonym file:
{quote}s(104254535,1,'soap powder',n,1,1).
s(104254535,2,'built-soap powder',n,1,0).
s(104254535,3,'washing powder',n,1,0).{quote}
 
There are 2 documents; _soap_ occurs in both of them and _powder_ in one:
doc1: "Sunny Berlin breast tumors soap powder"
doc2: "She is in soap Berlin today"
 
 
+Scenario 1 (screenshot attached)+ 
*without* Synonym Graph Filter => 2 docs returned, as it checks for *"soap"* 
and *"powder"* separately.
 
+Scenario 2 (screenshot attached)+ 
*with* Synonym Graph Filter => only 1 doc returned, but 2 were expected. Here 
only *"soap powder"* is checked as a whole; it is not tokenized into *"soap"* 
and *"powder"*, so those terms are not searched individually.
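
The difference between the two scenarios can be reproduced with a small Python 
sketch. It is only a toy model under stated assumptions, not Solr's actual 
query parsing: analysis is reduced to lowercasing and whitespace splitting, 
Scenario 1 is modeled as an OR over the individual query terms, and Scenario 2 
as a match on the adjacent term sequence only. All function names are 
hypothetical.

```python
# Toy model of the two scenarios in this report; this is not Solr code.
DOCS = {
    "doc1": "Sunny Berlin breast tumors soap powder",
    "doc2": "She is in soap Berlin today",
}


def tokenize(text):
    """Lowercase and split on whitespace (assumed, simplified analysis)."""
    return text.lower().split()


def match_any_term(query, docs):
    """Scenario 1: a document matches if it contains any query term."""
    terms = set(tokenize(query))
    return sorted(d for d, text in docs.items() if terms & set(tokenize(text)))


def match_phrase(query, docs):
    """Scenario 2: a document matches only if the query terms are adjacent."""
    q = tokenize(query)
    hits = []
    for d, text in docs.items():
        toks = tokenize(text)
        # Slide a window of len(q) tokens over the document.
        if any(toks[i:i + len(q)] == q for i in range(len(toks) - len(q) + 1)):
            hits.append(d)
    return sorted(hits)


print(match_any_term("soap powder", DOCS))  # ['doc1', 'doc2']
print(match_phrase("soap powder", DOCS))    # ['doc1']
```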

Is it possible to expand the query string *soap powder* as:

Synonym(soap powder) + Synonym(soap) + Synonym(powder)
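
A minimal Python sketch of the expansion asked for here. It illustrates the 
desired behavior only and is not an existing Solr feature or API: the synonym 
group is copied from the synonym file entries quoted in this report, a term 
without an entry is assumed to expand to itself, and the names SYNONYMS, 
synonyms and expand_query are hypothetical.

```python
# Hypothetical illustration of the requested expansion; not Solr code.
SYNONYMS = {
    # Group taken from the synonym file entries quoted in this report.
    "soap powder": ["soap powder", "built-soap powder", "washing powder"],
}


def synonyms(term):
    """Return the synonym group for a term, or the term itself if none is known."""
    return SYNONYMS.get(term, [term])


def expand_query(query):
    """Build Synonym(whole query) + Synonym(each term), skipping duplicates."""
    terms = query.lower().split()
    candidates = [" ".join(terms)] + terms
    clauses = []
    for candidate in candidates:
        for alternative in synonyms(candidate):
            if alternative not in clauses:
                clauses.append(alternative)
    return clauses


print(expand_query("soap powder"))
# ['soap powder', 'built-soap powder', 'washing powder', 'soap', 'powder']
```

If a document matched when it contained any one of these clauses, both 
example documents would be returned, since doc2 contains "soap".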

 

Thank You.


