Atin created SOLR-14436:
---------------------------

             Summary: When using Synonym Graph Filter, Solr does not tokenize query-string if it has multi-word synonym
                 Key: SOLR-14436
                 URL: https://issues.apache.org/jira/browse/SOLR-14436
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: query, Schema and Analysis
    Affects Versions: 8.3.1
            Reporter: Atin
         Attachments: Scenario1.png, Scenario2.png

While using the Synonym Graph Filter, if the query string contains a multi-word synonym, Solr treats that multi-word synonym as a single term and does not tokenize it further.

For example, *soap powder* is a search *query* that is also a _multi-word synonym_ in the synonym file:
{quote}
s(104254535,1,'soap powder',n,1,1).
s(104254535,2,'built-soap powder',n,1,0).
s(104254535,3,'washing powder',n,1,0).
{quote}

Two documents are indexed. Between them, _soap_ occurs twice and _powder_ once:

doc1: "Sunny Berlin breast tumors soap powder"
doc2: "She is in soap Berlin today"

+Scenario 1 (screenshot attached)+
*without* Synonym Graph Filter => 2 docs returned, because the query checks for *"soap"* and *"powder"* separately.

+Scenario 2 (screenshot attached)+
*with* Synonym Graph Filter => only 1 doc returned, but 2 were expected. Here only *"soap powder"* is checked; it is not tokenized into "soap" and "powder" and searched further.

Is it possible to expand the query string *soap powder* as:

Synonym(soap powder) + Synonym(soap) + Synonym(powder)?

Thank you.
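
The two scenarios and the requested expansion can be illustrated with a small toy model. This is only a sketch and is not Solr or Lucene code: the names `SYNONYMS`, `DOCS`, `matches`, `search`, and `expand` are hypothetical, and it assumes that a multi-word clause must match as a contiguous phrase while clauses are combined with OR.

```python
# Toy model of the behaviour described in this report; NOT Solr code.
# All names below are hypothetical and exist only for illustration.

# Synonym entries from the report, keyed by the multi-word term.
SYNONYMS = {
    "soap powder": ["soap powder", "built-soap powder", "washing powder"],
}

DOCS = {
    "doc1": "Sunny Berlin breast tumors soap powder",
    "doc2": "She is in soap Berlin today",
}


def matches(doc_text, clause):
    """Return True if the clause's words occur as a contiguous phrase."""
    doc = doc_text.lower().split()
    words = clause.lower().split()
    n = len(words)
    return any(doc[i:i + n] == words for i in range(len(doc) - n + 1))


def search(clauses):
    """OR semantics: ids of documents that match at least one clause."""
    return sorted(
        doc_id
        for doc_id, text in DOCS.items()
        if any(matches(text, clause) for clause in clauses)
    )


def expand(term):
    """Return the synonym set for a term, or the term itself if none."""
    return SYNONYMS.get(term, [term])


query = "soap powder"

# Scenario 1: no synonym handling, each word is searched separately.
scenario1 = search(query.split())

# Scenario 2 (assumed model): the multi-word synonym is kept as one unit.
scenario2 = search(expand(query))

# Requested: Synonym(soap powder) + Synonym(soap) + Synonym(powder).
requested = search(
    expand(query) + [c for word in query.split() for c in expand(word)]
)

print(scenario1, scenario2, requested)
```

Under these assumptions, the first and third searches return both documents, while the second returns only doc1, which mirrors the difference between the two attached screenshots.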