Hi,
I am using Solr 7.7 in Cloud with the default query parser and similarity
algorithm. I get the following results with these queries:
q= "Wolfgang Amadeus Mozart": 8834 results.
q= "Wolfgang Amadeus Mozart" OR "Wolfgang Amadeus Mozart": 8831 results.
To my surprise, I get 3 fewer results with the second query, and I have
seen that those 3 documents contain the same words in a different order
("Mozart Wolfgang Amadeus").
In case it is relevant, the field used for the query is a textual field,
with regular normalization (see below):
<fieldType name="text_general" class="solr.TextField"
positionIncrementGap="100">
<analyzer type="index">
<tokenizer class="solr.WhitespaceTokenizerFactory" />
<filter class="solr.WordDelimiterGraphFilterFactory"
splitOnNumerics="0"/>
<filter class="solr.FlattenGraphFilterFactory"/>
<filter class="solr.LowerCaseFilterFactory" />
<filter class="solr.ASCIIFoldingFilterFactory"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.WhitespaceTokenizerFactory" />
<filter class="solr.WordDelimiterGraphFilterFactory"
splitOnNumerics="0"/>
<filter class="solr.LowerCaseFilterFactory" />
<filter class="solr.ASCIIFoldingFilterFactory"/>
</analyzer>
</fieldType>
Does anybody know why this is happening?
Thanks in advance for your help.
Mónica
--
Disclaimer: This email and any files transmitted with it are confidential
and intended solely for the use of the individual or entity to whom they
are
addressed. If you have received this email in error please notify the
system manager. If you are not the named addressee you should not
disseminate,
distribute or copy this email. Please notify the sender
immediately by email if you have received this email by mistake and delete
this email from your
system.