You might try www.Splainer.io - it will help you see where the matches are happening.

Cheers

Charlie

On 01/07/2021 16:27, Erik Hatcher wrote:
Mónica - that is a surprising difference.   Could you add &debug=true and share 
the parsed queries you're getting in both cases?

What is the layout of your collection?   multi-sharded?  how many shards?   Is 
the index in flux (things being added or deleted) between those queries?   Any 
other architecturally relevant factors involved?

Best,
        Erik


On Jul 1, 2021, at 11:10 AM, Mónica Marrero <[email protected]> wrote:

Hi,

I am using Solr 7.7 in Cloud with the default query parser and similarity
algorithm. I get the following results with these queries:

q= "Wolfgang Amadeus Mozart": 8834 results.
q= "Wolfgang Amadeus Mozart" OR "Wolfgang Amadeus Mozart": 8831 results.

To my surprise, I get 3 fewer results with the second query, and I have
seen that those 3 documents contain the same words in a different order
("Mozart Wolfgang Amadeus").

In case it is relevant, the field used for the query is a textual field,
with regular normalization (see below):

<fieldType name="text_general" class="solr.TextField"
positionIncrementGap="100">
            <analyzer type="index">
                <tokenizer class="solr.WhitespaceTokenizerFactory" />
                <filter class="solr.WordDelimiterGraphFilterFactory"
splitOnNumerics="0"/>
                <filter class="solr.FlattenGraphFilterFactory"/>
                <filter class="solr.LowerCaseFilterFactory" />
                <filter class="solr.ASCIIFoldingFilterFactory"/>
            </analyzer>
            <analyzer type="query">
                <tokenizer class="solr.WhitespaceTokenizerFactory" />
                <filter class="solr.WordDelimiterGraphFilterFactory"
splitOnNumerics="0"/>
                <filter class="solr.LowerCaseFilterFactory" />
                <filter class="solr.ASCIIFoldingFilterFactory"/>
            </analyzer>
        </fieldType>


Does anybody know why this is happening?

Thanks in advance for your help.

Mónica

--
Disclaimer: This email and any files transmitted with it are confidential
and intended solely for the use of the individual or entity to whom they
are
addressed. If you have received this email in error please notify the
system manager. If you are not the named addressee you should not
disseminate,
distribute or copy this email. Please notify the sender
immediately by email if you have received this email by mistake and delete
this email from your
system.


--
Charlie Hull - Managing Consultant at OpenSource Connections Limited <www.o19s.com> Founding member of The Search Network <https://thesearchnetwork.com/> and co-author of Searching the Enterprise <https://opensourceconnections.com/about-us/books-resources/>
tel/fax: +44 (0)8700 118334
mobile: +44 (0)7767 825828

Reply via email to