bq: metatag.description field has highest boost and documents with a
match in this field should come first

This is something of a misunderstanding. The boost you've added to the
metatag.description field is a _factor_ in the score calculations, not
an ordering. In your case you're using edismax apparently (inferred by
the "max of", you didn't say) which scores based on the _maximum_ of
all the clauses and it happens that by the time you score the "title"
field in the first doc it has a higher score than the
metatag.description field in the second.... My guess is that the title
field in the first doc is much shorter than the title field in the
second...

And as an aside, periods aren't recommended in field names. They work
as far as we know, but going forward aren't officially supported.

Best,
Erick

On Mon, Jun 20, 2016 at 7:07 AM, Megha Bhandari <mbhanda...@sapient.com> wrote:
> Hi
>
> Problem statement : metatag.description field has highest boost and documents 
> with a match in this field should come first. However 
> Silver-Choice-5000-E.pdf comes before /contact-us/florida even though the 
> search term matches more fields in the /contact-us/florida page. In 
> Silver-Choice-5000-E.pdf matches are found in title and _text fields. In 
> /contact-us/florida matches are found in metatag.description,title,h1,_text 
> fields.
>
> We have the following in solrconfig.xml
>
> <requestHandler name="/elevate" class="solr.SearchHandler" startup="lazy">
>     <lst name="defaults">
>                   <str name="defType">edismax</str>
>       <str name="qf">
>         metatag.keywords^900000.1 metatag.description^500000.1 title^10000.1 
> h1^1000.7 h2^700.6 h3^10.1 h4^5.4 h5^1.3 h6^1.2 _text_^1.0
>       </str>
>
> When searching for Florida we get the following results.
> ------------------------------------------
>
> {
>
>         
> 'id'=>'/content/dam/uhcdotcom/en/qa_workarea/Silver-Choice-5000-E.pdf',
>
>         'title'=>'Florida',
>
>         'metatag.description'=>'mental health',
>
>         'itemtype'=>'pdf',
>
>         'playerid'=>'',
>
>         'playerkey'=>'',
>
>         'metatag.topresultthumbnailalt'=>'Florida',
>
>         'lang'=>'en',
>
>         'metatag.hideininternalsearch'=>'false'},
>
>       {
>
>         'lang'=>'en',
>
>         'metatag.topresultthumbnailurl'=>'',
>
>         'id'=>'https://10.209.5.171/contact-us/florida',
>
>         'title'=>'Florida',
>
>         'metatag.topresultthumbnailalt'=>'',
>
>         'metatag.hideininternalsearch'=>'false',
>
>         'metatag.description'=>'Contact UnitedHealthcare in Florida.'}
>
> -----------------------------------------------------------
> With following debug information
>
> '/content/dam/uhcdotcom/en/qa_workarea/Silver-Choice-5000-E.pdf'=>'
> 0.050655644 = max of:
>   0.050655644 = weight(title:florida in 0) [ClassicSimilarity], result of:
>     0.050655644 = score(doc=0,freq=1.0), product of:
>       0.0075930804 = queryWeight, product of:
>         10000.1 = boost
>         6.6712904 = idf(docFreq=21, maxDocs=6389)
>         1.1381613E-7 = queryNorm
>       6.6712904 = fieldWeight in 0, product of:
>         1.0 = tf(freq=1.0), with freq of:
>           1.0 = termFreq=1.0
>         6.6712904 = idf(docFreq=21, maxDocs=6389)
>         1.0 = fieldNorm(doc=0)
>   3.462133E-7 = weight(_text_:florida in 0) [ClassicSimilarity], result of:
>     3.462133E-7 = score(doc=0,freq=2.0), product of:
>       4.222856E-7 = queryWeight, product of:
>         3.710244 = idf(docFreq=424, maxDocs=6389)
>         1.1381613E-7 = queryNorm
>       0.8198558 = fieldWeight in 0, product of:
>         1.4142135 = tf(freq=2.0), with freq of:
>           2.0 = termFreq=2.0
>         3.710244 = idf(docFreq=424, maxDocs=6389)
>         0.15625 = fieldNorm(doc=0)
> ',
>       'https://10.209.5.171/contact-us/florida'=>'
> 0.02968075 = max of:
>   0.0011872416 = weight(title:florida in 380) [ClassicSimilarity], result of:
>     0.0011872416 = score(doc=380,freq=1.0), product of:
>       0.0075930804 = queryWeight, product of:
>         10000.1 = boost
>         6.6712904 = idf(docFreq=21, maxDocs=6389)
>         1.1381613E-7 = queryNorm
>       0.15635836 = fieldWeight in 380, product of:
>         1.0 = tf(freq=1.0), with freq of:
>           1.0 = termFreq=1.0
>         6.6712904 = idf(docFreq=21, maxDocs=6389)
>         0.0234375 = fieldNorm(doc=380)
>   5.724965E-9 = weight(_text_:florida in 380) [ClassicSimilarity], result of:
>     5.724965E-9 = score(doc=380,freq=14.0), product of:
>       4.222856E-7 = queryWeight, product of:
>         3.710244 = idf(docFreq=424, maxDocs=6389)
>         1.1381613E-7 = queryNorm
>       0.013557092 = fieldWeight in 380, product of:
>         3.7416575 = tf(freq=14.0), with freq of:
>           14.0 = termFreq=14.0
>         3.710244 = idf(docFreq=424, maxDocs=6389)
>         9.765625E-4 = fieldNorm(doc=380)
>   8.445298E-5 = weight(h1:florida in 380) [ClassicSimilarity], result of:
>     8.445298E-5 = score(doc=380,freq=1.0), product of:
>       8.387785E-4 = queryWeight, product of:
>         1000.7 = boost
>         7.3644376 = idf(docFreq=10, maxDocs=6389)
>         1.1381613E-7 = queryNorm
>       0.10068567 = fieldWeight in 380, product of:
>         1.0 = tf(freq=1.0), with freq of:
>           1.0 = termFreq=1.0
>         7.3644376 = idf(docFreq=10, maxDocs=6389)
>         0.013671875 = fieldNorm(doc=380)
>   0.02968075 = weight(metatag.description:florida in 380) 
> [ClassicSimilarity], result of:
>     0.02968075 = score(doc=380,freq=1.0), product of:
>       0.3796503 = queryWeight, product of:
>         500000.1 = boost
>         6.6712904 = idf(docFreq=21, maxDocs=6389)
>         1.1381613E-7 = queryNorm
>       0.07817918 = fieldWeight in 380, product of:
>         1.0 = tf(freq=1.0), with freq of:
>           1.0 = termFreq=1.0
>         6.6712904 = idf(docFreq=21, maxDocs=6389)
>         0.01171875 = fieldNorm(doc=380)
> ',
>
> Thanks for any help in understanding and tuning Solr to behave correctly.
>
> Regards
> Megha

Reply via email to