After further investigation, we found that the most recently inserted documents 
get higher priority and appear at the top of the search results, ignoring the 
field boosts, when the documents were inserted about a day apart.

Is there a configuration setting to switch off the insertion-time factor? As we 
understand it, field boosting should take precedence.

Thanks in advance for any inputs you can give.

-----Original Message-----
From: Megha Bhandari [mailto:mbhanda...@sapient.com] 
Sent: Monday, June 20, 2016 4:37 PM
To: solr-user@lucene.apache.org
Subject: Solr 5.5 | Field boosting not working as per expectation

Hi

Problem statement: of the fields that match, metatag.description has the 
highest boost, so documents with a match in this field should come first. 
However, Silver-Choice-5000-E.pdf comes before /contact-us/florida even though 
the search term matches more fields in the /contact-us/florida page. In 
Silver-Choice-5000-E.pdf, matches are found in the title and _text_ fields. In 
/contact-us/florida, matches are found in the metatag.description, title, h1 
and _text_ fields.

We have the following in solrconfig.xml:

<requestHandler name="/elevate" class="solr.SearchHandler" startup="lazy">
    <lst name="defaults">
      <str name="defType">edismax</str>
      <str name="qf">
        metatag.keywords^900000.1 metatag.description^500000.1 title^10000.1
        h1^1000.7 h2^700.6 h3^10.1 h4^5.4 h5^1.3 h6^1.2 _text_^1.0
      </str>

When searching for Florida we get the following results.
------------------------------------------

      {
        'id'=>'/content/dam/uhcdotcom/en/qa_workarea/Silver-Choice-5000-E.pdf',
        'title'=>'Florida',
        'metatag.description'=>'mental health',
        'itemtype'=>'pdf',
        'playerid'=>'',
        'playerkey'=>'',
        'metatag.topresultthumbnailalt'=>'Florida',
        'lang'=>'en',
        'metatag.hideininternalsearch'=>'false'},
      {
        'lang'=>'en',
        'metatag.topresultthumbnailurl'=>'',
        'id'=>'https://10.209.5.171/contact-us/florida',
        'title'=>'Florida',
        'metatag.topresultthumbnailalt'=>'',
        'metatag.hideininternalsearch'=>'false',
        'metatag.description'=>'Contact UnitedHealthcare in Florida.'}

-----------------------------------------------------------
With the following debug information:

'/content/dam/uhcdotcom/en/qa_workarea/Silver-Choice-5000-E.pdf'=>'
0.050655644 = max of:
  0.050655644 = weight(title:florida in 0) [ClassicSimilarity], result of:
    0.050655644 = score(doc=0,freq=1.0), product of:
      0.0075930804 = queryWeight, product of:
        10000.1 = boost
        6.6712904 = idf(docFreq=21, maxDocs=6389)
        1.1381613E-7 = queryNorm
      6.6712904 = fieldWeight in 0, product of:
        1.0 = tf(freq=1.0), with freq of:
          1.0 = termFreq=1.0
        6.6712904 = idf(docFreq=21, maxDocs=6389)
        1.0 = fieldNorm(doc=0)
  3.462133E-7 = weight(_text_:florida in 0) [ClassicSimilarity], result of:
    3.462133E-7 = score(doc=0,freq=2.0), product of:
      4.222856E-7 = queryWeight, product of:
        3.710244 = idf(docFreq=424, maxDocs=6389)
        1.1381613E-7 = queryNorm
      0.8198558 = fieldWeight in 0, product of:
        1.4142135 = tf(freq=2.0), with freq of:
          2.0 = termFreq=2.0
        3.710244 = idf(docFreq=424, maxDocs=6389)
        0.15625 = fieldNorm(doc=0)
',
      'https://10.209.5.171/contact-us/florida'=>'
0.02968075 = max of:
  0.0011872416 = weight(title:florida in 380) [ClassicSimilarity], result of:
    0.0011872416 = score(doc=380,freq=1.0), product of:
      0.0075930804 = queryWeight, product of:
        10000.1 = boost
        6.6712904 = idf(docFreq=21, maxDocs=6389)
        1.1381613E-7 = queryNorm
      0.15635836 = fieldWeight in 380, product of:
        1.0 = tf(freq=1.0), with freq of:
          1.0 = termFreq=1.0
        6.6712904 = idf(docFreq=21, maxDocs=6389)
        0.0234375 = fieldNorm(doc=380)
  5.724965E-9 = weight(_text_:florida in 380) [ClassicSimilarity], result of:
    5.724965E-9 = score(doc=380,freq=14.0), product of:
      4.222856E-7 = queryWeight, product of:
        3.710244 = idf(docFreq=424, maxDocs=6389)
        1.1381613E-7 = queryNorm
      0.013557092 = fieldWeight in 380, product of:
        3.7416575 = tf(freq=14.0), with freq of:
          14.0 = termFreq=14.0
        3.710244 = idf(docFreq=424, maxDocs=6389)
        9.765625E-4 = fieldNorm(doc=380)
  8.445298E-5 = weight(h1:florida in 380) [ClassicSimilarity], result of:
    8.445298E-5 = score(doc=380,freq=1.0), product of:
      8.387785E-4 = queryWeight, product of:
        1000.7 = boost
        7.3644376 = idf(docFreq=10, maxDocs=6389)
        1.1381613E-7 = queryNorm
      0.10068567 = fieldWeight in 380, product of:
        1.0 = tf(freq=1.0), with freq of:
          1.0 = termFreq=1.0
        7.3644376 = idf(docFreq=10, maxDocs=6389)
        0.013671875 = fieldNorm(doc=380)
  0.02968075 = weight(metatag.description:florida in 380) [ClassicSimilarity], result of:
    0.02968075 = score(doc=380,freq=1.0), product of:
      0.3796503 = queryWeight, product of:
        500000.1 = boost
        6.6712904 = idf(docFreq=21, maxDocs=6389)
        1.1381613E-7 = queryNorm
      0.07817918 = fieldWeight in 380, product of:
        1.0 = tf(freq=1.0), with freq of:
          1.0 = termFreq=1.0
        6.6712904 = idf(docFreq=21, maxDocs=6389)
        0.01171875 = fieldNorm(doc=380)
',
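
For reference, the arithmetic in the explain output above can be reproduced by 
hand. The Python sketch below is only an illustration, not Solr code. It 
assumes the products exactly as the debug output prints them (queryWeight = 
boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, tf = sqrt(freq), 
and the document score as the max over the per-field scores). The function name 
classic_score is hypothetical, and every number is copied from the explain 
output.

```python
import math

QUERY_NORM = 1.1381613e-7  # queryNorm, copied from the explain output


def classic_score(boost, idf, freq, field_norm, query_norm=QUERY_NORM):
    """Recompute one per-field score the way the explain output lays it out."""
    tf = math.sqrt(freq)                    # tf(freq) as printed: sqrt(termFreq)
    query_weight = boost * idf * query_norm
    field_weight = tf * idf * field_norm
    return query_weight * field_weight


# Silver-Choice-5000-E.pdf (doc=0); _text_ has boost 1.0 in qf
pdf_fields = {
    "title": classic_score(10000.1, 6.6712904, 1.0, 1.0),
    "_text_": classic_score(1.0, 3.710244, 2.0, 0.15625),
}

# /contact-us/florida (doc=380)
page_fields = {
    "title": classic_score(10000.1, 6.6712904, 1.0, 0.0234375),
    "_text_": classic_score(1.0, 3.710244, 14.0, 9.765625e-4),
    "h1": classic_score(1000.7, 7.3644376, 1.0, 0.013671875),
    "metatag.description": classic_score(500000.1, 6.6712904, 1.0, 0.01171875),
}

# "max of:" in the explain output: the best-scoring field wins
pdf_score = max(pdf_fields.values())
page_score = max(page_fields.values())

print(f"pdf  score = {pdf_score:.9f}")
print(f"page score = {page_score:.9f}")
```

With these inputs, the two title matches differ only in fieldNorm (1.0 for the 
PDF versus 0.0234375 for the page). The metatag.description match has a 
fieldNorm of 0.01171875, roughly 85 times smaller than the PDF title's 1.0, 
which more than offsets its roughly 50 times larger boost relative to title.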

Thanks for any help in understanding and tuning Solr to behave correctly.

Regards
Megha
