Solr scoring confusion

2015-02-13 Thread Scott Johnson
We are getting inconsistent scoring results in Solr. It works about 95% of
the time, where a search on one term returns the results which equal exactly
that one term at the top, and results with multiple terms that also contain
that one term are returned lower. Occasionally, however, if a subset of the
data has been re-indexed (the same data just added to the index again) then
the results will be slightly off, for example the data from the earlier
index will get a higher score than it should, until we re-index all the
data.

 

Our assumption here is that setting omitNorms to false, then indexing the
data, then searching, should result in scores where the data with an exact
match has a higher score. We usually see this but not always. Is something
added to the score besides the value that is being searched that we are not
understaning?

 

Thanks.

..
Scott Johnson
Data Advantage Group, Inc.

604 Mission Street 
San Francisco, CA 94105 
Office:   +1.415.947.0400 x204
Fax:  +1.415.947.0401

Take the first step towards a successful
meta data initiative with MetaCenter - 
the only plug and play, real-time 
meta data solution.http://www.dag.com/ www.dag.com 
..

 



Re: Solr scoring confusion

2015-02-13 Thread Otis Gospodnetic
Hi Scott,

Try optimizing after reindexing and this should go away. Had to do with 
updated/deleted docs participating in score computation.

Otis
 

 On Feb 13, 2015, at 18:29, Scott Johnson sjohn...@dag.com wrote:
 
 We are getting inconsistent scoring results in Solr. It works about 95% of
 the time, where a search on one term returns the results which equal exactly
 that one term at the top, and results with multiple terms that also contain
 that one term are returned lower. Occasionally, however, if a subset of the
 data has been re-indexed (the same data just added to the index again) then
 the results will be slightly off, for example the data from the earlier
 index will get a higher score than it should, until we re-index all the
 data.
 
 
 
 Our assumption here is that setting omitNorms to false, then indexing the
 data, then searching, should result in scores where the data with an exact
 match has a higher score. We usually see this but not always. Is something
 added to the score besides the value that is being searched that we are not
 understaning?
 
 
 
 Thanks.
 
 ..
 Scott Johnson
 Data Advantage Group, Inc.
 
 604 Mission Street 
 San Francisco, CA 94105 
 Office:   +1.415.947.0400 x204
 Fax:  +1.415.947.0401
 
 Take the first step towards a successful
 meta data initiative with MetaCenter - 
 the only plug and play, real-time 
 meta data solution.http://www.dag.com/ www.dag.com 
 ..