Re: Results driving me nuts!

2011-03-13 Thread Ahmet Arslan
--- On Sun, 3/13/11, Andy Newby wrote: > From: Andy Newby > Subject: Results driving me nuts! > To: solr-user@lucene.apache.org > Date: Sunday, March 13, 2011, 10:38 PM > Hi, > > Ok, I'm really really trying to get my head around this, > but I just can't :/ > > Here are 2 example records, bot

RE: Results driving me nuts!

2011-03-14 Thread cbennett
> -Original Message- > From: Ahmet Arslan [mailto:iori...@yahoo.com] > Sent: Sunday, March 13, 2011 6:25 PM > To: solr-user@lucene.apache.org; andy.ne...@gmail.com > Subject: Re: Results driving me nuts! > > > --- On Sun, 3/13/11, Andy Newby wrote: > > &

Re: Results driving me nuts!

2011-03-14 Thread Jonathan Rochkind
On 3/13/2011 6:24 PM, Ahmet Arslan wrote: http://lucene.apache.org/java/2_9_1/api/core/org/apache/lucene/search/Similarity.html#formula_norm I can see that the one with 5 matches is longer than the other. Shorter documents are favored in solr/lucene with length normalization factor. Is there

Re: Results driving me nuts!

2011-03-14 Thread Markus Jelsma
You can use omitNorms="true" for any given field. Length normalization will be disabled and index-time boosting will not be available any more. TermFrequencies can also be disabled by setting omitTermFreqAndPositions="true" for any given field. Omitting TF can be very useful if you need an easy

Re: Results driving me nuts!

2011-03-14 Thread Jonathan Rochkind
Aha. Yeah, I've read the documentation several times,but still find myself confused. But do I understand this right now: If I do omitNorms="true", but still leave "term freq and positions" in default case (ie, NOT omitTermFreqAndPositions="true") ... then a document with more occurences of a

Re: Results driving me nuts!

2011-03-14 Thread Markus Jelsma
On Monday 14 March 2011 17:27:05 Jonathan Rochkind wrote: > Aha. Yeah, I've read the documentation several times,but still find > myself confused. > > But do I understand this right now: > > If I do omitNorms="true", but still leave "term freq and positions" in > default case (ie, NOT omitTerm