Re: BoostingTermQuery scoring

2008-11-07 Thread Peter Keegan
boost:(+petroleum +engineer +refinery) (+contents:(+petroleum +engineer +refinery) +((*:* -boost:petroleum) (*:* -boost:engineer) (*:* -boost:refinery))) That's an interesting solution. Would this result in many more documents being visited by the scorer, possibly impacting

Re: BoostingTermQuery scoring

2008-11-06 Thread Grant Ingersoll
Not sure, but it sounds like you are interested in a higher level Query, kind of like the BooleanQuery, but then part of it sounds like it is per document, right? Is it that you want to deal with multiple payloads in a document, or multiple BTQs in a bigger query? On Nov 4, 2008, at 9:42

Re: BoostingTermQuery scoring

2008-11-06 Thread Peter Keegan
Let me give some background on the problem behind my question. Our index contains many fields (title, body, date, city, etc). Most queries search all fields, but for best performance, we create an additional 'contents' field that contains all terms from all fields so that only one field needs to

Re: BoostingTermQuery scoring

2008-11-06 Thread Peter Keegan
I've discovered another flaw in using this technique: (+contents:petroleum +contents:engineer +contents:refinery) (+boost:petroleum +boost:engineer +boost:refinery) It's possible that the first clause will produce a matching doc and none of the terms in the second clause are used to score that

RE: BoostingTermQuery scoring

2008-11-06 Thread Steven A Rowe
Hi Peter, On 11/06/2008 at 4:25 PM, Peter Keegan wrote: I've discovered another flaw in using this technique: (+contents:petroleum +contents:engineer +contents:refinery) (+boost:petroleum +boost:engineer +boost:refinery) It's possible that the first clause will produce a matching doc and

BoostingTermQuery scoring

2008-11-04 Thread Peter Keegan
I'm using BoostingTermQuery to boost the score of documents with terms containing payloads (boost value 1). I'd like to change the scoring behavior such that if a query contains multiple BoostingTermQuery terms (either required or optional), documents containing more matching terms with payloads