RE: Scoring Technique based on Relevance Feeback & other Parameters

2006-08-23 Thread Russell M. Allen
I have a similar interest in specifying a custom scoring strategy. I previously posted about it under the subject "Scoring a document (count?)" on 7/27/06. In brief, I want a documents score to be a count of term matches. This is nearly identical to a SQL count() functionality. If you are able

RE: Scoring a document (count?)

2006-08-04 Thread Russell M. Allen
uld return the raw tf as a score, as an exercise. It is not a product level implementation of course, but if you think this will help you (?) I can share the code. (Would have responded sooner, for my working computer went off for a few days with a fan error...:-). Regards, Doron "Russell M.

RE: Scoring a document (count?)

2006-07-31 Thread Russell M. Allen
ormance considerations, depending on the star| volume of the data... Regards, Doron "Russell M. Allen" <[EMAIL PROTECTED]> wrote on 27/07/2006 09:02:46: > I am curious about the potential use of document scoring as a means to > extract additional data from an index. Spe

RE: Scoring a document (count?)

2006-07-31 Thread Russell M. Allen
7;ve got the stored value of the "horror" field to tell you how many movies they've been in. : Date: Thu, 27 Jul 2006 12:02:46 -0400 : From: Russell M. Allen <[EMAIL PROTECTED]> : Reply-To: java-user@lucene.apache.org : To: java-user@lucene.apache.org : Subject: Scoring a

RE: About search performance

2006-07-31 Thread Russell M. Allen
You should build your own performance test cases to see what works for your data. That being said, here are some numbers from a similar test I ran: I did the following: 1) run a single term query which resulted in about half of the total set of documents being returned. (~36,000) 2) built a Bo

Scoring a document (count?)

2006-07-27 Thread Russell M. Allen
I am curious about the potential use of document scoring as a means to extract additional data from an index. Specifically, I would like the score to be a count of how many times a particular field matched a set of terms. For example, I am indexing movie-stars (Each document is a movie-star). A

RE: PrefixQuery rewrite() bug, ignores max clause count

2006-07-17 Thread Russell M. Allen
: 1) colorzing your mail doesn't play nicely with the mailing list ... : sending "diffs" is the prefered way to show potential changes to the code. So I discovered on sending the message. :) : 2) assuming i understand what it is you've changed, you've "worked arround" the TooManyClausesException

PrefixQuery rewrite() bug, ignores max clause count

2006-07-14 Thread Russell M. Allen
We recently ran into an issue while executing a simple prefix search "name:b*", which results in a BooleanQuery$TooManyClauses exception. At first I found it odd that a single clause query was causing this, but as I dug into the code I found where the PrefixQuery rewrites itself as a BooleanQuery.