You might want to see a post I just made to the thread with this long
subject:
"single field code ready - Re: URL to compare 2 Similarity's ready-- Re:
Scoring benchmark evaluation. Was RE: How to proceed with Bug 31841 -
MultiSearcher problems with Similarity.docFreq() ?"
I've done an exampl
Doug Cutting wrote:
Paul Elschot wrote:
I learned a lot by adding some javadocs to such classes. I suppose Doug
added the Expert markings, but I don't know their precise purpose.
The "Expert" declaration is meant to indicate that most users should not
need to understand the feature. Lucene's API
> -Original Message-
> From: Otis Gospodnetic [mailto:[EMAIL PROTECTED]
> Sent: Monday, February 07, 2005 12:15 PM
> To: Lucene Developers List
> Subject: Re: Study Group (WAS Re: Normalized Scoring)
>
> I think I see what you are after. I'm after the same knowledge.
Paul Elschot wrote:
I learned a lot by adding some javadocs to such classes. I suppose Doug
added the Expert markings, but I don't know their precise purpose.
The "Expert" declaration is meant to indicate that most users should not
need to understand the feature. Lucene's API seeks to be both sim
PROTECTED]
Sent: Monday, February 07, 2005 12:15 PM
To: Lucene Developers List
Subject: Re: Study Group (WAS Re: Normalized Scoring)
I think I see what you are after. I'm after the same knowledge. :)
The only things that I can recommend are books:
Modern Information Retrieval
Managing Gigaby
I think I see what you are after. I'm after the same knowledge. :)
The only things that I can recommend are books:
Modern Information Retrieval
Managing Gigabytes
And online resources like:
http://finance.groups.yahoo.com/group/mg/ (note the weird host name)
http://www.sims.berkeley.edu/~hear
There is a series of good course notes from the
Stanford course on IR:
http://www.stanford.edu/class/cs276/handouts/lecture1.pdf
to
http://www.stanford.edu/class/cs276/handouts/lecture16.pdf
These are from the course by Hinrich Schutze who
co-authored "Foundations of Statistical Natural
Languag
Hey Paul, thanks for responding.
On Sun, 6 Feb 2005 13:26:24 +0100, Paul Elschot wrote:
>
> Tuning the scoring is difficult because one needs to avoid the trap
> of optimizing for the test collection and test queries at hand. The
> interplays between query structure, coord(), idf() and tf() add to
On Sunday 06 February 2005 10:14, Kelvin Tan wrote:
> Wouldn't it be great if we can form a study-group of Lucene folks who want
> to take the "next step"? I feel uneasy posting non-Lucene specific questions
> to dev or user even if it's related to IR.
>
> Feels to me like there could be a couple like
Wouldn't it be great if we can form a study-group of Lucene folks who want to
take the "next step"? I feel uneasy posting non-Lucene specific questions to
dev or user even if it's related to IR.
Feels to me like there could be a couple like us, who didn't do a dissertation
in IR, but would like