Re: Lucene Internals question

2007-01-23 Thread Yonik Seeley
On 1/23/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote: You might also be interested in https://issues.apache.org/jira/browse/ LUCENE-755 (aka the Payloads patch) which will enable storing information at the token level and allow for plugging in more scoring options related to it. There has been

Re: Lucene Internals question

2007-01-23 Thread Grant Ingersoll
You might also be interested in https://issues.apache.org/jira/browse/ LUCENE-755 (aka the Payloads patch) which will enable storing information at the token level and allow for plugging in more scoring options related to it. There has been a variety of discussions over on java-dev related t

Re: Lucene Internals question

2007-01-22 Thread Nicolas Lalevée
Le Lundi 22 Janvier 2007 20:44, EDMOND KEMOKAI a écrit : > Hmm..doesn't lucene scoring determine how relevant a document is to your > query? That is what PageRank and HITS do as well, I believe. Page and > document are the same, if you want to index a page you'll obviously try to > convert it into

Re: Lucene Internals question

2007-01-22 Thread Peter W .
Hello, Lucene gives the best documents for a given query and PageRank uses citation analysis with similar results but requires a large set of metadata to complete. Scoring in Lucene delivers pure search while PageRank attempts to establish source authority. I''m not strong in math, those

Re: Lucene Internals question

2007-01-22 Thread Mark Miller
Well first Lucene checks all of the other documents in the world for any that that refer to the document that your adding to Lucene...and then...oh wait... http://lucene.apache.org/java/docs/api/org/apache/lucene/search/Similarity.html EDMOND KEMOKAI wrote: Hmm..doesn't lucene scoring determi

Re: Lucene Internals question

2007-01-22 Thread EDMOND KEMOKAI
Hmm..doesn't lucene scoring determine how relevant a document is to your query? That is what PageRank and HITS do as well, I believe. Page and document are the same, if you want to index a page you'll obviously try to convert it into a document. PageRank does link analysis to determine how relevan

Re: Lucene Internals question

2007-01-22 Thread Nicolas Lalevée
Le Lundi 22 Janvier 2007 19:33, EDMOND KEMOKAI a écrit : > Hi All > This is a question for those familiar with lucene document scoring. How > does it compare with googles PageRank or HITS, or are they very different? > I have being looking at the PageRank algorithm but I'll need to brush-off > my m

Lucene Internals question

2007-01-22 Thread EDMOND KEMOKAI
Hi All This is a question for those familiar with lucene document scoring. How does it compare with googles PageRank or HITS, or are they very different? I have being looking at the PageRank algorithm but I'll need to brush-off my math skills before delving into it:) -- "talk trash and carry a sm