PageRanking with Lucene

2009-07-19 Thread prashant ullegaddi
Hi, We have some 50M pages, and we also have computed PageRanks of those pages. What's the best way to combine lucene's score with PageRank? Regards, Prashant.

RE: PageRanking with Lucene

2009-07-19 Thread Uwe Schindler
9, 2009 1:55 PM > To: java-user@lucene.apache.org > Subject: PageRanking with Lucene > > Hi, > > We have some 50M pages, and we also have computed PageRanks of those > pages. > What's the best way to combine lucene's sc

Re: PageRanking with Lucene

2009-07-21 Thread Grant Ingersoll
I'd probably look at the function package in Lucene. While the document boost can be used, it may not give you the granularity you need, as you only have something like 6 bits of representation. Some people have also done some things like a field with a single token that contains a payloa

Re: PageRanking with Lucene

2009-07-22 Thread prashant ullegaddi
Is it that boost of a Document is stored in 6-bits? On Wed, Jul 22, 2009 at 8:26 AM, Grant Ingersoll wrote: > I'd probably look at the function package in Lucene. While the document > boost can be used, it may not give you the granularity you need, as you only > have something like 6 bits of rep

Re: PageRanking with Lucene

2009-07-23 Thread Grant Ingersoll
On Jul 22, 2009, at 6:30 AM, prashant ullegaddi wrote: Is it that boost of a Document is stored in 6-bits? Kind of, the boost is stored in the norm, which also includes other factors like length normalization. There is one byte for all of those factors, whereas w/ the function approach,