Re: Frequencies sorted by frequencies

Grant Ingersoll Wed, 28 May 2008 11:34:46 -0700

I think you could override all the Similarity factors except tf() to return 1, such that the term frequency is the only factor in the scoring. Then you just submit the term as a query. Note, I think you will need to override the similarity during indexing, too, so that length normalization is turned off as well. Note, I haven't tried it :-). Use the explain() functionality to double check. At any rate, it should be quick to test.


See 
http://lucene.apache.org/java/2_3_2/api/org/apache/lucene/search/Similarity.html
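
If the Similarity route turns out to be awkward, a different technique (not the one described above) is to walk the postings once and keep only the N highest frequencies in a bounded min-heap. The following is a rough sketch under assumptions: TopFreqCollector and DocFreq are hypothetical helper names, not Lucene classes, and the field and term in the usage comment are made up. It is untested against a real index.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

/** Hypothetical helper (not part of Lucene): keeps the N postings with the highest freq. */
class TopFreqCollector {

    /** One (doc, freq) pair, e.g. as reported by TermDocs.doc() and TermDocs.freq(). */
    static final class DocFreq {
        final int doc;
        final int freq;

        DocFreq(int doc, int freq) {
            this.doc = doc;
            this.freq = freq;
        }
    }

    // Weakest entry first: lower freq is weaker; on equal freq the higher doc number is weaker.
    private static final Comparator<DocFreq> WEAKEST_FIRST = (a, b) ->
            a.freq != b.freq ? Integer.compare(a.freq, b.freq) : Integer.compare(b.doc, a.doc);

    private final int n;
    private final PriorityQueue<DocFreq> heap;

    TopFreqCollector(int n) {
        if (n < 1) {
            throw new IllegalArgumentException("n must be >= 1");
        }
        this.n = n;
        this.heap = new PriorityQueue<>(n, WEAKEST_FIRST);
    }

    /** Offer one posting; the heap never grows beyond n entries. */
    void collect(int doc, int freq) {
        DocFreq candidate = new DocFreq(doc, freq);
        if (heap.size() < n) {
            heap.add(candidate);
        } else if (WEAKEST_FIRST.compare(candidate, heap.peek()) > 0) {
            heap.poll();          // evict the current weakest entry
            heap.add(candidate);
        }
    }

    /** Returns the collected entries, highest freq first. */
    List<DocFreq> topDocs() {
        List<DocFreq> result = new ArrayList<>(heap);
        result.sort(WEAKEST_FIRST.reversed());
        return result;
    }
}

// Assumed usage with the Lucene 2.3 API (field "body" and term "lucene" are placeholders):
//   TermDocs td = reader.termDocs(new Term("body", "lucene"));
//   TopFreqCollector collector = new TopFreqCollector(10);
//   while (td.next()) collector.collect(td.doc(), td.freq());
//   td.close();
```

This visits every posting for the term once and holds at most N entries in memory, so the cost is driven by the term's document frequency rather than by the index size. Unlike the query approach, it ranks by raw freq only and does not go through scoring at all.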


-Grant


On May 28, 2008, at 10:48 AM, Hider, Sandy wrote:

Hi All,

I am trying to figure out a quick way to find the top N documents sorted
by frequency of a term.

I found:

IndexReader.termDocs()

which provides an enumeration of doc() and freq() but it returns an
enumeration sorted by doc number.   Is there a way to get the results
sorted by freq?  Or is there another query I can run to find these
results?

Thanks in advance,

Sandy


--------------------------
Grant Ingersoll
http://www.lucidimagination.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ







