Stemmed terms/common terms

Alf Eaton Thu, 16 Aug 2007 07:19:16 -0700

A couple of questions about term frequencies and stemming:

- What's the best way to get the most common unstemmed form of aPorter-stemmed word from the index? For example given the stem'walk', find that 'walking' is the most common full word in the index.

- Is there a way to get a list of all the terms in the index (ormaybe just the top n) ordered by descending frequency of usage? Iimagine it's related to docFreq, but can't see how to get a list ofterms in all documents.

I'm using PyLucene and Solr, so if there are easy solutions in eitherof those that would be ideal.


Thanks,
alf.



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Stemmed terms/common terms

Reply via email to