On 2010-09-06 17:15, Yonik Seeley wrote:
On Mon, Sep 6, 2010 at 10:52 AM, Roland Villemoes <r...@alpha-solutions.dk> wrote:
How can I retrieve all words from a Solr core?
I need a list of all the words and how often they occur in the index.

http://wiki.apache.org/solr/TermsComponent

It doesn't currently stream though, so requesting *all* at once might
take too much memory.  One workaround is to page via terms.lower and
terms.limit.
Perhaps we should consider adding streaming to the terms component
though.  Would you mind opening a JIRA issue?
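
The paging workaround mentioned above (terms.lower plus terms.limit) could look roughly like the sketch below. It is only an illustration under several assumptions: the handler URL and the field name are hypothetical, the response is assumed to be JSON with the flat [term, count, term, count, ...] layout under "terms", and terms.sort=index is requested so that the last term of one page can serve as the exclusive lower bound of the next.

```python
import json
import urllib.parse
import urllib.request


def make_solr_fetcher(base_url):
    """Return a function that fetches one page of terms from a TermsComponent handler.

    base_url is an assumption, e.g. "http://localhost:8983/solr/terms";
    the actual handler path depends on your solrconfig.xml.
    """
    def fetch_page(field, lower, include_lower, limit):
        params = {
            "terms": "true",
            "terms.fl": field,
            "terms.limit": str(limit),
            "terms.sort": "index",  # index order, so terms.lower can act as a cursor
            "wt": "json",
            "json.nl": "flat",      # assumed layout: [term, count, term, count, ...]
        }
        if lower is not None:
            params["terms.lower"] = lower
            params["terms.lower.incl"] = "true" if include_lower else "false"
        url = base_url + "?" + urllib.parse.urlencode(params)
        with urllib.request.urlopen(url) as resp:
            data = json.load(resp)
        flat = data["terms"][field]
        # Pair up the alternating term/count entries.
        return list(zip(flat[0::2], flat[1::2]))
    return fetch_page


def iter_all_terms(fetch_page, field, limit=1000):
    """Yield (term, count) pairs page by page instead of requesting everything at once."""
    lower = None           # no lower bound on the first request
    include_lower = True
    while True:
        page = fetch_page(field, lower, include_lower, limit)
        if not page:
            break
        yield from page
        if len(page) < limit:
            break          # a short page means the term list is exhausted
        lower = page[-1][0]    # resume after the last term seen
        include_lower = False  # exclusive, so that term is not returned twice


# Hypothetical usage (URL and field name are placeholders):
# fetch = make_solr_fetcher("http://localhost:8983/solr/terms")
# for term, count in iter_all_terms(fetch, "text", limit=1000):
#     print(term, count)
```

Only one page of terms is held in memory at a time, at the cost of one request per page. Passing the fetch function in as an argument keeps the paging loop independent of how the HTTP request is made.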

This would be nice also for building a spellchecker in another core (instead of using the current sub-index hack).


--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
