On 2010-09-06 17:15, Yonik Seeley wrote:
On Mon, Sep 6, 2010 at 10:52 AM, Roland Villemoes<r...@alpha-solutions.dk>
wrote:
How can I retrieve all words from a Solr core?
I need a list of all the words and how often they occur in the index.
http://wiki.apache.org/solr/TermsComponent
It doesn't currently stream though, so requesting *all* at once might
take too much memory. One workaround is to page via terms.lower and
terms.limit.
Perhaps we should consider adding streaming to the terms component
though. Would you mind opening a JIRA issue?
This would be nice also for building a spellchecker in another core
(instead of using the current sub-index hack).
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com