Hi brent3600, You can use NLP4L for this purpose. NLP4L is good at counting the number of words not only in whole index but also in a set of documents. There is a tutorial for this function.
Count the number of words http://nlp4l.github.io/tutorial_ja.html#useNLP Sorry but the tutorial is written in Japanese now. We'll provide English tutorial soon. Until then please use translation service to read it in English. :) Koji On 2015/04/30 7:34, brent3600 wrote:
We are indexing collections of documents (files) with SOLR, and would like the following capability: Export or pull from SOLR the list of terms that have been indexed for a document or set of documents, along with the term frequency count. 1. Does SOLR already provide an API or method to accomplish this? 2. If not, is there an add-on module that provides this functionality? 3. If not, is it technically feasible at a low level of effort to add this functionality? - brent3600 -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-export-the-list-of-terms-indexed-in-Solr-tp4203124.html Sent from the Solr - User mailing list archive at Nabble.com.