: Iterator<ContentStream> getContentStreams();
:
: Consider the case where you iterate through a local file system.

right, a fixed size in memory array can be iterated, but an unbounded
stream of objects from an external source can't allways be read into an
array effectively -- so when it doubt go with the Iterator (or my
favorite:  Iterable)

: In addition to RequestProcessors, maybe there should be a general
: DocumentProcessor
:
: interface SolrDocumentParser
: {
:   Document parse(ContentStream content);
: }
:
: solrconfig could register "text/html" -> HtmlDocumentParser, and
: RequestProcessors could share the same parser.

what else would the RequestProcessor do if it was delegating all of the
parsing to something else?



-Hoss

Reply via email to