All -

Do we want to remove the functionality in ParserPostProcessor that
accumulates the full text into the "fulltext" property in the Metadata?

If I understand correctly, one of the reasons for the ContentHandler
architecture was to support not reading all content into memory.  Doesn't
this read the entire parsed content into memory?  And, if the wrapped parser
does the same, and the external parser implementation (e.g. Poi) does the
same, then the maximum document size we can support becomes much smaller?

- Keith

-- 
View this message in context: 
http://www.nabble.com/Fulltext-Metadata-Property--tf4643633.html#a13263876
Sent from the Apache Tika - Development mailing list archive at Nabble.com.

Reply via email to