All - Do we want to remove the functionality in ParserPostProcessor that accumulates the full text into the "fulltext" property in the Metadata?
If I understand correctly, one of the reasons for the ContentHandler architecture was to support not reading all content into memory. Doesn't this read the entire parsed content into memory? And, if the wrapped parser does the same, and the external parser implementation (e.g. Poi) does the same, then the maximum document size we can support becomes much smaller? - Keith -- View this message in context: http://www.nabble.com/Fulltext-Metadata-Property--tf4643633.html#a13263876 Sent from the Apache Tika - Development mailing list archive at Nabble.com.
