Ability to limit the amount of extracted text
---------------------------------------------
Key: TIKA-261
URL: https://issues.apache.org/jira/browse/TIKA-261
Project: Tika
Issue Type: New Feature
Components: parser
Reporter: Jukka Zitting
Priority: Minor
It would be nice to have some generic mechanism to limit the amount of text
extracted from a single document. This would be especially useful for anything
that buffers the entire result of text extraction in memory, and it could also
be used to improve performance in cases where having the entire contents of
huge documents is not that important.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.