Support external parser programs
--------------------------------
Key: TIKA-99
URL: https://issues.apache.org/jira/browse/TIKA-99
Project: Tika
Issue Type: New Feature
Reporter: Jukka Zitting
Priority: Minor
There should be a parser component (like ExternalParser) that invokes an
external command line application, feeds the given document as input to the
application, and returns the output from the application as the extracted text
(or xhtml) content. This would allow integration with tools like catdoc or
pdf2txt.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.