[
https://issues.apache.org/jira/browse/TIKA-99?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann updated TIKA-99:
----------------------------------
Component/s: parser
> Support external parser programs
> --------------------------------
>
> Key: TIKA-99
> URL: https://issues.apache.org/jira/browse/TIKA-99
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Reporter: Jukka Zitting
> Priority: Minor
>
> There should be a parser component (like ExternalParser) that invokes an
> external command line application, feeds the given document as input to the
> application, and returns the output from the application as the extracted
> text (or xhtml) content. This would allow integration with tools like catdoc
> or pdf2txt.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.