Support external parser programs
--------------------------------

                 Key: TIKA-99
                 URL: https://issues.apache.org/jira/browse/TIKA-99
             Project: Tika
          Issue Type: New Feature
            Reporter: Jukka Zitting
            Priority: Minor


There should be a parser component (like ExternalParser) that invokes an 
external command line application, feeds the given document as input to the 
application, and returns the output from the application as the extracted text 
(or xhtml) content. This would allow integration with tools like catdoc or 
pdf2txt.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to