PDFParser causes problems when using encrypted PDF documents
------------------------------------------------------------
Key: TIKA-223
URL: https://issues.apache.org/jira/browse/TIKA-223
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.3
Environment: Java 1.5.x on MAC, WIN, LIN
Reporter: Joachim Zittmayr
Fix For: 0.4
PDFParser.parse() already decrypts the document in order to read its metadata
and then passes it to PDF2XHTML.process(), which calls the inherited getText().
getText() in turn calls writeText(), which tries to decrypt the PDDocument a
second time. That attempt fails because the document is already decrypted. A
possible fix is to override writeText() in PDF2XHTML and omit the
document.isEncrypted() check.
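
The call chain described above can be modelled with a small, self-contained
sketch. It does not use the real Tika or PDFBox classes: FakeDocument,
FakeStripper, FakeXhtmlStripper and DoubleDecryptSketch are hypothetical
stand-ins for PDDocument, the inherited text stripper, PDF2XHTML and PDFParser.
The sketch assumes that isEncrypted() keeps returning true after decryption and
that a second decrypt() call throws. That assumption matches the reported
symptom but has not been checked against the PDFBox source.

```java
import java.io.IOException;
import java.io.StringWriter;
import java.io.Writer;

// Hypothetical stand-in for PDDocument; not the PDFBox API.
class FakeDocument {
    private final String text;
    private final boolean encrypted;
    private boolean decrypted = false;

    FakeDocument(String text, boolean encrypted) {
        this.text = text;
        this.encrypted = encrypted;
    }

    // Assumption: the flag describes the file, so it stays true after decrypt().
    boolean isEncrypted() {
        return encrypted;
    }

    // Assumption: decrypting twice is an error.
    void decrypt(String password) throws IOException {
        if (decrypted) {
            throw new IOException("document is already decrypted");
        }
        decrypted = true;
    }

    String readText() throws IOException {
        if (encrypted && !decrypted) {
            throw new IOException("document is still encrypted");
        }
        return text;
    }
}

// Hypothetical stand-in for the stripper class that PDF2XHTML inherits from.
class FakeStripper {
    String getText(FakeDocument doc) throws IOException {
        StringWriter out = new StringWriter();
        writeText(doc, out);
        return out.toString();
    }

    void writeText(FakeDocument doc, Writer out) throws IOException {
        // The check that triggers the second decryption attempt.
        if (doc.isEncrypted()) {
            doc.decrypt("");
        }
        out.write(doc.readText());
    }
}

// Hypothetical stand-in for PDF2XHTML with the proposed override.
class FakeXhtmlStripper extends FakeStripper {
    @Override
    void writeText(FakeDocument doc, Writer out) throws IOException {
        // No isEncrypted() check: the caller has already decrypted the document.
        out.write(doc.readText());
    }
}

class DoubleDecryptSketch {
    // Stand-in for PDFParser.parse(): decrypt once, then extract the text.
    static String parse(FakeStripper stripper, FakeDocument doc) throws IOException {
        if (doc.isEncrypted()) {
            doc.decrypt("");
        }
        return stripper.getText(doc);
    }

    public static void main(String[] args) throws IOException {
        try {
            parse(new FakeStripper(), new FakeDocument("hello", true));
        } catch (IOException e) {
            System.out.println("inherited writeText failed: " + e.getMessage());
        }
        System.out.println(parse(new FakeXhtmlStripper(), new FakeDocument("hello", true)));
    }
}
```

In this model the inherited writeText() fails on an encrypted document that the
parser has already decrypted, while the overriding version returns the text.
Unencrypted documents behave the same with either stripper.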