[
https://issues.apache.org/jira/browse/TIKA-223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730186#action_12730186
]
Chris A. Mattmann commented on TIKA-223:
----------------------------------------
Hi All:
Is there a patch for this issue, which includes e.g., a unit test for
verification? I'm trying to get the 0.4 RC together and this is one of the 2
only remaining open issues. Please let me know. I'll use the same approach as
for the other open issue. If I don't hear back from anyone in the next 48 hrs,
I'll assume it's OK to push this to 0.5. If I do hear back and there is
significant support to push this to 0.5, I'll do so sooner. If not, can we get
a patch together ASAP? I'd like to cut an RC this week and call for a vote?
My vote is -1 that this is a blocker for 0.4 and +1 to move this to 0.5.
Cheers,
Chris
> PDFParser causes Problems when using encrypted PDF documents
> ------------------------------------------------------------
>
> Key: TIKA-223
> URL: https://issues.apache.org/jira/browse/TIKA-223
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 0.3
> Environment: Java 1.5.x on MAC, WIN, LIN
> Reporter: Joachim Zittmayr
> Fix For: 0.4
>
> Original Estimate: 2h
> Remaining Estimate: 2h
>
> The PDFParser.parse() method decrypts the document for the metadata already
> and then passes it over to PDF2XHTML.process(), which in turn calls the
> inherited getText(). This calls writeText(), which tries to decrypt the
> PDDocument again, but this will fail as it is already decrypted. The solution
> would be to override writeText(), without the document.isEncrypted check.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.