[ 
https://issues.apache.org/jira/browse/PDFBOX-1872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-1872.
----------------------------------------

       Resolution: Fixed
    Fix Version/s: 2.0.0
                   1.8.7

The issue was that the metadata is *not* encrypted, but PDFBox ignored that 
and decrypted the metadata stream anyway, which produced scrambled data.
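
To illustrate why decrypting a plaintext stream scrambles it, here is a minimal, self-contained sketch. It does not use PDFBox and is not the change made in the revisions referenced in this message. The class `MetadataDecryptSketch`, the methods `xorCipher` and `readMetadata`, and the `metadataEncrypted` flag are all hypothetical names, and the repeating-key XOR is only a toy stand-in for the real RC4/AES handling.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class MetadataDecryptSketch {

    // Toy symmetric "cipher": XOR with a repeating key.
    // Assumption: stands in for the real stream cipher; it is NOT RC4 or AES.
    public static byte[] xorCipher(byte[] data, byte[] key) {
        byte[] out = new byte[data.length];
        for (int i = 0; i < data.length; i++) {
            out[i] = (byte) (data[i] ^ key[i % key.length]);
        }
        return out;
    }

    // Hypothetical guard: only run the cipher when the document states
    // that the metadata stream is actually encrypted.
    public static byte[] readMetadata(byte[] raw, byte[] key, boolean metadataEncrypted) {
        return metadataEncrypted ? xorCipher(raw, key) : raw;
    }

    public static void main(String[] args) {
        byte[] key = "secret".getBytes(StandardCharsets.UTF_8);
        byte[] plainXmp = "<?xpacket begin=\"\"?><x:xmpmeta/>".getBytes(StandardCharsets.UTF_8);

        // Faulty behaviour: "decrypting" data that was never encrypted.
        byte[] scrambled = xorCipher(plainXmp, key);
        System.out.println("Unconditional decrypt keeps XML intact: "
                + Arrays.equals(scrambled, plainXmp));

        // Guarded behaviour: the plaintext stream is passed through untouched.
        byte[] guarded = readMetadata(plainXmp, key, false);
        System.out.println("Guarded read keeps XML intact: "
                + Arrays.equals(guarded, plainXmp));
    }
}
```

With a symmetric cipher, running the decryption step on plaintext is equivalent to encrypting it, so an XML parser downstream receives bytes that no longer look like XMP.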

I've fixed the issue in revisions http://svn.apache.org/r1606319 (trunk) and 
http://svn.apache.org/r1606321 (1.8 branch)

[~brzrkr] Thanks for your patience and for insisting when I closed the issue 
some time ago.

> PDMetadata.exportXMPMetadata fails when Metadata has encrypted stream
> ---------------------------------------------------------------------
>
>                 Key: PDFBOX-1872
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1872
>             Project: PDFBox
>          Issue Type: Bug
>          Components: JempBox, PDModel
>    Affects Versions: 1.8.3
>         Environment: Not sure it matters, but Solaris (SunOS 5.10), Java 
> 1.6.0_19
>            Reporter: Pat Hickey
>            Assignee: Andreas Lehmkühler
>            Priority: Minor
>             Fix For: 1.8.7, 2.0.0
>
>
> When the Metadata is encoded with the Crypt filter, exportXMPMetadata() 
> fails to parse the XML. My guess is that PDDocumentCatalog.getMetadata() 
> gives PDMetadata the raw stream instead of the filtered one. Then 
> PDMetadata.exportXMPMetadata() calls XMPMetadata.load(), which cannot parse 
> the encrypted stream.
> While I cannot post the document (proprietary), the outline shown by 
> PDFDebugger goes like this:
> Root:Dictionary(Catalog)
> + AcroForm:Dictionary
> - Metadata:Stream(Metadata:XML)
>   - Filter:Array
>     o [0] Crypt
>   o Length:6302
>   o Subtype:XML
>   o Type:Metadata



--
This message was sent by Atlassian JIRA
(v6.2#6252)
