[ 
https://issues.apache.org/jira/browse/PDFBOX-2595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2595:
---------------------------------------
    Fix Version/s: 2.0.0

> Pdfbox always sets the second part of documentID to the same value
> ------------------------------------------------------------------
>
>                 Key: PDFBOX-2595
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2595
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.8.8, 2.0.0
>            Reporter: Asger Askov Blekinge
>             Fix For: 2.0.0
>
>
> From COSWriter.java lines 1268 (trunk) 1500 (1.8)
> COSString firstID = missingID ? new COSString( md5.digest() ) : 
> (COSString)idArray.get(0);
> COSString secondID = new COSString( md5.digest() );
> Notice that digest is called in both lines
> The javadoc for md5.digest() goes 
> "Completes the hash computation by performing final operations such as 
> padding. The digest is reset after this call is made."
> So, if missingID == true then the secondID will always be the hash of the 
> empty string.
> Acrobat reader uses this value for it's search index, as hinted at here
> https://stackoverflow.com/a/20201670



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to