[
https://issues.apache.org/jira/browse/PDFBOX-2595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Lehmkühler updated PDFBOX-2595:
---------------------------------------
Fix Version/s: 2.0.0
> Pdfbox always sets the second part of documentID to the same value
> ------------------------------------------------------------------
>
> Key: PDFBOX-2595
> URL: https://issues.apache.org/jira/browse/PDFBOX-2595
> Project: PDFBox
> Issue Type: Bug
> Components: PDModel
> Affects Versions: 1.8.8, 2.0.0
> Reporter: Asger Askov Blekinge
> Fix For: 2.0.0
>
>
> From COSWriter.java lines 1268 (trunk) 1500 (1.8)
> COSString firstID = missingID ? new COSString( md5.digest() ) :
> (COSString)idArray.get(0);
> COSString secondID = new COSString( md5.digest() );
> Notice that digest is called in both lines
> The javadoc for md5.digest() goes
> "Completes the hash computation by performing final operations such as
> padding. The digest is reset after this call is made."
> So, if missingID == true then the secondID will always be the hash of the
> empty string.
> Acrobat reader uses this value for it's search index, as hinted at here
> https://stackoverflow.com/a/20201670
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)