Asger Askov Blekinge created PDFBOX-2595:
--------------------------------------------
Summary: Pdfbox always sets the second part of documentID to the
same value
Key: PDFBOX-2595
URL: https://issues.apache.org/jira/browse/PDFBOX-2595
Project: PDFBox
Issue Type: Bug
Components: PDModel
Affects Versions: 1.8.8, 2.0.0
Reporter: Asger Askov Blekinge
>From COSWriter.java lines 1268 (trunk) 1500 (1.8)
COSString firstID = missingID ? new COSString( md5.digest() ) :
(COSString)idArray.get(0);
COSString secondID = new COSString( md5.digest() );
Notice that digest is called in both lines
The javadoc for md5.digest() goes
"Completes the hash computation by performing final operations such as padding.
The digest is reset after this call is made."
So, if missingID == true then the secondID will always be the hash of the empty
string.
Acrobat reader uses this value for it's search index, as hinted at here
https://stackoverflow.com/a/20201670
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)