java.io.IOException: Error converting date when trying to get the cration date.
--------------------------------------------------------------------------------
Key: PDFBOX-977
URL: https://issues.apache.org/jira/browse/PDFBOX-977
Project: PDFBox
Issue Type: Bug
Components: Utilities
Affects Versions: 1.5.0
Reporter: Franck Valentin
Hi,
when I try to get the creation date of these documents
http://www.ebi.ac.uk/pdbe/docs/dbdoc/MSDSD_license4.pdf
http://www.ebi.ac.uk/panda/Publications/evgeni-paper11.pdf
http://www.ebi.ac.uk/luscombe/docs/aa_base.pdf
http://www.ebi.ac.uk/2can/pdf/nar_interpro.pdf
http://www.ebi.ac.uk/asd/altextron/gcag.pdf
I get an IOException:
Caused by: java.io.IOException: Error converting
date:B<It©4Ã@/Vo<U+0097><U+0090><U+008C>o²
at
org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:297)
~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
at
org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:175)
~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
at org.apache.pdfbox.cos.COSDictionary.getDate(COSDictionary.java:797)
~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
at
org.apache.pdfbox.pdmodel.PDDocumentInformation.getCreationDate(PDDocumentInformation.java:210)
~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
at main.parsers.PDFParser.addCreationDate(PDFParser.java:82) ~[na:na]
at main.parsers.PDFParser.parseAndIndex(PDFParser.java:105) ~[na:na]
... 9 common frames omitted
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira