java.io.IOException: Error converting date when trying to get the cration date. --------------------------------------------------------------------------------
Key: PDFBOX-977 URL: https://issues.apache.org/jira/browse/PDFBOX-977 Project: PDFBox Issue Type: Bug Components: Utilities Affects Versions: 1.5.0 Reporter: Franck Valentin Hi, when I try to get the creation date of these documents http://www.ebi.ac.uk/pdbe/docs/dbdoc/MSDSD_license4.pdf http://www.ebi.ac.uk/panda/Publications/evgeni-paper11.pdf http://www.ebi.ac.uk/luscombe/docs/aa_base.pdf http://www.ebi.ac.uk/2can/pdf/nar_interpro.pdf http://www.ebi.ac.uk/asd/altextron/gcag.pdf I get an IOException: Caused by: java.io.IOException: Error converting date:B<It©4Ã@/Vo<U+0097><U+0090><U+008C>o² at org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:297) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na] at org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:175) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na] at org.apache.pdfbox.cos.COSDictionary.getDate(COSDictionary.java:797) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na] at org.apache.pdfbox.pdmodel.PDDocumentInformation.getCreationDate(PDDocumentInformation.java:210) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na] at main.parsers.PDFParser.addCreationDate(PDFParser.java:82) ~[na:na] at main.parsers.PDFParser.parseAndIndex(PDFParser.java:105) ~[na:na] ... 9 common frames omitted -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira