java.io.IOException: Error converting date  when trying to get the cration date.
--------------------------------------------------------------------------------

                 Key: PDFBOX-977
                 URL: https://issues.apache.org/jira/browse/PDFBOX-977
             Project: PDFBox
          Issue Type: Bug
          Components: Utilities
    Affects Versions: 1.5.0
            Reporter: Franck Valentin


Hi,

when I try to get the creation date of these documents
   http://www.ebi.ac.uk/pdbe/docs/dbdoc/MSDSD_license4.pdf
   http://www.ebi.ac.uk/panda/Publications/evgeni-paper11.pdf
   http://www.ebi.ac.uk/luscombe/docs/aa_base.pdf
   http://www.ebi.ac.uk/2can/pdf/nar_interpro.pdf
   http://www.ebi.ac.uk/asd/altextron/gcag.pdf

I get an IOException:
Caused by: java.io.IOException: Error converting 
date:B<It©4Ñ@/Vo<U+0097><U+0090><U+008C>o²
        at 
org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:297) 
~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
        at 
org.apache.pdfbox.util.DateConverter.toCalendar(DateConverter.java:175) 
~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
        at org.apache.pdfbox.cos.COSDictionary.getDate(COSDictionary.java:797) 
~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
        at 
org.apache.pdfbox.pdmodel.PDDocumentInformation.getCreationDate(PDDocumentInformation.java:210)
 ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
        at main.parsers.PDFParser.addCreationDate(PDFParser.java:82) ~[na:na]
        at main.parsers.PDFParser.parseAndIndex(PDFParser.java:105) ~[na:na]
        ... 9 common frames omitted


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to