I'm using PDFBox 0.7.4 together with Aperture. While crawling, I often get the following warning and exception:

[Aug 14 14:26:34] WARN (PdfExtractor.java:169) - Exception while extracting creation date of file:////De-fs003/projects/Active/EC305479%20-%20EUTELSAT%20W2M%20&%20I3K/SC200129%20-%20EUTELSAT%20W2M%20&%20I3K/02%20-%20Quality%20Plans/Standards/bssc2000(1)i10.PDF
java.io.IOException: Error converting date:26 May 2000 11:25
    at org.pdfbox.util.DateConverter.toCalendar(DateConverter.java:254)
    at org.pdfbox.util.DateConverter.toCalendar(DateConverter.java:134)
    at org.pdfbox.cos.COSDictionary.getDate(COSDictionary.java:797)
    at org.pdfbox.pdmodel.PDDocumentInformation.getCreationDate(PDDocumentInformation.java:232)
    at org.semanticdesktop.aperture.extractor.pdf.PdfExtractor.extractNormalMetadata(PdfExtractor.java:166)
    at org.semanticdesktop.aperture.extractor.pdf.PdfExtractor.processDocument(PdfExtractor.java:103)
    at org.semanticdesktop.aperture.extractor.pdf.PdfExtractor.extract(PdfExtractor.java:62)

The creation date stored in this PDF ("26 May 2000 11:25") is evidently in a format that DateConverter.toCalendar cannot parse.

I have found the following issue, which seems to discuss my problem, but I don't see any solution there:
http://issues.apache.org/jira/browse/PDFBOX-465

Is there a solution? Or should I simply read the creation date myself?

Thanks,
Gert.
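
For the "read the creation date myself" option, a minimal sketch might look like the following. It assumes the raw date string has already been obtained from the document by some other means, and it deliberately makes no PDFBox calls. The class name LenientPdfDateParser, the list of fallback patterns, and the choice of UTC for strings that carry no time zone are all assumptions made for illustration; only the sample value "26 May 2000 11:25" comes from the log above.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Calendar;
import java.util.Locale;
import java.util.TimeZone;

/** Hypothetical helper: parses non-standard date strings found in PDF metadata. */
public class LenientPdfDateParser {

    // Assumed fallback patterns; extend with whatever formats show up in your files.
    private static final String[] PATTERNS = {
        "dd MMM yyyy HH:mm:ss",
        "dd MMM yyyy HH:mm"
    };

    // Assumption: strings such as "26 May 2000 11:25" carry no zone, so UTC is used.
    private static final TimeZone ZONE = TimeZone.getTimeZone("UTC");

    /** Returns the parsed date, or null if no pattern matches. */
    public static Calendar parse(String raw) {
        if (raw == null) {
            return null;
        }
        String text = raw.trim();
        for (String pattern : PATTERNS) {
            // English month names, strict field checking, fixed zone.
            SimpleDateFormat format = new SimpleDateFormat(pattern, Locale.ENGLISH);
            format.setLenient(false);
            format.setTimeZone(ZONE);
            try {
                Calendar result = Calendar.getInstance(ZONE, Locale.ENGLISH);
                result.setTime(format.parse(text));
                return result;
            } catch (ParseException e) {
                // This pattern did not match; try the next one.
            }
        }
        return null;
    }

    public static void main(String[] args) {
        Calendar date = parse("26 May 2000 11:25");
        System.out.println(date == null ? "unparseable" : date.getTime().toString());
    }
}
```

Returning null instead of throwing lets the caller skip the creation date for a file without aborting the rest of the metadata extraction.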
