Hi,
Am 13.10.2012 10:08, schrieb wangxinjing1988:
Hello,my Friends.I had the problem of using PDFbox.When I wanted using PDFbox to
read file of pdf in java environment, Most files can be read.but a few of files
can't be read.For example notes.pdf.the detail of exception:as felow,
java.io.IOException: Error: Expected an integer type, actual='Warning'
at org.apache.pdfbox.pdfparser.BaseParser.readInt(BaseParser.java:1384)
at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:517)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:184)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1069)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1036)
at com.schoolknowledge.lucene.search.Search.main(Search.java:123)
please ,help. stone.wng
Without having a hand on the document this isjust a guess. You should try the
new non-sequential parser using PDDocument#loadNonSeq instead of PDDocument#load.
BR
Andreas Lehmkühler