Did you try the ToXMLHandler?
From: yeshwanth kumar [mailto:yeshwant...@gmail.com]
Sent: Monday, June 30, 2014 4:50 PM
To: Allison, Timothy B.
Subject: Re: Stack Overflow Question
hi tim,
i tried in all possible ways,
instead of reading entire zip file i parsed individual zipentries,
but even
Good to hear. Let us know if you have any other questions or when you run into
surprises.
From: yeshwanth kumar [mailto:yeshwant...@gmail.com]
Sent: Tuesday, July 01, 2014 10:23 AM
To: Allison, Timothy B.
Subject: Re: Stack Overflow Question
hi tim,
i forgot to change the BodyContentHandler
On Fri, 27 Jun 2014, Daniel Gibby wrote:
java.io.IOException: Error: Header doesn't contain versioninfo
at org.apache.pdfbox.pdfparser.PDFParser.parseHeader(PDFParser.java:335)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:177)
at
I'll send a note over the the PDFBox list and ask what they think.
Thanks,
Daniel
On 7/1/2014 11:51 AM, Nick Burch wrote:
On Fri, 27 Jun 2014, Daniel Gibby wrote:
java.io.IOException: Error: Header doesn't contain versioninfo
at
mailto:dgi...@edirectpublishing.comOn 7/1/2014 12:04 PM, Jukka Zitting wrote:
The TaggedInputStream class [1] was designed for such cases where we
want to distinguish between IOExceptions thrown by the underlying
InputStream and those thrown by the library processing the stream. It
can be
I'm also wondering whether the AutoDetectParser would handle these
IOExceptions differently than the PDFParser.
Does AutoDetectParser just hand off to the appropriate class, such as
PDFParser?
In other words, would my problem be solved by using AutoDetectParser
instead of going straight to