The beginning of the PDF is OK.
I assume you're not using the latest version (1.8.8.). However I don't
know if that is the cause. You are using this call:
public static PDDocument load( InputStream input )
Make sure that you are really reading a PDF from the start when working
concu
%PDF-1.3
%«Ïè¢
8 0 obj
On 25 February 2015 at 23:23, Tilman Hausherr wrote:
> Please upload the PDF somewhere, or open it in an editor and copy & paste
> the first three lines.
>
> Tilman
>
> Am 26.02.2015 um 08:02 schrieb Swapnil Raverkar:
>
> Hi,
>>
>> We are randomly getting following except
Please upload the PDF somewhere, or open it in an editor and copy &
paste the first three lines.
Tilman
Am 26.02.2015 um 08:02 schrieb Swapnil Raverkar:
Hi,
We are randomly getting following exception while parsing PDF documents. We
observed that we get this exception while parsing multiple f
Hi,
We are randomly getting following exception while parsing PDF documents. We
observed that we get this exception while parsing multiple files
concurrently :
"Error: Header doesn't contain versioninfo"
Caused by: java.io.IOException: Error: Header doesn't contain versioninfo
at org.apache.pd
Are you running on a headless system, such as a server? If so, you probably
don’t have any fonts installed. Even though you’re just doing text extraction,
this matters because the dimensions of the characters need to be taken into
account and many PDFs do not embed the fonts which they depend on
Hi Kevin,
I haven't committed it because I would like the opinion of John first.
If you're building from source you could make the change yourself with
the patch that I attached.
Tilman
Am 25.02.2015 um 15:04 schrieb Kevin Morin:
Hi Tilman,
great news! When do you think this will be availa
Thanks for your help Tilman, I have just tried 2.0 and it works well. Some API
changes have to be taken into account, but not too much of a problem in this
case.
Alan.
Alan Masters | Principal Analyst / Programmer | IT Department
Direct tel: 01908 747126 | email: amast...@nhbc.co.uk
NHBC | NH
Hello,
I'm extracting text from PDFs using PDFTextStripperByArea and get a lot of
these in the log:
Feb 25, 2015 2:01:44 PM org.apache.pdfbox.pdmodel.font.ExternalFonts
getTrueTypeFallbackFont
SEVERE: No TTF fallback font for 'Helvetica'
Feb 25, 2015 2:01:44 PM org.apache.pdfbox.pdmodel.font.PDT
Hi Tilman,
great news! When do you think this will be available in the snapshot?
BR
Kevin
Le 24/02/2015 00:36, Tilman Hausherr a écrit :
Some good news: your file can be rendered with twelvemonkeys. See at
the bottom of
https://issues.apache.org/jira/browse/PDFBOX-2128
Tilman
Am 03.02.2015
9 matches
Mail list logo