Ok, phew. Yes, they are, but we’re not…yet. ☺
Tika 1.13 should be around the corner, and that’ll include PDFBox 2.0 (and
Jempbox!).
Best,
Tim
From: Chris Bamford [mailto:cbamf...@mimecast.com]
Sent: Friday, April 22, 2016 1:05 PM
To: user@tika.apache.org
Subject: Re: Jempbox runtime
Thanks.
No, it was my confusion - PDFBox (which is also part of our app) has recently
dropped it (see http://pdfbox.apache.org/2.0/migration.html).
So we may be actively managing it out - will revisit and hopefully all will be
good.
Cheers,
- Chris
Chris Bamford
Lead Software Engineer
m: +44
That should be in our tika-parsers’ pom
1.8.11
So, um, where did you see that we had dropped Jempbox? I know that we wanted
to at some point, but XMPBox only works on PDF/A so we aren’t going to move to
that any time soon.
Cheers,
Tim
From: Chris Bamford [mailto:cbamf...@mimecas
Hi Tim,
Nice to hear from you too - and thanks for the quick reply!
Good to know about the dependency, will try to include it (what version do you
recommend?).
Thanks
- Chris
Chris Bamford
Lead Software Engineer
m: +44 7860 405292
p: +44 207 847 8700
w: www.mimecast.com
Address click here: ww
Hi Chris,
Good to hear from you. We do still use Jempbox in 1.12 for the PDFParser and
the JempboxExtractor. The RTF must have an embedded PDF or Jpeg or another
image file.
Is there any chance Maven is not smiling upon you with transitive
dependencies? When you bundle your app are you in
Hi
I recently upgraded to tika 1.12 from 1.7 and read the notes about Jempbox
being no longer used. My pom now pulls in 1.12 versions of tika-core,
tika-parsers, tika-xmp and tika-bundle.
The app is running well but very occasionally we see:
java.lang.NoClassDefFoundError: org/apache/jempbox/