On Thu, 15 Oct 2015, Adam Retter wrote:
However,

java -Dtika.config=/tmp/tika-config.xml -cp
/Users/aretter/Downloads/tika-core-1.10.jar:/Users/aretter/Downloads/tika-parsers-1.10.jar:/Users/aretter/Downloads/pdfbox-2.0.0-20151014.234027-1764.jar:/Users/aretter/Downloads/fontbox-2.0.0-20151014.233904-1817.jar:/Users/aretter/Downloads/jempbox-2.0.0-20140823.120514-532.jar:/Users/aretter/.m2/repository/commons-logging/commons-logging/1.2/commons-logging-1.2.jar
ExtractTest

Still returns no text from the PDF. Is it just the case that Tika
doesn't work with PDFBox version 2.0.0-SNAPSHOT?

Tika 1.10 does not. There's a JIRA issue covering the work to upgrade Tika to PDFBox 2.0; if you want to use PDFBox 2, you'll need to take your Tika build from the branch mentioned there. (Well, that, or wait a bit.)

Nick
