Hi

the steps shown @

http://wiki.apache.org/solr/ExtractingRequestHandler#Upgrading_Tika

worked for me. I use the following versions for extracting MS-Office and PDF files:

tika 0.9
pdfbox 1.6.0
poi 3.8beta3

This combination turned out to be the most failsafe at the moment.

Cheers
-Tom

On 08/19/2011 01:44 PM, nirnaydewan wrote:
As in Tika 0.9, the formatting issue for extracting content from PDF&  DOC
files have been fixed, i want to integrate this in my existing Solr project.

Please let me know the steps.

All i have is the downloaded folder of Solr 3.3.0 and currently using the
attached Jetty server only. This version contains Tika 0.8 version.

All i need in simple steps as to how to replace 0.8 with 0.9



Thanks

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Tika-0-9-integration-in-Solr-3-3-0-tp3267799p3267799.html
Sent from the Apache Tika - Development mailing list archive at Nabble.com.



--
Author of the book "Plone 3 Multimedia" - http://amzn.to/dtrp0C

Tom Gross
email.............@toms-projekte.de
skype.....................tom_gross
web.........http://toms-projekte.de
blog...http://blog.toms-projekte.de

Reply via email to