We are successfully extracting PDF content with Solr 3.1 and Tika 0.9. Replace fontbox-1.3.1.jar jempbox-1.3.1.jar pdfbox-1.3.1.jar tika-core-0.8.jar tika-parsers-0.8.jar
with fontbox-1.4.0.jar jempbox-1.4.0.jar pdfbox-1.4.0.jar tika-core-0.9.jar tika-parsers-0.9.jar I'm not entirely certain, if a recompile of Solr was necessary or not. Andreas ________________________________ From: Surendra <csnsha...@gmail.com> To: solr-user@lucene.apache.org Sent: Tue, June 21, 2011 5:18:31 AM Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1 Hi Andreas I tried solr 3.1 as well as 3.2... i was not able to overcome these issues with the newer versions too. For me, I need the attr_content:* should return me results (with 1.4.1 this is successful) which is not happening . It indexes well in 3.1 but in 3.2 i have the following issue. Invalid version or the data in not in 'javabin' format --Surendra