Re: upgrading to Tika 0.9 on Solr 1.4.1

2012-02-24 Thread bing
Hi, all, I tried to upgrade tika0.8 to tika0.10 on solr3.3.0, following the similar steps, but failed. 1. Replace the following jars in /contrib/extraction/ fontbox-1.6.0, jempbox-1.6.0, pdfbox-1.6.0, tika-core-0.10, tika-parsers-0.10; 2. Copy all the jars in /contrib/langid/* from solr3.5.0

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-07-06 Thread Surendra
I have upgraded my Solr Distribution to 3.2 and also the referring jars of my application (especially the solr jar was 1.4.1 in my application which calls solr...hence causing javabin exception...) . Also updated the pdfbox/jempbox/fontbox to latest versions and Tika to 0.9 version...which made

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-22 Thread Surendra
Hi Chris ,Andreas I have upgraded to solr 3.2 ... everything seems fine now. I will have to integrate this to my application and observe if any further issues...again thanks for your patience and time... --Surendra

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-22 Thread Mattmann, Chris A (388J)
Glad it worked out! Cheers, Chris On Jun 22, 2011, at 5:14 AM, Surendra wrote: Hi Chris ,Andreas I have upgraded to solr 3.2 ... everything seems fine now. I will have to integrate this to my application and observe if any further issues...again thanks for your patience and time...

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-21 Thread Surendra
Hi Chris I did a proper checkout of TIKA 0.9 and built the jars as specified in the http://tika.apache.org/0.9/gettingstarted.html; and replaced the existing tika0.4 jars with 0.9 jars. I don't see any difference. The documents are getting indexed but the fmap.content(attr_content) is still not

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-21 Thread Surendra
Hi Andreas I tried solr 3.1 as well as 3.2... i was not able to overcome these issues with the newer versions too. For me, I need the attr_content:* should return me results (with 1.4.1 this is successful) which is not happening . It indexes well in 3.1 but in 3.2 i have the following issue.

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-21 Thread Mattmann, Chris A (388J)
Hi Surendra, Thanks. Besides replacing the tika-*-0.9.jar files, you also need to replace the dependency jar files for the other libs as well since they have been upgraded. It's also possible that b/c of API changes, Solr 1.4.1 won't work with Tika 0.9 without modifying the

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-21 Thread Andreas Kemkes
certain, if a recompile of Solr was necessary or not. Andreas From: Surendra csnsha...@gmail.com To: solr-user@lucene.apache.org Sent: Tue, June 21, 2011 5:18:31 AM Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1 Hi Andreas I tried solr 3.1 as well as 3.2... i

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-20 Thread Surendra
Mattmann, Chris A (388J chris.a.mattmann at jpl.nasa.gov writes: Hi Jo, You may consider checking out Tika trunk, where we recently have a Tika JAX-RS web service [1] committed as part of the tika-server module. You could probably wire DIH into it and accomplish the same thing. Cheers,

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-20 Thread Mattmann, Chris A (388J)
Hi Surendra, On Jun 20, 2011, at 4:59 AM, Surendra wrote: Hey Chris I have added tika-core 0.9 and tika-parsers 0.9 to Solr1.4.1 (extraction/lib) after building them using the source provided by TIKA. Now I have an issue with this. I am working with extracting PDF content using Solr. I

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-06-20 Thread Andreas Kemkes
, 2011 7:18:34 AM Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1 Hi Surendra, On Jun 20, 2011, at 4:59 AM, Surendra wrote: Hey Chris I have added tika-core 0.9 and tika-parsers 0.9 to Solr1.4.1 (extraction/lib) after building them using the source provided by TIKA. Now I have an issue

upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread jo
I have tried the steps indicated here: http://wiki.apache.org/solr/ExtractingRequestHandler http://wiki.apache.org/solr/ExtractingRequestHandler and when I try to parse a document nothing would happen, no error.. I have copied the jar files everywhere, and nothing.. can anyone give me the steps

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread Jan Høydahl
Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev (http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/) It includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar) replacement with lots of other bug fixes you'd also like (check changes.txt). svn co

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread Markus Jelsma
You don't want to use 0.8 if you're parsing PDF. Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev (http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/) It includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar) replacement with lots of other bug

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread Mattmann, Chris A (388J)
Hi Jo, You may consider checking out Tika trunk, where we recently have a Tika JAX-RS web service [1] committed as part of the tika-server module. You could probably wire DIH into it and accomplish the same thing. Cheers, Chris [1] https://issues.apache.org/jira/browse/TIKA-593 On Feb 24,

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread jo
You guys are great.. I will stick for now to the release version and if I have problem parsing I will give the branch jars a try the reason I am looking for upgrading tika is because tika keeps improving on things like languages, mime type support, and so on thanks again JO -- View

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread Darx Oman
hi if you want to index pdf files then use tika 0.6 because 0.7 and 0.8 does not detect the correctly the pdfParse

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread Andreas Kemkes
@lucene.apache.org Sent: Fri, February 25, 2011 10:33:39 AM Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1 hi if you want to index pdf files then use tika 0.6 because 0.7 and 0.8 does not detect the correctly the pdfParse

Re: upgrading to Tika 0.9 on Solr 1.4.1

2011-02-25 Thread Mattmann, Chris A (388J)
) Andreas From: Darx Oman darxo...@gmail.com To: solr-user@lucene.apache.org Sent: Fri, February 25, 2011 10:33:39 AM Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1 hi if you want to index pdf files then use tika 0.6 because 0.7 and 0.8 does