Hi, all,
I tried to upgrade tika0.8 to tika0.10 on solr3.3.0, following the similar
steps, but failed.
1. Replace the following jars in /contrib/extraction/
fontbox-1.6.0, jempbox-1.6.0, pdfbox-1.6.0, tika-core-0.10,
tika-parsers-0.10;
2. Copy all the jars in /contrib/langid/* from solr3.5.0
I have upgraded my Solr Distribution to 3.2 and also the referring jars of my
application (especially the solr jar was 1.4.1 in my application which calls
solr...hence causing javabin exception...) . Also updated the
pdfbox/jempbox/fontbox to latest versions and Tika to 0.9 version...which made
Hi Chris ,Andreas
I have upgraded to solr 3.2 ... everything seems fine now. I will have to
integrate this to my application and observe if any further issues...again
thanks for your patience and time...
--Surendra
Glad it worked out!
Cheers,
Chris
On Jun 22, 2011, at 5:14 AM, Surendra wrote:
Hi Chris ,Andreas
I have upgraded to solr 3.2 ... everything seems fine now. I will have to
integrate this to my application and observe if any further issues...again
thanks for your patience and time...
Hi Chris
I did a proper checkout of TIKA 0.9 and built the jars as specified in the
http://tika.apache.org/0.9/gettingstarted.html; and replaced the existing
tika0.4 jars with 0.9 jars. I don't see any difference. The documents are
getting indexed but the fmap.content(attr_content) is still not
Hi Andreas
I tried solr 3.1 as well as 3.2... i was not able to overcome these issues with
the newer versions too. For me, I need the attr_content:* should return me
results (with 1.4.1 this is successful) which is not happening . It indexes well
in 3.1 but in 3.2 i have the following issue.
Hi Surendra,
Thanks. Besides replacing the tika-*-0.9.jar files, you also need to replace
the dependency jar files for the other libs as well since they have been
upgraded. It's also possible that b/c of API changes, Solr 1.4.1 won't work
with Tika 0.9 without modifying the
certain, if a recompile of Solr was necessary or not.
Andreas
From: Surendra csnsha...@gmail.com
To: solr-user@lucene.apache.org
Sent: Tue, June 21, 2011 5:18:31 AM
Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1
Hi Andreas
I tried solr 3.1 as well as 3.2... i
Mattmann, Chris A (388J chris.a.mattmann at jpl.nasa.gov writes:
Hi Jo,
You may consider checking out Tika trunk, where we recently have a Tika JAX-RS
web service [1] committed as
part of the tika-server module. You could probably wire DIH into it and
accomplish the same thing.
Cheers,
Hi Surendra,
On Jun 20, 2011, at 4:59 AM, Surendra wrote:
Hey Chris
I have added tika-core 0.9 and tika-parsers 0.9 to Solr1.4.1 (extraction/lib)
after building them using the source provided by TIKA. Now I have an issue
with
this. I am working with extracting PDF content using Solr. I
, 2011 7:18:34 AM
Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1
Hi Surendra,
On Jun 20, 2011, at 4:59 AM, Surendra wrote:
Hey Chris
I have added tika-core 0.9 and tika-parsers 0.9 to Solr1.4.1 (extraction/lib)
after building them using the source provided by TIKA. Now I have an issue
I have tried the steps indicated here:
http://wiki.apache.org/solr/ExtractingRequestHandler
http://wiki.apache.org/solr/ExtractingRequestHandler
and when I try to parse a document nothing would happen, no error.. I have
copied the jar files everywhere, and nothing.. can anyone give me the steps
Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev
(http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/)
It includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar) replacement
with lots of other bug fixes you'd also like (check changes.txt).
svn co
You don't want to use 0.8 if you're parsing PDF.
Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev
(http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/) It
includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar)
replacement with lots of other bug
Hi Jo,
You may consider checking out Tika trunk, where we recently have a Tika JAX-RS
web service [1] committed as part of the tika-server module. You could probably
wire DIH into it and accomplish the same thing.
Cheers,
Chris
[1] https://issues.apache.org/jira/browse/TIKA-593
On Feb 24,
You guys are great.. I will stick for now to the release version and if I
have problem parsing I will give the branch jars a try the reason I am
looking for upgrading tika is because tika keeps improving on things like
languages, mime type support, and so on
thanks again
JO
--
View
hi
if you want to index pdf files then use tika 0.6
because 0.7 and 0.8 does not detect the correctly the pdfParse
@lucene.apache.org
Sent: Fri, February 25, 2011 10:33:39 AM
Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1
hi
if you want to index pdf files then use tika 0.6
because 0.7 and 0.8 does not detect the correctly the pdfParse
)
Andreas
From: Darx Oman darxo...@gmail.com
To: solr-user@lucene.apache.org
Sent: Fri, February 25, 2011 10:33:39 AM
Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1
hi
if you want to index pdf files then use tika 0.6
because 0.7 and 0.8 does
19 matches
Mail list logo