Re: TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-27 Thread Erick Erickson
My *real* suggestion would be to not do it. Write a SolrJ
program that uses whatever version of Tika you want
to download and use *that* to index rather than try to
sort through the various jar dependencies in Solr. It'd be
safer.

Otherwise, you're on your own here.

Here's some example code:

http://www.lucidimagination.com/blog/2012/02/14/indexing-with-solrj/

Best
Erick

On Sun, Feb 26, 2012 at 9:01 PM, bing jsuser1...@hotmail.com wrote:
 Hi, Erick,

 My idea is to use Tika0.10 in Dspace1.7.2, which is based on two steps:

 1. Upgrade Solr1.4.1 to Solr3.3.0 in Dspace1.7.2
 In the following link, upgraded Solr  Lucene 3.3.0 has been resolved.
 https://jira.duraspace.org/browse/DS-980

 2. Upgrade to Tika0.10 in Solr3.3.0
 In the following link, people has tried to upgrade Tika0.8 to Tika0.9.
 http://lucene.472066.n3.nabble.com/upgrading-to-Tika-0-9-on-Solr-1-4-1-td2570526.html

 I was thinking, if both the above two steps can be achieved, then maybe I
 can get it done. What is your suggestion?

 Thank you.

 Best Regards,
 Bing

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/TikaLanguageIdentifierUpdateProcessorFactory-since-Solr3-5-0-to-be-used-in-Solr3-3-0-tp3771620p3779437.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-27 Thread bing
HI, Erick, 

I can write SolrJ client to call Tika, but I am not certain where to invoke
the client. In my case, I work on Dspace to call Solr, and I suppose the
client should be invoked in-between Dspace and Solr. That is, Dspace invokes
SolrJ client when doing index/query,  which call Tika and Solr. Do you think
it is reasonable? 

Best Regards, 
Bing 

--
View this message in context: 
http://lucene.472066.n3.nabble.com/TikaLanguageIdentifierUpdateProcessorFactory-since-Solr3-5-0-to-be-used-in-Solr3-3-0-tp3771620p3782793.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-27 Thread Erick Erickson
It runs any place that has access to the raw files and an HTTP connection
to the Solr server, which is another way of saying sounds good to me.

Erick

On Mon, Feb 27, 2012 at 9:18 PM, bing jsuser1...@hotmail.com wrote:
 HI, Erick,

 I can write SolrJ client to call Tika, but I am not certain where to invoke
 the client. In my case, I work on Dspace to call Solr, and I suppose the
 client should be invoked in-between Dspace and Solr. That is, Dspace invokes
 SolrJ client when doing index/query,  which call Tika and Solr. Do you think
 it is reasonable?

 Best Regards,
 Bing

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/TikaLanguageIdentifierUpdateProcessorFactory-since-Solr3-5-0-to-be-used-in-Solr3-3-0-tp3771620p3782793.html
 Sent from the Solr - User mailing list archive at Nabble.com.


Re: TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-27 Thread bing
Hi, Erick, 

I get your point. Thank you so much. 

Best Regards, 
Bing

--
View this message in context: 
http://lucene.472066.n3.nabble.com/TikaLanguageIdentifierUpdateProcessorFactory-since-Solr3-5-0-to-be-used-in-Solr3-3-0-tp3771620p3782938.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-26 Thread bing
Hi, Erick, 

My idea is to use Tika0.10 in Dspace1.7.2, which is based on two steps:

1. Upgrade Solr1.4.1 to Solr3.3.0 in Dspace1.7.2 
In the following link, upgraded Solr  Lucene 3.3.0 has been resolved. 
https://jira.duraspace.org/browse/DS-980

2. Upgrade to Tika0.10 in Solr3.3.0 
In the following link, people has tried to upgrade Tika0.8 to Tika0.9.  
http://lucene.472066.n3.nabble.com/upgrading-to-Tika-0-9-on-Solr-1-4-1-td2570526.html

I was thinking, if both the above two steps can be achieved, then maybe I
can get it done. What is your suggestion? 

Thank you. 

Best Regards, 
Bing 

--
View this message in context: 
http://lucene.472066.n3.nabble.com/TikaLanguageIdentifierUpdateProcessorFactory-since-Solr3-5-0-to-be-used-in-Solr3-3-0-tp3771620p3779437.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-25 Thread Erick Erickson
Well, you can give it a try, I don't know if anyone's done that
before. And you're on your own, I haven't a clue what
the results would be...

Sorry I can't be more help here...
Erick

On Thu, Feb 23, 2012 at 10:44 PM, bing jsuser1...@hotmail.com wrote:
 Hi, all,

 I am using
 org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessorFactory
 (since Solr3.5.0) to do language detection, and it's cool.

 An issue: if I deploy Solr3.3.0, is it possible to import that factory in
 Solr3.5.0 to be used in Solr3.3.0?

 Why I stick on Solr3.3.0 is because I am working on Dspace (discovery) to
 call solr, and for now the highest version that Solr can be upgraded to is
 3.3.0.

 I would hope to do this while keep Dspace + Solr at the most. Say, import
 that factory into Solr3.3.0, is it possible? Does any one happen to know
 certain way to solve this?

 Best Regards,
 Bing

 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/TikaLanguageIdentifierUpdateProcessorFactory-since-Solr3-5-0-to-be-used-in-Solr3-3-0-tp3771620p3771620.html
 Sent from the Solr - User mailing list archive at Nabble.com.


TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-23 Thread bing
Hi, all, 

I am using
org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessorFactory
(since Solr3.5.0) to do language detection, and it's cool.
 
An issue: if I deploy Solr3.3.0, is it possible to import that factory in
Solr3.5.0 to be used in Solr3.3.0? 

Why I stick on Solr3.3.0 is because I am working on Dspace (discovery) to
call solr, and for now the highest version that Solr can be upgraded to is
3.3.0.

I would hope to do this while keep Dspace + Solr at the most. Say, import
that factory into Solr3.3.0, is it possible? Does any one happen to know
certain way to solve this?

Best Regards, 
Bing

--
View this message in context: 
http://lucene.472066.n3.nabble.com/TikaLanguageIdentifierUpdateProcessorFactory-since-Solr3-5-0-to-be-used-in-Solr3-3-0-tp3771620p3771620.html
Sent from the Solr - User mailing list archive at Nabble.com.