Hi Manikandan,

Nutch already has Solr integration. It does not matter how or where Solr
is running: Nutch sends your documents to Solr over HTTP, at the URL you
give it. At present, Nutch does not support SolrCloud directly. If you have
a SolrCloud cluster, Nutch can only index to a single instance of it.
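
To illustrate what "indexing over HTTP to one instance" means, here is a
rough Python sketch. It is not Nutch's own code (Nutch talks to Solr through
its Java indexer); it only builds, without sending, the kind of HTTP POST
that adds documents to a single Solr node. The node address, the collection
name "collection1", the field names and the helper
build_solr_update_request are all assumptions made up for the example.

```python
import json
import urllib.request


def build_solr_update_request(solr_url, docs):
    """Build (but do not send) an HTTP POST that adds docs to one Solr node."""
    # Assumption: the node exposes Solr's JSON update handler at <solr_url>/update.
    endpoint = solr_url.rstrip("/") + "/update?commit=true"
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical private address of ONE node in the cluster.
req = build_solr_update_request(
    "http://10.0.0.12:8983/solr/collection1",
    [{"id": "http://example.com/", "title": "Example page"}],
)
print(req.get_method(), req.full_url)
```

Passing the request to urllib.request.urlopen(req) would actually send it.
The point is that only a reachable URL is needed, so a remote Solr on a
private IP is fine, and the crawl data being in Cassandra makes no
difference to this step.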

Talat



2014/1/5 Manikandan Saravanan <[email protected]>

> I’m running Nutch 2.2.1 on top of a Hadoop 1.7 cluster and I’m using Gora
> to store the crawled data on a remote Cassandra 2.0.4 cluster.
>
> I wish to setup a Solrcloud cluster and index the crawled data on it. Can
> I do that by integrating Nutch and Solr? The tutorial in the website tells
> how to integrate when the crawl data is stored on the filesystem and when
> Solr is running locally. My situation is this:
>
> 1. Crawled data is on a remote Cassandra cluster.
> 2. The Solr cloud will again be remote.
>
> All nodes are, however, in the same data centre and just two racks apart.
> I’m running everything on the private IP within the datacenter and nothing
> goes to the public internet.
>
> --
> Manikandan Saravanan
> Architect - Technology
> TheSocialPeople




-- 
Talat UYARER
Website: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304
