Hi,

We currently have a master-slave setup for solr with two slave servers. We
are using Solrj (stream-update-solr-server) to index master slave, which
takes 6 hours to index around 15 million documents.

I would like to explore hadoop, in particularly for indexing job using
mapreduce approach. 

- I have read some comments on the JIRA tickets, but it still seems unclear
how this setup will work. 
- I am not sure as what tasks will be done at map phase and what on reduce
phase. 
- And would it merge the multiple indices together into one during reduce
phase or is this a separate task out of mapreduce?

Any directions and guidance over this setup would be highly appreciated.

Thanks in advance,
-Ali
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/solr-with-hadoop-tp482688p914483.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to