There is still some work to be done to make DIH play nicely with SolrCloud in 
terms of failover.

https://issues.apache.org/jira/browse/SOLR-4058 is one of the issues that 
should be addressed.

I think I made another issue or two, but I don't remember them offhand.

- Mark

On Mar 13, 2013, at 11:39 AM, jimtronic <jimtro...@gmail.com> wrote:

> I'm curious how people are using DIH with SolrCloud.
> 
> I have cron jobs set up to trigger the dataimports which come from both xml
> files and a sql database. Some are frequent small delta imports while others
> are larger daily xml imports.
> 
> Here's what I've tried:
> 
> 1. Set up a micro box that sends the dataimport requests to a load balancer
> using cron. This didn't work because frequent requests would get spread
> around and at one point all my nodes were doing the dataimport requests at
> the same time.
> 
> 2. Designate one box as the indexer and call dataimport via localhost. The
> problem here is that I now have a single point of failure for indexing -- I
> always have to have that box running. I love that SolrCloud is distributed
> so I can have 3 boxes in my cluster and I don't care which one goes down.
> 
> I don't really know what the solution is, but I guess it would be nice if
> the dataimport was cloud aware. Meaning that the cluster knows an update is
> happening on one of the boxes and won't let another one start. That way I
> could just send the dataimport request up through the load balancer and
> forget about it.
> 
> Anyway, I thought I would see how others are handling this issue.
> 
> Cheers, Jim
> 
> 
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Scaling-SolrCloud-and-DIH-tp4047049.html
> Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to