I've had trouble figuring out what options exist if I want to perform all
indexing off of the production servers (I'd like to keep them only for user
queries).

 

We index data in batches roughly daily, ideally I'd index all solr cloud
shards offline, then move the final index files to the solr cloud instance
that needs it and flip a switch and have it use the new index.

 

Is this possible via either:

1.       Doing the indexing in Hadoop?? (this would be ideal as we have a
significant investment in a hadoop cluster already), or

2.       Maintaining a separate "master" server that handles indexing and
the nodes that receive user queries update their index from there (I seem to
recall reading about this configuration in 3.x, but now we're using solr
cloud)

 

Is there some ideal solution I can use to "protect" the production solr
instances from degraded performance during large index processing periods?

 

Thanks!

David

Reply via email to