Re: SOLR Data Backup

2018-01-22 Thread Rick Leir
. > >BTW, why do we not recommend having Solr as a source of truth? > One reason is that you might want to tune the analysis chain and then reindex. Or your data gets progressively larger, and you want to be able to recover from an OOM during indexing. Rick -- Sorry for being brief.

Re: SOLR Data Backup

2018-01-19 Thread S G
Another option is to have CDCR enabled for Solr and replicate your data to another Solr cluster continuously. BTW, why do we not recommend having Solr as a source of truth? On Thu, Jan 18, 2018 at 4:08 AM, Florian Gleixner wrote: > Am 18.01.2018 um 10:21 schrieb Wael Kader: > >

Re: SOLR Data Backup

2018-01-18 Thread Florian Gleixner
Am 18.01.2018 um 10:21 schrieb Wael Kader: > Hello, > > Whats the best way to do a backup of the SOLR data. > I have a single node solr server and I want to always keep a copy of the > data I have. > > Is replication an option for what I want ? > > I would like to get some tutorials and papers

Re: SOLR Data Backup

2018-01-18 Thread Emir Arnautović
Hi Wael, I am not sure about moving data in HDFS but you should be able to set up slave without reindexing. Did you start the first node in standalone mode? You need to check if replication handler is enabled (should be by default) and set up slave to pull data from the first node. Note that

Re: SOLR Data Backup

2018-01-18 Thread Wael Kader
Hi, The data is always changing for me so I think I can try the replication option. I am using cloudera and the data is saved in HDFS. Is it possible for me to move the data while the index is running without any problems ? I would also like to know if its possible to setup slave/master

Re: SOLR Data Backup

2018-01-18 Thread Charlie Hull
On 18/01/2018 10:06, Wael Kader wrote: Hi, Its not possible for me to re-index the data in some of my indexes is only saved in SOLR. I need this solution to make sure that in case the live index fails, I can move to the backup or replicated index. OK, so now it's down to you to decide whether

Re: SOLR Data Backup

2018-01-18 Thread Emir Arnautović
Hi Weal, In general, it is not recommended to use Solr as a primary storage so it is better to store your data somewhere else. That will allow you o reindex if needed and also allow you to not store some field in index and make it more efficient. When it comes to your original question, it

Re: SOLR Data Backup

2018-01-18 Thread Wael Kader
Hi, Its not possible for me to re-index the data in some of my indexes is only saved in SOLR. I need this solution to make sure that in case the live index fails, I can move to the backup or replicated index. Thanks, Wael On Thu, Jan 18, 2018 at 11:41 AM, Charlie Hull

Re: SOLR Data Backup

2018-01-18 Thread Emir Arnautović
Hi Weal, If you want HA and FT you have to have at least two Solr nodes and 3 zookeeper nodes (if you plan on using SolrCloud). If you want just to be sure you don’t have to reindex your data in case something goes wrong, you can use Solr backup feature:

Re: SOLR Data Backup

2018-01-18 Thread Charlie Hull
On 18/01/2018 09:21, Wael Kader wrote: Hello, Whats the best way to do a backup of the SOLR data. I have a single node solr server and I want to always keep a copy of the data I have. Is replication an option for what I want ? I would like to get some tutorials and papers if possible on the

SOLR Data Backup

2018-01-18 Thread Wael Kader
Hello, Whats the best way to do a backup of the SOLR data. I have a single node solr server and I want to always keep a copy of the data I have. Is replication an option for what I want ? I would like to get some tutorials and papers if possible on the method that should be used in case its