RE: True master-master fail-over without data gaps

Robert Petersen Wed, 09 Mar 2011 09:07:53 -0800

...but the index resides on disk doesn't it???  lol

-----Original Message-----
From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] 
Sent: Wednesday, March 09, 2011 9:06 AM
To: solr-user@lucene.apache.org
Subject: Re: True master-master fail-over without data gaps


Hi,



----- Original Message ----
>
> I'd honestly think about buffer the incoming documents in some store
that's  
>actually made for fail-over persistence reliability, maybe CouchDB or
something.  
>And then that's taking care of not losing anything, and the problem
becomes how  
>we make sure that our solr master indexes are kept in sync with the
actual  
>persistent store; which I'm still not sure about, but I'm thinking it's
a  
>simpler problem. The right tool for the right job, that kind of
failover  
>persistence is not solr's specialty. 
>

But check this!  In some cases one is not allowed to save content to
disk (think 
copyrights).  I'm not making this up - we actually have a customer with
this 
"cannot save to disk" (but can index) requirement.

So buffering to disk is not an option, and buffering in memory is not
practical 
because of the input document rate and their size.

Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



> From: Otis Gospodnetic [otis_gospodne...@yahoo.com]
> Sent:  Tuesday, March 08, 2011 11:45 PM
> To: solr-user@lucene.apache.org
> Subject:  True master-master fail-over without data gaps
> 
> Hello,
> 
> What are  some common or good ways to handle indexing (master)
fail-over?
> Imagine you  have a continuous stream of incoming documents that you
have to
> index without  losing any of them (or with losing as few of them as
possible).
> How do you  set up you masters?
> In other words, you can't just have 2 masters where the  secondary is
the
> Repeater (or Slave) of the primary master and replicates the  index 
>periodically:
> you need to have 2 masters that are in sync at all  times!
> How do you achieve that?
> 
> * Do you just put N masters behind a  LB VIP, configure them both to
point to 
>the
> index on some shared storage  (e.g. SAN), and count on the LB to
fail-over to 
>the
> secondary master when the  primary becomes unreachable?
> If so, how do you deal with index locks?   You use the Native lock and
count 
on
> it disappearing when the primary master  goes down?  That means you
count on 
>the
> whole JVM process dying, which  may not be the case...
> 
> * Or do you use tools like DRBD, Corosync,  Pacemaker, etc. to keep 2
masters
> with 2 separate indices in sync, while  making sure you write to only
1 of 
them
> via LB VIP or otherwise?
> 
> * Or  ...
> 
> 
> This thread is on a similar topic, but is inconclusive:
>   http://search-lucene.com/m/aOsyN15f1qd1
> 
> Here is another similar  thread, but this one doesn't cover how 2
masters are
> kept in sync at all  times:
>   http://search-lucene.com/m/aOsyN15f1qd1
> 
> Thanks,
> Otis
> ----
> Sematext  :: http://sematext.com/ ::  Solr - Lucene - Nutch
> Lucene ecosystem search :: http://search-lucene.com/
> 
>

RE: True master-master fail-over without data gaps

Reply via email to