Re: replication usage? creating dupes?

Damien Katz Tue, 15 Jul 2008 08:45:55 -0700


On Jul 15, 2008, at 11:22 AM, Sho Fukamachi wrote:

O Wise ones,
While attempting to use Futon's built-in replicator function to synca local DB with a (brand new) remote one, the replication kepttiming out. I restarted it several times, and after it finallycompleted I was delighted to find it actually had created morerecords on the remote than exist locally. Hooray, free records!

Replication causing duplicate records? We've never seen that, and itshouldn't be possible. Maybe you replicated to 2 different sourcedatabases to the same target?

Unfortunately they seem to be dupes. It was only about 3000 records,1000 or so records are dupes. This leaves me with a couple ofquestions:
- is there a "safe" way to do replication that doesn't create dupes?

Using the HTTP replicator is the correct way, you issue a HTTPreplication request and it performs the replication. Futon uses that.

- is Couch really sensitive to network fluctuations? I admit, I'm onthe other side of the planet as the test server, but no packet lossor anything I can detect

No, replication can fail at any point in the process and it willrecover without problem at the next replication.

- what is the current best practise to keep two databases in sync?Ie, 2-way (multi master) replication. No dupes. Assume imperfectnetwork (ie, over public internet). This is kind of one of thereasons I am using Couch for this project so .. I would like to doit right!

Replicate the databases on a schedule. CouchDB will figure out whathas changed incrementally.

I also wonder if anyone has started work on a 3rd partysynchronisation tool yet? I'm thinking something that justperiodically queries both DBs, makes a list of unsynced _ids and/or_revs and then PUTS one to the other as necessary. Maybe somethingnice in Ruby? Not that I'm knocking Futon of course, it's just thatin-browser JS seems a little .. fragile, especially after today'sexperience.

The replicator is actually written in Erlang. There is currently anissue with http-timeouts during long replicatons, and we need to dothe replication async with the browser request to fix it. Howeverthere should never be new records (with new IDs) created duringreplication, it doesn't work that way.



Thanks in advance for any suggestions/wisdom.

Sho

If you can zip up the two databases and mail them to me, or post themsome where publicly accessible, I can take a look and see if I canfigure out what happened.

Re: replication usage? creating dupes?

Reply via email to