On 09/02/2009, at 1:07 PM, Damien Katz wrote:


On Feb 8, 2009, at 9:24 PM, Chris Anderson wrote:

On Sun, Feb 8, 2009 at 5:54 PM, Damien Katz <[email protected]> wrote:

It's possible to use MVCC for replication. You'd need to create a special HTTP command to return, in a single request, all the documents you are interested in, and a special replicator that uses that command, loads those documents, and writes them to the destination.
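A minimal sketch of what that replicator's data flow might look like. The snapshot-as-dict, the predicate, and the function name are all invented for illustration; no such HTTP command exists in CouchDB today, and a real implementation would fetch over HTTP and write with a bulk-document request.

```python
# Hypothetical sketch: a one-shot replicator that pulls a filtered set of
# documents from a single consistent MVCC snapshot and writes them to the
# destination. Both databases are modelled as plain dicts for illustration.

def replicate_snapshot(source_snapshot, destination, interested_in):
    """Copy every doc matching the predicate from one consistent snapshot."""
    docs = [doc for doc in source_snapshot.values() if interested_in(doc)]
    for doc in docs:  # in CouchDB this would be one bulk write to the target
        destination[doc["_id"]] = doc
    return len(docs)

source = {
    "a": {"_id": "a", "type": "order"},
    "b": {"_id": "b", "type": "invoice"},
}
target = {}
copied = replicate_snapshot(source, target, lambda d: d["type"] == "order")
```

Because every document comes from the same snapshot, the target ends up internally consistent, which is the whole point of the exercise.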

This sounds a lot like the notification view Damien's been talking
about, where clients can register to be told about database updates
that match particular functions.

The main problem I see with MVCC replication is that if it dies in the
middle, you might not be able to restart it right where you left off.

That would be a big problem when replicating huge databases. Everything must come over in one transaction.

You could still do that incrementally, e.g. it wouldn't have to load in a single request. The key is that the replication shows MVCC boundaries, i.e. adds a marker to the replication stream to indicate when you have passed an MVCC commit point. The current model would simply ignore such markers; nothing else is required, I think. You could even cycle as long as there were new MVCC states, which would give the same 'includes-updates-as-they-come-in' form of replication, but with somewhat more consistency. If these restart points were included in the replication stream, then systems that wanted to allow replication rollback (see below) could reset the rollback MVCC state when they get an end-of-MVCC-state marker.
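To make the marker idea concrete, here is a sketch of a consumer for such a stream. The event shapes ("doc" and "mvcc_commit"), the sequence numbers, and the dict-based target are all invented for illustration; the point is only that the checkpoint advances at commit markers, so a restart resumes from the last complete MVCC state rather than mid-batch.

```python
# Sketch of consuming a replication stream that carries MVCC commit-point
# markers. Documents are buffered and only applied to the target when a
# commit marker arrives, so the target only ever reflects complete MVCC
# states, and the recorded checkpoint is always a safe restart point.

def consume(stream, target, checkpoint):
    pending = {}
    for kind, payload in stream:
        if kind == "doc":
            pending[payload["_id"]] = payload
        elif kind == "mvcc_commit":
            target.update(pending)       # this batch is now a consistent unit
            pending.clear()
            checkpoint["seq"] = payload  # safe point to restart from
    return checkpoint

stream = [
    ("doc", {"_id": "a", "v": 1}),
    ("doc", {"_id": "b", "v": 1}),
    ("mvcc_commit", 2),
    ("doc", {"_id": "c", "v": 1}),  # no commit marker follows: not applied
]
target, checkpoint = {}, {"seq": 0}
consume(stream, target, checkpoint)
```

The current replication model would be the degenerate case that applies each "doc" event immediately and ignores the markers entirely.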

If you cared, however, and if your application model allowed it, then you could guarantee consistency by only accepting completed replications. I imagine you would need to be able to 'undo' an incomplete replication, which would be a matter of allowing the db to be rolled back to the MVCC state that was in effect when the replication started. This would prevent permanent lockup, and I'm sure you'd want this facility to be enabled/disabled in configuration.
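The 'undo' idea could be sketched like this. The copy-based rollback and the failure stand-in are purely illustrative; a real implementation would revert to a stored MVCC root in the database file rather than copy documents around.

```python
# Sketch of rejecting an incomplete replication: record the target's state
# when replication starts, and roll back to it if the replication does not
# run to completion. The target is a plain dict for illustration.

def replicate_or_rollback(docs, target):
    saved_state = dict(target)  # MVCC state at replication start
    try:
        for doc in docs:
            if doc is None:     # stand-in for a mid-stream failure
                raise IOError("replication interrupted")
            target[doc["_id"]] = doc
        return True
    except IOError:
        target.clear()
        target.update(saved_state)  # reject the incomplete replication
        return False

target = {"x": {"_id": "x"}}
ok = replicate_or_rollback([{"_id": "a"}, None, {"_id": "b"}], target)
```

After the failed run, the target is exactly as it was before replication began, so nothing is permanently locked up.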

I want to stress that I know this is only useful for a certain class of use, but I don't think it negatively impacts other uses, so only those uses that want it would pay for it.

Also, this doesn't resolve the cluster-ACID issue, but I'm confident there is a solution there that doesn't impact the clustered/non-exclusive-replication/constant-conflict model.

If you want consistency on the target, you'll have to write-lock the database (not a concept CouchDB really has)

I think that's an application-level concept, but I can imagine a patch that allows it in CouchDB. Still, I'd do that at an application level, because I'm in user-triggered exclusive replication mode.

until the whole replication completes, or write all the docs in one big bulk transaction, which won't work for large databases. And while that transaction is occurring, neither the source nor the target database file itself can be compacted (the compaction will take place, but the old file can't be freed until the full transaction completes).

Yes, but once again I think there are valid use-cases that allow that, e.g. mine at least.

Antony Blakey
--------------------------
CTO, Linkuistics Pty Ltd
Ph: 0438 840 787

The trouble with the world is that the stupid are cocksure and the intelligent are full of doubt.
  -- Bertrand Russell
