Re: Bulk Docs

Antony Blakey Thu, 12 Mar 2009 15:39:25 -0700

On 13/03/2009, at 1:46 AM, Damien Katz wrote:

Atomic bulk docs is in the patch, it just doesn't do conflictchecking. If any docs are conflicts, they are saved anyway asconflicts. This means it's really for message queue functionality,not database consistency, your data is safe and committed but mightnot be immediately available or consistent between docs. The reasonswe are removing all or nothing with conflict checking as it doesn'twork with replication (both offline and clustering) as docs are notreplicated in a single transaction or even in update order. Andgetting it to work with partitioning would cause unacceptable writeperformances. If we leave it, people will rely on the behavior notunderstanding it doesn't really work with the rest of CouchDB.
So if you are currently using bulk docs to guarantee inter-documentconsistency, it already doesn't work with replication. It only workson a single machine, so no master-slave and no hot stand-by setupwould work as neither are guaranteed to be in a consistent state atany point.


The current bulk docs IS useful in a particular scenario.

It allows me, on a single node, to do transactional updates inresponse to e.g. a web submit/AJAX call, without having to expose theconflict model to the user and deal with conflicts in my single-nodecode.


I then have two distinct phases of operation for peers:

1. Replication is triggered by the user and they do nothing else untilreplication commpletes, after which they have to resolve the conflictsgenerated by replication. This code deals with conflicts and aresolution UI and nothing else.

2. Normal operation - concurrent access by multiple applications,multiple users. The code never sees a conflict, and hence the userinteraction and programming model is considerable simpler

There are a few additional features useful in this model, theprincipal ones being either 1) the ability to roll back a partialreplication to deal with network failures; or b) the ability tomaintain monotonic source writes which ensures that each replicationstep is consistent. To date neither of these features have gainedsufficient community support to be considered.

I've presented this model before, and it has been rejected as beingincompatible with the initial couchdb intentions, but in response toTim Parkin, this is the reason for my fork. There are more details tomy effort - pure binary bodies rather than JSON, unification ofattachments with documents, strict metadata/content separation, map/reduce over arbitrary data, generalised derivation, an immutable modelof fully reified state, replication of operations rather than data -but maybe anyone interested can contact me offlist - it's no longerCouchDB and I'm sure everyone's sick of saying/reading "forget it,it's not going to happen" :)


Antony Blakey
-------------
CTO, Linkuistics Pty Ltd
Ph: 0438 840 787

One should respect public opinion insofar as is necessary to avoidstarvation and keep out of prison, but anything that goes beyond thisis voluntary submission to an unnecessary tyranny.

  -- Bertrand Russell

Re: Bulk Docs

Reply via email to