Re: couchdb transactions changes

Antony Blakey Sat, 07 Feb 2009 22:28:18 -0800

I think discussion of this issue is complicated by the lack of a clearexposition of the different ways in which CouchDB may be used/deployed. I have the following in mind:


---------------------------------------


A. A single-node database engine embedded in a desktop application.
B. A single-node database server.
C. A multi-node clustered database server.

Furthermore it might have:

D. No replication or replication from the app purely for backup. Noconflict is possible.E. Replication from a distinguished peer that accepts write operationse.g. a content/query distribution mechanism. No conflict is possible.F. Replication in a p2p mesh e.g. collaborative content management.Conflict is possible.


Then, in a non-orthogonal way, conflicts are dealt with:

G. Not at all because they can't arise.

H. Replication is under user control, and exclusive with 'normal'operation. Conflict resolution is only caused by replication, saidconflicts being resolved by the user using a specialized UI/Workflow.Normal operation sees no conflicts.I. Replication is concurrent with normal operation, and may or may notbe under user control. Normal operation sees conflicts.


---------------------------------------

I have a pending deployment project of type A/E/G, and pendingprojects of types A/F/H and B+A/E/G. In all my cases, update andindexing throughput is not an issue, although replication efficiency,especially of incremental updates to attachments, is a concern.

I understand that there is a sense in which CouchDB was on atrajectory pre-Apache to be C/F/I, but I wonder if the desire toachieve that isn't *unnecessarily* at the expense of other deploymentmodels. In particular, some of these sound like a Notes client, and Ihave heard CouchDB promoted as 'Notes done right', hence my focus onthose kinds of use cases (as opposed to high-throughput db servers).IMO it would be a good thing to not burden these other use cases withthe operational cost of supporting just one of them.

Obviously supporting transactions in a partition-based cluster canimpose a cost (although only if the transaction spans the cluster insome way, the probability of which is potentially lessened by thepartitioning), but what if one could turn them off via configuration?

From what Damien has said about replication, I'm getting the ideathat it is possible to do replication on an MVCC boundary, in the sameway that a view represents an MVCC boundary, although I hear loud andclear that CouchDB has never, ever, claimed that replication works inthat manner.

The benefit of a transactional API vs. a conflict based API, for localoperations, is not only that certain models can only be implementedusing a transactional API, but the transaction failure mode has aclear and simple reflection into the GUI. Users have an expectation oftransactionality, and IMO domain-dependent conflict resolution (asopposed to domain-independent transactionality) is a leap into theunknown. I think it's both less natural and more work for the user.

IMO The tradeoff of user-interface model/complexity vs. single/multi-node deployment vs. transaction cost should be in the hands of theapplication developer.


Antony Blakey
-------------
CTO, Linkuistics Pty Ltd
Ph: 0438 840 787

When I hear somebody sigh, 'Life is hard,' I am always tempted to ask,'Compared to what?'

  -- Sydney Harris

Re: couchdb transactions changes

Reply via email to