Re: couchdb transactions changes

Antony Blakey Wed, 11 Feb 2009 12:28:22 -0800

I vote against this patch for the following reasons:

1. My reading of the Bayou research has shown me that transactions can work with replication. The nature of a transaction is an interesting issue, but orthogonal to this argument.

2. The real performance hit in a partitioned database isn't clear. Furthermore, the hit may be highly dependent on the configuration and details of the write, e.g. application design may result in transactions always going to one shard.

3. In any case, I argue that it should be up to the user whether to trade off a possible performance hit for ACID semantics.

4. I'm not convinced of the utility of the two models proposed as replacements for the bulk operations. IMO it would be better to not have a bulk operation than to have the proposed models.

5. The justification is dependent on the implementation details of a future feature that isn't itself described or known. From a procedural point of view, therefore, it's not possible to assess this argument, because the community has no way of assessing its validity.

6. This argument is also dependent on another argument: that CouchDB must provide a single API over both single-node and multi-node operation, and must not allow the user to take advantage of the differences. I disagree with that, but in any case it's not an argument that has been put and resolved by the community.


On 08/02/2009, at 2:17 AM, Damien Katz wrote:

I'm working on a branch that implements the couchdb security features with replication. It's not done yet, but anyone is welcome to look at the branch in /branches/rep_security.
In this patch I am attempting to implement new transaction models. The old transaction model gives you all-or-nothing commits for a group of docs, along with conflict checking. If any document was in conflict, the transaction as a whole doesn't save.
The problems with this are:
1. Transactions don't work with replication. Replication doesn't repeat the bulk single transaction, it just copies the documents individually to the target replica. This means any downstream replica can and will see inconsistent states until replication fully completes, not "all or nothing" states. With bidirectional replication it is even worse, as you can get edit conflicts that must be resolved by an external process.
2. Transactions don't work in a partitioned database without a huge performance hit (locking + 2 phase commits).
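
The first problem can be illustrated with a minimal sketch. It assumes a toy in-memory store; `ToyDB`, `bulk_save_atomic` and `replicate_stepwise` are hypothetical names invented for illustration and are not CouchDB's API or its replicator. The only point is that a per-document copy exposes intermediate states on the target even when the source commit was atomic.

```python
class ToyDB:
    """Hypothetical in-memory stand-in for a database (not CouchDB's API)."""

    def __init__(self):
        self.docs = {}

    def bulk_save_atomic(self, new_docs):
        # All-or-nothing on the source: build the new state, then swap it in.
        staged = dict(self.docs)
        staged.update(new_docs)
        self.docs = staged


def replicate_stepwise(source, target):
    """Copy documents one at a time, yielding the target state after each copy."""
    for doc_id, body in source.docs.items():
        target.docs[doc_id] = body
        yield dict(target.docs)


source, target = ToyDB(), ToyDB()
source.bulk_save_atomic({"debit": {"amount": -100}, "credit": {"amount": 100}})

states = list(replicate_stepwise(source, target))

# At some point the target held only part of the group that was
# committed atomically on the source.
partial_state_seen = any(len(state) < len(source.docs) for state in states)
```

A reader of the target between the two copies would see one document of the pair without the other, which is the inconsistency described in point 1.
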
So I propose supporting 2 different transaction models:
The first is to support "All or nothing commits", but without guaranteed conflict checking. So you can save a bunch of documents to the database and be sure they are all safely stored, or none are safely stored, but you can't be guaranteed you don't have any conflicts when you do.
The second is to support non-ACID bulk transactions, where some documents fail and some succeed. If the db crashes in the middle of the transaction, some documents may have made it to disk (completely intact), while others have not. The client will need to check to be sure.
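
As a rough sketch of how the two proposed models differ, assuming a toy store with integer revisions: the names `ToyStore`, `save_all_or_nothing` and `save_non_atomic` are hypothetical, and so is the choice to keep a stale update as a flagged conflict in the first model. None of this is taken from the patch itself.

```python
class ToyStore:
    """Hypothetical in-memory store; updates are (doc_id, base_rev, body)."""

    def __init__(self):
        self.revs = {}       # doc_id -> current revision number
        self.bodies = {}     # doc_id -> current body
        self.conflicts = []  # (doc_id, body) stored despite a stale revision

    def save_all_or_nothing(self, updates):
        """Model 1: the whole group is stored together, conflicts are not rejected."""
        revs, bodies = dict(self.revs), dict(self.bodies)
        conflicts = list(self.conflicts)
        for doc_id, base_rev, body in updates:
            if revs.get(doc_id, 0) != base_rev:
                # Assumption: a stale update is kept, but flagged as a conflict.
                conflicts.append((doc_id, body))
            else:
                bodies[doc_id] = body
                revs[doc_id] = base_rev + 1
        # Single swap at the end: either all updates land or none do.
        self.revs, self.bodies, self.conflicts = revs, bodies, conflicts

    def save_non_atomic(self, updates):
        """Model 2: each document succeeds or fails on its own."""
        results = {}
        for doc_id, base_rev, body in updates:
            if self.revs.get(doc_id, 0) != base_rev:
                results[doc_id] = "conflict"
                continue
            self.bodies[doc_id] = body
            self.revs[doc_id] = base_rev + 1
            results[doc_id] = "ok"
        return results


store = ToyStore()
store.save_non_atomic([("a", 0, "a1"), ("b", 0, "b1")])

# Model 2: "a" is current, "b" is based on a stale revision.
results = store.save_non_atomic([("a", 1, "a2"), ("b", 0, "b-stale")])

# Model 1: both updates are stored, and the stale one shows up as a conflict.
store.save_all_or_nothing([("a", 2, "a3"), ("b", 0, "b-stale2")])
```

In the second model the client has to inspect `results` to learn which documents were saved; in the first, the client knows everything was stored but has to look for conflicts afterwards.
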
With these 2 transaction models, it's possible to deploy the same apps on a single machine or a huge partitioned cluster. With the current model, it's only possible to deploy apps on a single machine. I propose we drop the current model, as bulk transactions are not supportable in clustered or replicated setups.
-Damien


Antony Blakey
-------------
CTO, Linkuistics Pty Ltd
Ph: 0438 840 787

There is nothing more difficult to plan, more doubtful of success, nor more dangerous to manage than the creation of a new order of things... Whenever his enemies have the ability to attack the innovator, they do so with the passion of partisans, while the others defend him sluggishly, so that the innovator and his party alike are vulnerable.

  -- Niccolo Machiavelli, 1513, The Prince.
