Re: [HACKERS] [PATCH 10/16] Introduce the concept that wal has a 'origin' node

Heikki Linnakangas Tue, 19 Jun 2012 23:40:54 -0700

On 20.06.2012 01:27, Kevin Grittner wrote:

Andres Freund<[email protected]>  wrote:

Yes, thats definitely a valid use-case. But that doesn't preclude
the other - also not uncommon - use-case where you want to have
different master which all contain up2date data.


I agree.  I was just saying that while one requires an origin_id,
the other doesn't.  And those not doing MM replication definitely
don't need it.

I think it would be helpful to list down a few concrete examples ofthis. The stereotypical multi-master scenario is that you have a singletable that's replicated to two servers, and you can insert/update/deleteon either server. Conflict resolution stretegies vary.

The reason we need an origin id in this scenario is that otherwise thiswill happen:


1. A row is updated on node A

2. Node B receives the WAL record from A, and updates the correspondingrow in B. This generates a new WAL record.3. Node A receives the WAL record from B, and updates the rows again.This again generates a new WAL record, which is replicated to A, and youloop indefinitely.

If each WAL record carries an origin id, node A can use it to refrainfrom applying the WAL record it receives from B, which breaks the loop.

However, note that in this simple scenario, if the logical log replay /conflict resolution is smart enough to recognize that the row hasalready been updated, because the old and the new rows are identical,the loop is broken at step 3 even without the origin id. That works forthe newest-update-wins and similar strategies. So the origin id is notabsolutely necessary in this case.

Another interesting scenario is that you maintain a global counter, likein an inventory system, and conflicts are resolved by accumulating theupdates. For example, if you do "UPDATE SET counter = counter + 1"simultaneously on two nodes, the result is that the counter isincremented by two. The avoid-update-if-already-identical optimizationdoesn't help in this case, the origin id is necessary.

Now, let's take the inventory system example further. There are actuallytwo ways to update a counter. One is when an item is checked in or outof the warehouse, ie. "UPDATE counter = counter + 1". Those updatesshould accumulate. But another operation resets the counter to aspecific value, "UPDATE counter = 10", like when taking an inventory.That should not accumulate with other changes, but should benewest-update-wins. The origin id is not enough for that, because bylooking at the WAL record and the origin id, you don't know which typeof an update it was.

So, I don't like the idea of adding the origin id to the record header.It's only required in some occasions, and on some record types. And I'mworried it might not even be enough in more complicated scenarios.

Perhaps we need a more generic WAL record annotation system, where aplugin can tack arbitrary information to WAL records. The extrainformation could be stored in the WAL record after the rmgr payload,similar to how backup blocks are stored. WAL replay could just ignorethe annotations, but a replication system could use it to store theorigin id or whatever extra information it needs.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] [PATCH 10/16] Introduce the concept that wal has a 'origin' node

Reply via email to