Hi Robert,

On Mar 23, 2010, at 5:13 PM, Robert Hodges wrote:

On 3/23/10 3:24 AM PDT, "Paul McCullagh" <[email protected]>
wrote:

Hi Robert,

On Mar 22, 2010, at 8:43 PM, Robert Hodges wrote:


1.) What are the "big" problems in replication?  Let's say we have
things
like availability and basic read scaling basically handled.  What's
next on
the list?  (Big data, No-SQL, replication/database impedance
mismatch due to
faster hardware, complex topologies, management, etc., all
suggestions are
welcome.)

2.) What replication solutions are emerging to address those
problems?  Jay
and I work on or know most of the usual suspects like Drizzle,
Tungsten,
Galera, MySQL 5.4, Rabbit MQ, etc.  However, if there's something
really
cool out there we'll add it to the list.

<plug>
It may be relevant to mention engine-level replication in a survey of
the replication landscape. This is now built into PBXT.

Engine-level replication is not as flexible as Drizzle replication,
for example, but it is extremely efficient.

It is relatively low-level, but not as low level has block device
replication (DRBD), and is much easier to setup and maintain.

I believe this has a role to play in its position between the block-
level replication and the high-level, replicate to anywhere, being
developed for Drizzle, and global replication systems like Tungsten.
</plug>


This is a very interesting point. Physical and logical replication are kind
of like Yin and Yang for building systems.

I was thinking of pointing out in the talk the work going on in PostgreSQL with PG 9 and streaming replication/hot standby. The PostgreSQL approach,
as you are probably aware, has been to implement very solid physical
replication while leaving logical replication to external, trigger- based
products like Londiste.

This is basically the PBXT approach.

How do you get around some of the issues like snapshot maintenance for
queries that are currently consuming cycles in the PostgreSQL effort? In the current PG 9 alpha users may have to choose between between up to date slaves and ability to run queries that maintain a snapshot for a lengthy
period of time.

Should not be much of a problem because PBXT MVCC implementation can handle long running snapshot queries/transactions. PBXT stores all versions of a row on disk, so it does not limit the size of a snapshot.

However, each version of a row is indexed. So a long running snapshot can lead to inflation of the index. This is not a problem for updates, however. Especially not replicated updates which are based on the internal row ID. So a snapshot should not have that much affect on the speed at which replication changes are applied.

But, it can affect the speed of long running SELECTs that are being done on the slave.

Best regards,

Paul



--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com




_______________________________________________
Mailing list: https://launchpad.net/~drizzle-discuss
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~drizzle-discuss
More help   : https://help.launchpad.net/ListHelp

Reply via email to