Re: massive replication?

Daniel Trümper Fri, 23 Oct 2009 15:54:32 -0700

2. It seems like there's a point at which explicit 1-1 replication starts to be an administrative nightmare. Some kind of publish-subscribe or multi-cast update model seems needed.
Would the new continuous replication feature be what you need? With this all changes to A get automatically replicated to B, if I get things right here...
Well it's more of, when I change A, I want the changes to propagate to B through Z (or "all") - with some sort of multi-cast addressing rather than having to identify every node explicitly.

Hm, I guess you would have to write a custom layer on top of each CouchDB instance. Maybe an adapted version of the couchdb-proxy [1] that would listen on the multicast address. With this you could notice new instances coming up, and the replication from the master machines would be handled by the proxy, i.e. the proxy would send the replication commands to the master. You could then even do this in layers of proxies and CouchDBs...
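To make the discovery part a bit more concrete, here is a rough sketch of how instances could announce themselves on a multicast group and how a proxy could notice them. This is not how couchdb-proxy works; the group address, the port, the JSON message format and all function names are made up for illustration, and only the Python standard library is used.

```python
import json
import socket

# Hypothetical group address and port, chosen for illustration only.
MCAST_GROUP = "239.255.42.99"
MCAST_PORT = 5985


def encode_announcement(couchdb_url):
    """Serialize the datagram a new instance would send on startup."""
    return json.dumps({"couchdb_url": couchdb_url}).encode("utf-8")


def decode_announcement(payload):
    """Return the announced CouchDB URL, or None if the datagram is malformed."""
    try:
        message = json.loads(payload.decode("utf-8"))
    except ValueError:  # covers both bad UTF-8 and bad JSON
        return None
    if not isinstance(message, dict):
        return None
    url = message.get("couchdb_url")
    return url if isinstance(url, str) else None


def announce(couchdb_url):
    """Send one announcement to the multicast group (TTL 1: local network only)."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, 1)
    try:
        sock.sendto(encode_announcement(couchdb_url), (MCAST_GROUP, MCAST_PORT))
    finally:
        sock.close()


def listen(on_new_instance):
    """Join the group and call on_new_instance(url) once per newly seen URL."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind(("", MCAST_PORT))
    # ip_mreq: group address followed by the local interface (any).
    membership = socket.inet_aton(MCAST_GROUP) + socket.inet_aton("0.0.0.0")
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP, membership)
    seen = set()
    while True:
        payload, _sender = sock.recvfrom(4096)
        url = decode_announcement(payload)
        if url is not None and url not in seen:
            seen.add(url)
            on_new_instance(url)
```

The callback passed to listen() is where the proxy would send the replication command to the master. Note that multicast UDP is unreliable, so a real setup would probably re-announce periodically.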

But: while writing the above lines I noticed that you may want to have a look at ZooKeeper [2]!?

ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services.

You could basically wrap the start script and write additional information (like the CouchDB URL) into ZooKeeper. One node could then periodically read the info about existing CouchDB instances and trigger or configure the replications...
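The "trigger the replications" part could look roughly like the sketch below. It assumes the list of instance URLs has already been read from the registry (how you read it depends on your ZooKeeper client and is left out here), and that the master pushes a database to every other node via CouchDB's _replicate resource with "continuous" set. The function names and the example URLs are made up.

```python
import json
import urllib.request


def plan_replications(master_url, instance_urls, db_name):
    """Build one push-replication request per known instance other than the master.

    Returns a list of (endpoint, body) pairs, sorted by target URL.
    """
    master = master_url.rstrip("/")
    planned = []
    for url in sorted(set(u.rstrip("/") for u in instance_urls)):
        if url == master:
            continue  # the master does not replicate to itself
        body = {
            "source": db_name,               # local database on the master
            "target": url + "/" + db_name,   # same database on the remote node
            "continuous": True,
        }
        planned.append((master + "/_replicate", body))
    return planned


def trigger(planned):
    """POST each planned request to the master; raises on HTTP errors."""
    for endpoint, body in planned:
        request = urllib.request.Request(
            endpoint,
            data=json.dumps(body).encode("utf-8"),
            headers={"Content-Type": "application/json"},
            method="POST",
        )
        with urllib.request.urlopen(request) as response:
            response.read()
```

The coordinating node would call something like trigger(plan_replications("http://a:5984", urls_from_registry, "mydb")) on each polling round, where urls_from_registry is whatever list it read from ZooKeeper. Since the target database has to exist on each node, the start script wrapper would be a natural place to create it.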


Daniel

[1]: http://github.com/benoitc/couchdbproxy
[2]: http://hadoop.apache.org/zookeeper/
