I'm a complete beginner in clustering.... but I have some questions.

On Sep 12, 2006, at 12:19 PM, Jeff Genender wrote:

I wanted to go over a high-level design of a gcache cache component
and get some feedback and input, and invite folks who are interested
to join in... so here goes...

The gcache will be one of several cache/clustering offerings...but
starting off with the first one...

For the first pass I want to go with a master/slave full-replication
implementation.  What this means is a centralized caching server which
runs a cache implementation (likely ehcache underneath); this server
is known as the master. My interest in ehcache is that it can be
configured to persist session state if full failure recovery is
needed (no need to reinvent the wheel on a great cache).  The master
will communicate with N slave servers, also running a gcache
implementation.

   +--------+   +---------+  +---------+
   |        |   |         |  |         |
   | MASTER |   | SLAVE 1 |  | SLAVE 2 | ... n-slaves
   |        |   |         |  |         |
   +--------+   +---------+  +---------+
      |   |            |           |
      |   |            |           |
      |   |____________|           |
      |                            |
      |____________________________|
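
To make the write path concrete, here is a rough sketch (names like
GCacheMaster and Slave are placeholders I'm using just for
illustration, and a plain map stands in for the ehcache instance
underneath):

   import java.util.List;
   import java.util.Map;
   import java.util.concurrent.ConcurrentHashMap;
   import java.util.concurrent.CopyOnWriteArrayList;

   // Wire-level handle to one slave server (placeholder name).
   interface Slave {
       void replicate(String key, Object value);
   }

   // The master applies each write locally, then pushes it to every
   // slave so all nodes hold the full replicated state.
   class GCacheMaster {
       private final Map<String, Object> cache =
               new ConcurrentHashMap<String, Object>();
       private final List<Slave> slaves = new CopyOnWriteArrayList<Slave>();

       void register(Slave slave) {
           slaves.add(slave);
       }

       void put(String key, Object value) {
           cache.put(key, value);           // local write first
           for (Slave slave : slaves) {
               slave.replicate(key, value); // then full replication
           }
       }

       Object get(String key) {
           return cache.get(key);
       }
   }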



I can't tell whether the master communicates with all slaves, or only with slave 1, which then communicates with slave 2, etc. My questions are biased a bit towards a linked list/chain of slaves, which might well not be what you intend.

Can you describe what happens if:
- the master goes down
- the former master comes back up
- slave 1 (or any intermediate slave) goes down
- slave 1 (or any intermediate slave) comes back up
- slave n (the last slave) goes down


We then have client component(s) that "plug in" and communicate with
the server. The configuration for the client should be very light: it
really only needs to know the master/slave/slave/nth-slave list.
In other words, it communicates only with the master.  The master is
responsible for "pushing" anything it receives to its slaves and other
nodes in the cluster. The slaves basically look like clients to the
master.
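
As a sketch of how light that client configuration could be
(GCacheClient, the host:port strings, and port 4100 are all invented
for illustration, not a settled API):

   import java.util.Arrays;
   import java.util.List;

   // The client's whole configuration is the ordered
   // master/slave/.../nth-slave list; it only ever talks to the
   // entry it currently believes is the master.
   class GCacheClient {
       private final List<String> servers;

       GCacheClient(String... servers) {
           this.servers = Arrays.asList(servers);
       }

       // All reads and writes go to servers.get(0), the master; the
       // master pushes the writes on to the slaves itself.
       void put(String key, Object value) { /* send to servers.get(0) */ }

       Object get(String key) { /* ask servers.get(0) */ return null; }
   }

   // Usage sketch:
   //   new GCacheClient("master:4100", "slave1:4100", "slave2:4100");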


Are you saying that every client maintains the entire shared state? If not, how does a client determine if it needs to fetch some state and how much state?

   +--------+   +---------+  +---------+
   |        |   |         |  |         |
   | MASTER |---| SLAVE 1 |  | SLAVE 2 |
   |        |   |         |  |         |
   +--------+   +---------+  +---------+
       |  |                       |
       |  +-----------------------+
       |
   ,-------.
  ( CLIENT  )
   `-------'

In the event the master goes down, the client notices the timeout and
then automatically communicates with slave #1 as the new master. Since
slave #1 is also a client of the master, it can determine, either by
itself or from the first request that comes in asking for data, that
it is the new master.

   +--------+   +----------+  +---------+
   |  OLD   |   |NEW MASTER|  |         |
   | MASTER |   |   WAS    |--| SLAVE 2 |
   |        |   | SLAVE 1  |  |         |
   +--------+   +----------+  +---------+
       |           _,'
       X         ,'
       |      ,-'
   ,-------.<'
  ( CLIENT  )
   `-------'
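
The failover rule the client would follow might sketch out like this
(the class name and the timeout value are invented for illustration):

   import java.io.IOException;
   import java.net.InetSocketAddress;
   import java.net.Socket;
   import java.util.List;

   // Sketch of the client-side failover rule: walk down the
   // configured master/slave list until a server answers, and treat
   // the first reachable entry as the new master.
   class FailoverPolicy {
       private static final int CONNECT_TIMEOUT_MS = 2000;
       private final List<String> servers; // master first, then slaves
       private int current = 0;            // who we believe is master

       FailoverPolicy(List<String> servers) {
           this.servers = servers;
       }

       Socket connectToMaster() throws IOException {
           for (int i = current; i < servers.size(); i++) {
               String[] hostPort = servers.get(i).split(":");
               Socket socket = new Socket();
               try {
                   socket.connect(new InetSocketAddress(hostPort[0],
                           Integer.parseInt(hostPort[1])), CONNECT_TIMEOUT_MS);
                   current = i; // the node we now treat as master
                   return socket;
               } catch (IOException timedOut) {
                   socket.close(); // node looks dead; try the next one
               }
           }
           throw new IOException("no reachable server in " + servers);
       }
   }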

It looks to me as if this relies on every client maintaining an identical list of master/slave servers. How do they do that? (This relates to my first set of questions, I think.)

What happens if:
- Client 1 maintains communication with the master and all slaves
  throughout, and all servers remain running at all times, while
- Client 2 loses connectivity to the master long enough to decide that
  the master is dead, and gets it back in time to communicate with
  slave 1, thus telling slave 1 it is the new master?


I think this is a fairly simple implementation, yet fairly robust.
Since we are not doing heartbeats and multicast, we cut down on a lot
of network traffic.

Can you describe briefly what they are for?


Communication will be done over TCP/IP sockets, and I would probably
like to use NIO.
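
For example, the server side could start from the standard JDK
selector loop; this is only a sketch, and the port number and request
framing are placeholders:

   import java.net.InetSocketAddress;
   import java.nio.channels.SelectionKey;
   import java.nio.channels.Selector;
   import java.nio.channels.ServerSocketChannel;
   import java.nio.channels.SocketChannel;
   import java.util.Iterator;

   // Minimal NIO accept loop as a starting point; one thread
   // services all connections via the selector instead of a thread
   // per socket.
   public class GCacheServer {
       public static void main(String[] args) throws Exception {
           Selector selector = Selector.open();
           ServerSocketChannel server = ServerSocketChannel.open();
           server.socket().bind(new InetSocketAddress(4100));
           server.configureBlocking(false);
           server.register(selector, SelectionKey.OP_ACCEPT);

           while (true) {
               selector.select(); // block until a channel is ready
               Iterator<SelectionKey> keys =
                       selector.selectedKeys().iterator();
               while (keys.hasNext()) {
                   SelectionKey key = keys.next();
                   keys.remove();
                   if (key.isAcceptable()) {
                       SocketChannel client = server.accept();
                       client.configureBlocking(false);
                       client.register(selector, SelectionKey.OP_READ);
                   } else if (key.isReadable()) {
                       // decode a get/put/replicate request here
                   }
               }
           }
       }
   }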

I would like to see this component be able to run on its own, i.e. no
Geronimo needed. We can build a Geronimo GBean and deployer around it,
but I would like to see this component usable in many other areas,
including outside of Geronimo. Open source needs more "free" clustering
implementations.  I would like this component to be broken down into 2
major categories: server and client.


What is the difference in functionality between a server and a client?

After a successful implementation of master/slave, I would like to
make the strategies pluggable, so we can provide more of a distributed
cache, partitioning, and other types of joins, such as mcast/heartbeat
for those who want it.
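
As a sketch of what that pluggability might reduce to (all names here
are invented for illustration):

   // The replication policy becomes an interface, with master/slave
   // full replication as the first implementation.
   interface ReplicationStrategy {
       void onPut(String key, Object value); // propagate a write
       Object onMiss(String key);            // locate state this node lacks
       void onMemberFailure(String nodeId);  // react to a lost node
   }

   class MasterSlaveStrategy implements ReplicationStrategy {
       public void onPut(String key, Object value) {
           // push to every slave (full replication)
       }
       public Object onMiss(String key) {
           return null; // full copy everywhere: a local miss is a real miss
       }
       public void onMemberFailure(String nodeId) {
           // if the master died, the first slave takes over
       }
   }

   // A partitioned strategy, or an mcast/heartbeat-discovered
   // membership, could later plug in behind the same interface.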

Thoughts and additional ideas?


Well, it made me think of lots of questions!

Are there any good references on "principles of clustering" for newbies like me?

thanks
david jencks

Thanks,

Jeff
