Re: Dynamic balancing of queues.

Gordon Sim Mon, 11 Sep 2006 10:35:24 -0700

Alan Conway wrote:

On Mon, 2006-09-11 at 14:37 +0100, Gordon Sim wrote:

Why do we need extra queues to scale up?


For a single broker you don't, the broker should be optimized to take
full advantage of local CPUs etc. with a single queue.

I'm thinking about the large scale deployment where you need to
distribute load across multiple hosts (possibly on different networks)
because either the CPUs on the exchange host or its network
connectivity are a bottleneck.


Ok, understood.

So let the exchange act as a consumer and *remove* messages from
over-full queues to re-queue them on under-full/empty ones. Now we have
a full dynamic balance in both growing and shrinking phases.

This complicates the exchange though, as it now presumably needs to
periodically monitor the queue lengths (i.e. it becomes an active entity
rather than just reacting to publications routed through it). Maybe an
entirely separate re-balancing component would be cleaner?

Maybe, need to think about that. My intuition is that the broker can do
a better job of this because it's a single place to keep the statistics,
but a distributed cleanup component might have advantages if network
bandwidth at the broker is the bottleneck.

I'm not saying that component shouldn't be in the broker(s), just that
it isn't necessarily part of an exchange.
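
To make the "separate re-balancing component" idea concrete, here is a minimal sketch, assuming queues are plain in-memory deques and that the component is invoked periodically by something outside the exchange. The name `rebalance` and the `threshold` parameter are hypothetical and not taken from any broker code.

```python
from collections import deque


def rebalance(queues, threshold=1):
    """Move messages from the longest queue to the shortest one until
    their lengths differ by at most `threshold`.

    Hypothetical helper: `queues` is a list of deques standing in for
    broker queues. Returns the number of messages moved.
    """
    if threshold < 1:
        # With threshold 0 a difference of one message would bounce
        # back and forth forever, so reject it.
        raise ValueError("threshold must be at least 1")
    moved = 0
    while True:
        longest = max(queues, key=len)
        shortest = min(queues, key=len)
        if len(longest) - len(shortest) <= threshold:
            return moved
        # Take the newest message from the over-full queue and
        # re-queue it on the under-full one.
        shortest.append(longest.pop())
        moved += 1


queues = [deque(range(10)), deque(), deque()]
moved = rebalance(queues)
lengths = [len(q) for q in queues]
```

The exchange itself stays passive here: it only routes publications, and the rebalancer is the one active entity that inspects queue lengths.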

I'm not really sure I understand the root problem here, i.e. why do we
want multiple queues of the same (or similar) length?

Trying to balance multiple queues only makes sense if there's a resource
problem with everyone talking to a single queue - not enough memory,
not enough open file descriptors, performance degrades due to memory
requirements, network topology/firewalls etc. You can imagine situations
where a single broker with 1,000,000 consumers might not perform as well
as a federation of 1001 brokers each with 1000 consumers.

My confusion here stemmed from not understanding that you were talking
about a group of co-operating brokers. (This is actually the use case
that the java clustering code currently in svn was designed for, though
any actual improvements to scalability have not been confirmed through
testing).

That being the case I would argue even more strongly against the
exchange removing messages from queues it has delivered them to and
redelivering them to shorter queues to load balance. For one thing that
would have implications for ordering. A single logical queue that is in
implementation distributed would seem like a better fit from the design
point of view (the clustering code mentioned in the previous paragraph
does something similar).
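
The ordering concern can be illustrated with a small sketch, assuming two in-memory queues and a toy delivery loop that alternates between them. The function name `deliver_after_rebalance` is hypothetical and the code is only meant to show the effect, not how any broker delivers messages.

```python
from collections import deque


def deliver_after_rebalance(n):
    """Publish messages 0..n-1 in order to queue `a`, move the newer
    half to an empty queue `b`, then deliver by alternating between
    the two queues. Returns the observed delivery order.
    """
    a = deque(range(n))
    b = deque()
    # Rebalancing step: move the tail of the long queue to the short
    # one, preserving relative order within the moved batch.
    for _ in range(n // 2):
        b.appendleft(a.pop())
    delivered = []
    # Toy delivery: consumers on b and a are served alternately.
    while a or b:
        if b:
            delivered.append(b.popleft())
        if a:
            delivered.append(a.popleft())
    return delivered


order = deliver_after_rebalance(6)
print(order)  # [3, 0, 4, 1, 5, 2]
```

Each queue is still internally ordered, but the overall delivery order no longer matches the publication order, which is what a single logical queue would be expected to preserve.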

That said, we need to work hard optimizing the broker so that the single
queue solution can scale as far as possible before we get into more
complicated federations and the like. We should also look at some real
data before assuming that such federation will solve a real-world
problem, this stuff doesn't always work out the way you think it will!

Agreed! The justification for the earlier work on java was purely to get
feature parity with an alternative implementation.
