ActiveMQ 5.3.0_SNAPSHOT (Sep 8th according to the snapshots listing)

I'm running into an issue with the store and forward feature of
activemq, which I using in an attempt to create a highly available
distributed queue.  I'm trying to figure out if the issue is a
misconfiguration on my part, expected behavior of activemq, or a bug.

The summary of the problem is that given 2 brokers, B1 and B2, which
each have one consumer, C1 and C2, which are subscribed to the same
queue.  If I stop a consumer on one of the brokers, the pending
messages from that broker are not always forwarded to the other broker
which still has a consumer, leading to those messages getting
indefinitely stuck.

The steps I use to reproduce this scenario are as follows (Note: all
producing and consuming is performed over the stomp transport):

Couple notes about the consumers:
- they have a prefetchSize of 40
- the processing of messages can take some time, so for the purposes
  of this exercise, I've created a simple consumer that sleeps for 10
  seconds before sending the message ack (using client-individual ack
  mode)

1. start both brokers (B1 and B2). The consumers (C1 and C2) are not
   yet running.
2. produce a few thousand messages to B1
   Note: B1 now has a few thousand pending messages and B2 has 0.
3. start consumer C2 (listing for messages from B2)
   Note: messages are are successfully received and begin processing
   (monitoring the brokers shows pending messages decreasing). Now B2
   has all the pending messages and B1 has 0.
4. start consumer C1 (listing for messages from B1)
   Note: no messages are received, which is another issue I have since
   B2 now has thousands of pending messages which C1 could help
   process, but instead sits idle while C2 is forced to handle all the
   messages.
5. stop consumer C2
   Note: now I have thousands of messages sitting on B2 and 0 on B1
   where a C1 is alive and ready to handle them.  So at this point,
   despite having a consumer running, thousands of messages are stuck
   in the queue.
6. stop consumer C1
   Note: now I have no consumers.  Stopping and restarting C1 has no
   effect on the pending messages sitting on B1's queue.
7. stop both brokers
8. start B1, then start B2
9. start C1
   Note: now all messages have migrated from B2 to B1 and C1 is again
   processing messages.

So after step 5, the only way to recover from the stuck messages is to
restart the brokers.

Below is my current connector config which I have on both brokers.
I've tried playing with the various properties of the connector, but
it seems as though no matter what I try the above scenario continues
to occur.

<networkConnector
    name="default-nc"
    uri="multicast://default?group=${broker.group}"
    dynamicOnly="true"
    networkTTL="25"
    suppressDuplicateQueueSubscriptions="true"/>


So, is this an activemq bug? Am I mis-using activemq? Is there some
other way to achieve a highly available distributed queue?

Any help in this regard is greatly appreciated.

--
eric

Reply via email to