Deadlock with static:(failover:(...)) configuration ---------------------------------------------------
Key: AMQ-3277 URL: https://issues.apache.org/jira/browse/AMQ-3277 Project: ActiveMQ Issue Type: Bug Components: Broker Affects Versions: 5.4.2 Environment: 5.4.2 with back-ported patches for AMQ-3176, AMQ-2736, AMQ-3124, AMQ-3125, AMQ-3167. Two brokers, each with a single duplex network connection to the other, using static:(failover:(first,second)) URI format. Reporter: Arthur Naseef Using failover transports with static transports to create a network of brokers with H/A pairs, the failovers lead to deadlocks. This appears related to AMQ-3176, but the patch for that problem is included in our 5.4.2 build. <networkConnector name="nw1_from_nw2" uri="static:(failover:(tcp://127.0.0.1:60992,tcp://127.0.0.1:60002))" duplex="true" conduitSubscriptions="false" dynamicOnly="true" networkTTL="7" suppressDuplicateQueueSubscriptions="true" decreaseNetworkConsumerPriority="true" bridgeTempDestinations="true" > </networkConnector> The deadlock occurs after stopping and restarting each broker once. Here's the steps: - start broker1 - start broker2 * test connectivity - sthudown broker1 - restart broker1 * test connectivity - shutdown broker2 - restart broker2 * test connectivity Here is the deadlock information from the JVM: Java stack information for the threads listed above: =================================================== "StartRemotelBridge: localBroker=vm://peer-nw2#2": at org.apache.activemq.transport.failover.FailoverTransport.oneway(FailoverTransport.java:477) - waiting to lock <0x00002aaac2df83c8> (a java.lang.Object) at org.apache.activemq.transport.MutexTransport.oneway(MutexTransport.java:40) - locked <0x00002aaac2e050f8> (a java.lang.Object) at org.apache.activemq.transport.ResponseCorrelator.oneway(ResponseCorrelator.java:60) at org.apache.activemq.network.DemandForwardingBridgeSupport.startRemoteBridge(DemandForwardingBridgeSupport.java:352) - locked <0x00002aaac2df7c50> (a org.apache.activemq.network.DurableConduitBridge) at org.apache.activemq.network.DemandForwardingBridgeSupport$4.run(DemandForwardingBridgeSupport.java:277) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) at java.lang.Thread.run(Thread.java:662) "ActiveMQ Failover Worker: 169776139": at org.apache.activemq.network.DemandForwardingBridgeSupport.startLocalBridge(DemandForwardingBridgeSupport.java:290) - waiting to lock <0x00002aaac2df7c50> (a org.apache.activemq.networFound one Java-level deadlock: ============================= "StartRemotelBridge: localBroker=vm://peer-nw2#2": waiting to lock monitor 0x0000000044796938 (object 0x00002aaac2df83c8, a java.lang.Object), which is held by "ActiveMQ Failover Worker: 169776139" "ActiveMQ Failover Worker: 169776139": waiting to lock monitor 0x00002aaac470f0b0 (object 0x00002aaac2df7c50, a org.apache.activemq.network.DurableConduitBridge), which is held by k.DurableConduitBridge) at org.apache.activemq.network.DemandForwardingBridgeSupport$2.transportResumed(DemandForwardingBridgeSupport.java:220) at org.apache.activemq.transport.TransportFilter.transportResumed(TransportFilter.java:109) at org.apache.activemq.transport.TransportFilter.transportResumed(TransportFilter.java:109) at org.apache.activemq.transport.failover.FailoverTransport.doReconnect(FailoverTransport.java:870) - locked <0x00002aaac2df83c8> (a java.lang.Object) at org.apache.activemq.transport.failover.FailoverTransport$2.iterate(FailoverTransport.java:135) - locked <0x00002aaac2df83d8> (a java.lang.Object) at org.apache.activemq.thread.DedicatedTaskRunner.runTask(DedicatedTaskRunner.java:98) at org.apache.activemq.thread.DedicatedTaskRunner$1.run(DedicatedTaskRunner.java:36) Found 1 deadlock. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira