Duncan Godwin created BROOKLYN-319:
--------------------------------------

             Summary: RabbitMQ restart failure
                 Key: BROOKLYN-319
                 URL: https://issues.apache.org/jira/browse/BROOKLYN-319
             Project: Brooklyn
          Issue Type: Bug
         Environment: Centos 6 on BlueBox and Google Compute
            Reporter: Duncan Godwin


RabbitMQ intermittently fails to come back up after restart of the entity on 
some clouds (BlueBox and Google Compute known to fail).

The brooklyn.debug.log shows the following:

{code}
2016-07-28 14:05:12,642 DEBUG brooklyn.SSH 
[brooklyn-execmanager-E26Om0Sg-12886]: check-running 
RabbitBrokerImpl{id=d5kw04uk46, amqpPort=5672}, initiating ssh on machine 
SshMachineLocation[10.104.2.181:[email protected]/10.104.2.181:22(id=t0tvrv6o0h)]
 (env 
{RABBITMQ_HOME=/home/users/amp/brooklyn-managed-processes/apps/b6kxbm5fei/entities/RabbitBroker_d5kw04uk46,
 
RABBITMQ_LOG_BASE=/home/users/amp/brooklyn-managed-processes/apps/b6kxbm5fei/entities/RabbitBroker_d5kw04uk46,
 RABBITMQ_NODENAME=d5kw04uk46, RABBITMQ_NODE_PORT=5672, 
RABBITMQ_PID_FILE=/home/users/amp/brooklyn-managed-processes/apps/b6kxbm5fei/entities/RabbitBroker_d5kw04uk46/rabbitmq.pid,
 
RABBITMQ_CONFIG_FILE=/home/users/amp/brooklyn-managed-processes/apps/b6kxbm5fei/entities/RabbitBroker_d5kw04uk46/rabbitmq}):
 #!/bin/bash -e
 ; export 
INSTALL_DIR="/home/users/amp/brooklyn-managed-processes/installs/RabbitBroker_3.6.0"
 ; export 
RUN_DIR="/home/users/amp/brooklyn-managed-processes/apps/b6kxbm5fei/entities/RabbitBroker_d5kw04uk46"
 ; mkdir -p $RUN_DIR ; cd $RUN_DIR ; ./sbin/rabbitmqctl -q status
2016-07-28 14:05:12,982 DEBUG o.a.b.u.c.i.w.w.Winrm4jTool 
[brooklyn-execmanager-E26Om0Sg-12879]: Finished WinRM exec on 
[email protected]:11049  done after 2.01s (connected in 0ms)
2016-07-28 14:05:13,012 DEBUG brooklyn.SSH [Thread-350446]: 
[[email protected]:stdout] Executed 
/tmp/brooklyn-20160728-140512642-NDCx-check-running_RabbitBrokerImpl.sh, result 
69
2016-07-28 14:05:13,012 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] Error: unable to connect to node 
'd5kw04uk46@qa-scheduler-rabbitmq-server-be5': nodedown
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] DIAGNOSTICS
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] ===========
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] attempted to contact: 
['d5kw04uk46@qa-scheduler-rabbitmq-server-be5']
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] d5kw04uk46@qa-scheduler-rabbitmq-server-be5:
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr]   * connected to epmd (port 4369) on 
qa-scheduler-rabbitmq-server-be5
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr]   * epmd reports: node 'd5kw04uk46' not 
running at all
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr]                   no other nodes on 
qa-scheduler-rabbitmq-server-be5
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr]   * suggestion: start the node
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] current node details:
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] - node name: 
'rabbitmq-cli-23557@qa-scheduler-rabbitmq-server-be5'
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] - home dir: /home/users/amp
2016-07-28 14:05:13,013 DEBUG brooklyn.SSH [Thread-350447]: 
[[email protected]:stderr] - cookie hash: hGw+myYNiff8VA9lwwfh0g==
{code}

the console-out.log contains:

{code}
ERROR: node with name "d5kw04uk46" already running on 
"qa-scheduler-rabbitmq-server-be5"
{code}

The RabbitMQ service is not runing on the machine.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to