Re: [Openstack-operators] Rabbitmq cluster_status alarms

2017-03-23 Thread Andreas Vallin

On 03/23/2017 10:59 AM, Matteo Panella wrote:


> Hi,
>
> On 22/03/2017 14:48, Andreas Vallin wrote:
> > Cluster status of node 'rabbit@Infra1-rabbit-mq-container-2590dd44' ...
> > [{nodes,[{disc,['rabbit@Infra1-rabbit-mq-container-2590dd44',
> > 'rabbit@Infra2-rabbit-mq-container-ff24b66b',
> > 'rabbit@Infra3-rabbit-mq-container-bf7948a7']}]},
> >  {running_nodes,['rabbit@Infra3-rabbit-mq-container-bf7948a7',
> >  'rabbit@Infra2-rabbit-mq-container-ff24b66b',
> >  'rabbit@Infra1-rabbit-mq-container-2590dd44']},
> >  {cluster_name,<<"rabbitmq_osa_prod">>},
> >  {partitions,[]},
> >  {alarms,[{'rabbit@Infra3-rabbit-mq-container-bf7948a7',[]},
> >   {'rabbit@Infra2-rabbit-mq-container-ff24b66b',[]},
> >   {'rabbit@Infra1-rabbit-mq-container-2590dd44',[]}]}]
>
> AFAIR, once all alarms are cleared the node name remains in
> cluster_status' output but the alarm list becomes empty - which seems to
> be the case for your cluster.
>
> Regards,

OK, my assumption was that the alarms section would disappear when
everything is ok or possibly that it would list something like:

{alarms,[]}

So if I understand you correctly the output from rabbitmqctl 
cluster_status in my first mail says that there are no errors? Every 
member/node/container is listed with an empty list of errors?
If I remember correctly from when I tried yesterday the alarms section 
disappeared if I started the rabbitmq cluster with only one member.


Thanks for your help!

Regards,
Andreas


___
OpenStack-operators mailing list
OpenStack-operators@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators


Re: [Openstack-operators] Rabbitmq cluster_status alarms

2017-03-23 Thread Matteo Panella
Hi,

On 22/03/2017 14:48, Andreas Vallin wrote:
> Cluster status of node 'rabbit@Infra1-rabbit-mq-container-2590dd44' ...
> [{nodes,[{disc,['rabbit@Infra1-rabbit-mq-container-2590dd44',
> 'rabbit@Infra2-rabbit-mq-container-ff24b66b',
> 'rabbit@Infra3-rabbit-mq-container-bf7948a7']}]},
>  {running_nodes,['rabbit@Infra3-rabbit-mq-container-bf7948a7',
>  'rabbit@Infra2-rabbit-mq-container-ff24b66b',
>  'rabbit@Infra1-rabbit-mq-container-2590dd44']},
>  {cluster_name,<<"rabbitmq_osa_prod">>},
>  {partitions,[]},
>  {alarms,[{'rabbit@Infra3-rabbit-mq-container-bf7948a7',[]},
>   {'rabbit@Infra2-rabbit-mq-container-ff24b66b',[]},
>   {'rabbit@Infra1-rabbit-mq-container-2590dd44',[]}]}]

AFAIR, once all alarms are cleared the node name remains in
cluster_status' output but the alarm list becomes empty - which seems to
be the case for your cluster.
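
To make the "node listed, alarm list empty" reading concrete, here is a minimal Python sketch that pulls the per-node alarm lists out of the cluster_status text quoted above. The function names (alarms_by_node, has_active_alarms) are made up for illustration, and the regex assumes the output has the same shape as in this thread, with the alarms section last and no nested brackets inside an alarm list. It is not a general Erlang term parser.

```python
import re

# Matches one {'node@host',[...]} pair as printed in the alarms section.
# Assumption: alarm lists contain no nested square brackets.
NODE_ALARMS = re.compile(r"\{'([^']+)',\s*\[([^\]]*)\]\}")

SAMPLE = """\
Cluster status of node 'rabbit@Infra1-rabbit-mq-container-2590dd44' ...
[{nodes,[{disc,['rabbit@Infra1-rabbit-mq-container-2590dd44',
'rabbit@Infra2-rabbit-mq-container-ff24b66b',
'rabbit@Infra3-rabbit-mq-container-bf7948a7']}]},
 {running_nodes,['rabbit@Infra3-rabbit-mq-container-bf7948a7',
 'rabbit@Infra2-rabbit-mq-container-ff24b66b',
 'rabbit@Infra1-rabbit-mq-container-2590dd44']},
 {cluster_name,<<"rabbitmq_osa_prod">>},
 {partitions,[]},
 {alarms,[{'rabbit@Infra3-rabbit-mq-container-bf7948a7',[]},
  {'rabbit@Infra2-rabbit-mq-container-ff24b66b',[]},
  {'rabbit@Infra1-rabbit-mq-container-2590dd44',[]}]}]
"""


def alarms_by_node(cluster_status_output):
    """Map each node named in the alarms section to its alarm terms (raw strings)."""
    start = cluster_status_output.find("{alarms,")
    if start == -1:
        # No alarms section at all in this output.
        return {}
    section = cluster_status_output[start:]
    result = {}
    for node, body in NODE_ALARMS.findall(section):
        # An empty body means the node is listed but has no alarms set.
        result[node] = [term.strip() for term in body.split(",") if term.strip()]
    return result


def has_active_alarms(cluster_status_output):
    """True only if at least one node carries a non-empty alarm list."""
    return any(alarms_by_node(cluster_status_output).values())


if __name__ == "__main__":
    for node, alarms in sorted(alarms_by_node(SAMPLE).items()):
        print(node, "->", alarms if alarms else "no alarms")
    print("active alarms:", has_active_alarms(SAMPLE))
```

Feeding it the output of rabbitmqctl cluster_status captured as a string should be enough; a plain {alarms,[]} yields an empty mapping, which this sketch also treats as "no alarms".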

Regards,
-- 
Matteo Panella
INFN CNAF
Via Ranzani 13/2 c - 40127 Bologna, Italy
Phone: +39 051 609 2903





[Openstack-operators] Rabbitmq cluster_status alarms

2017-03-22 Thread Andreas Vallin

Hi!

I have an openstack installation done with openstack-ansible 
14.0.6-10-g01ff4c8 (Newton). After doing some firmware upgrades on 
network switches (while rabbitmq was up and running) I see that I get 
alarms when running rabbitmqctl cluster_status:


Cluster status of node 'rabbit@Infra1-rabbit-mq-container-2590dd44' ...
[{nodes,[{disc,['rabbit@Infra1-rabbit-mq-container-2590dd44',
'rabbit@Infra2-rabbit-mq-container-ff24b66b',
'rabbit@Infra3-rabbit-mq-container-bf7948a7']}]},
 {running_nodes,['rabbit@Infra3-rabbit-mq-container-bf7948a7',
 'rabbit@Infra2-rabbit-mq-container-ff24b66b',
 'rabbit@Infra1-rabbit-mq-container-2590dd44']},
 {cluster_name,<<"rabbitmq_osa_prod">>},
 {partitions,[]},
 {alarms,[{'rabbit@Infra3-rabbit-mq-container-bf7948a7',[]},
  {'rabbit@Infra2-rabbit-mq-container-ff24b66b',[]},
  {'rabbit@Infra1-rabbit-mq-container-2590dd44',[]}]}]

All communications are restored and, from what I can see, everything is
working, but the alarms are still reported when I run rabbitmqctl
cluster_status. I may have missed something and something may still be
broken, but in that case I would like to know how to find out what the
alarm is about. Is there any way I can tell rabbitmq to remove the
alarms?


If I look at the rabbitmq containers one by one they report that they 
have no alarms:


ansible -m shell -a "rabbitmqctl status|grep alarm" rabbitmq
Variable files: "-e @/etc/openstack_deploy/user_secrets.yml -e 
@/etc/openstack_deploy/user_variables.yml "

Infra3_rabbit_mq_container-bf7948a7 | SUCCESS | rc=0 >>
 {alarms,[]},

Infra2_rabbit_mq_container-ff24b66b | SUCCESS | rc=0 >>
 {alarms,[]},

Infra1_rabbit_mq_container-2590dd44 | SUCCESS | rc=0 >>
 {alarms,[]},
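
The per-container check above can be summarised automatically. The following is a rough Python sketch that groups the ansible ad-hoc output by host and reports any host whose grep result is something other than an empty alarms list. The helper names (parse_adhoc_output, hosts_with_alarms) are hypothetical, and the header pattern is inferred only from the "host | SUCCESS | rc=0 >>" lines shown in this mail, so other ansible versions or output callbacks may need a different pattern.

```python
import re

# Host header line as it appears in the output above, e.g.
# "Infra3_rabbit_mq_container-bf7948a7 | SUCCESS | rc=0 >>"
HEADER = re.compile(r"^(\S+) \| (\w+) \| rc=(\d+) >>$")

SAMPLE = """\
Variable files: "-e @/etc/openstack_deploy/user_secrets.yml -e
@/etc/openstack_deploy/user_variables.yml "

Infra3_rabbit_mq_container-bf7948a7 | SUCCESS | rc=0 >>
 {alarms,[]},

Infra2_rabbit_mq_container-ff24b66b | SUCCESS | rc=0 >>
 {alarms,[]},

Infra1_rabbit_mq_container-2590dd44 | SUCCESS | rc=0 >>
 {alarms,[]},
"""


def parse_adhoc_output(text):
    """Group non-empty output lines under the host header that precedes them."""
    hosts = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        match = HEADER.match(line)
        if match:
            current = match.group(1)
            hosts[current] = []
        elif current is not None and line:
            hosts[current].append(line)
        # Lines before the first header (the "Variable files" banner) are ignored.
    return hosts


def hosts_with_alarms(text):
    """Hosts whose grep output contains anything other than an empty alarms list."""
    flagged = []
    for host, lines in parse_adhoc_output(text).items():
        if any(line.rstrip(",") != "{alarms,[]}" for line in lines):
            flagged.append(host)
    return sorted(flagged)


if __name__ == "__main__":
    print(hosts_with_alarms(SAMPLE))
```

A host where grep matched nothing produces no output lines and is therefore not flagged by this sketch, so a failed command would need to be checked separately.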

Regards,
Andreas

