Re: [Openstack] Messaging reliability/durability expectations

Gordon Sim Tue, 14 Oct 2014 10:45:52 -0700

I agree that greater clarity on expectations around reliability are needed.


The drivers all differ in this regard.

As it stands today, the impl_rabbit driver only retries an RPC requestif an exception occurs while sending it. However messages are sentunconfirmed[1]. This means a message can be lost before it gets enqueuedby the broker, without the sender of the message receiving any error ornotification of that fact.

Even if the requests are durably stored and/or replicated in a clusteredRabbitMQ configuration, the reply queues are currently alwaysauto-deleted and are not durable regardless of configuration, so repliesmay be lost on broker failure even if requests are not.

So I believe that various failures may cause an RPC request to fail(i.e. to timeout). It seems this is not universally expected however, soI am not sure how many OpenStack services using oslo.messaging expectand handle such failures.


--Gordon

[1] The impl_qpid driver by contrast sends messages synchronously - i.e.blocking until confirmed, but on the receive side it does not useacknowledgements so again message loss is possible.


_______________________________________________
Mailing list: http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack
Post to     : openstack@lists.openstack.org
Unsubscribe : http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack

Re: [Openstack] Messaging reliability/durability expectations

Reply via email to