Re: [zeromq-dev] Formally modelling the publish-subscribe pattern

John Lång Fri, 16 Oct 2020 01:13:25 -0700

Hi Bill,

I hadn't heard about OZ when I started my project that now relies on0MQ. The tech stack was largely chosen by my supervisor and co-workers.Now the priority is to get the system verified and in production as soonas possible. Thus, at this point we must stick with the currentdependencies. Anyway, this OZ does seem interesting...

About Spin, I mainly chose it because I had used it with success in myearlier work that relates to the thesis. I'll list some other pointsthat come into my mind:

- I find the PROMELA language to be intuitive and elegant forformalising asynchronous systems. I like the way how PROMELA resembles aprogramming language. The TLA+ language seems to be lower level in thesense that I need to keep track of program counters etc. Of coursethere's the pluscal language for TLA, but I haven't tried it.- The TLC model checker seems to be quite a bit slower than Spin. Thisis more like the first impression than an informed opinion, though. Itcould be the case that TLC just has more overhead that slows down theverification of small models but which becomes unnoticeable on largermodels.- The TLA+ IDE does feel much better than iSpin or jSpin. However, Ipersonally prefer the simplicity of the Emacs PROMELA mode.- I find the theory behind Spin and enumerative model checking (guardedcommands language, Büchi automata) easier to understand than that ofsymbolic model checking (CSPs, SAT, SMT, DPLL, CDCL, BMC, IC3, etc.).Direct construction with automata feels more comfortable for me than themore analytical encodings and constraint satisfaction problems. Ofcourse, your mileage may vary. Some understanding on how the modelchecker works helps fighting the state space explosion.- I've read Mordechai Ben-Ari's book "Principles of the Spin ModelChecker", and it was a great tutorial. It was much more understandablefor me than Leslie Lamport's TLA Hyperbook and other resources.

Of the four model checkers I've tested (Spin, NuSMV, TLA+, and DIVINE) Ifind Spin the easiest to use. This is not to say that Spin itself iseasy to use. Frankly, the output formatting of Spin verifier executablesis horrible. Understanding counter examples can take much work.


Best regards,
John

On 15.10.2020 17.11, Bill Torpey wrote:

Hi John:

I’ll help where I can — pls see embedded comments.
Also, shameless plug: you may want to check out OZ(https://github.com/nyfix/OZ <https://github.com/nyfix/OZ>) — this isan “out of the box” implementation of ZeroMQ as the transport layerfor OpenMAMA (https://www.openmama.org/ <https://www.openmama.org/>). It may provide a simple way to run some test code (or at leastprovide some implementation hints).
I’m interested in your thesis in any case — I’ve been wanting tocreate a formal model of OZ for some time, and have been dipping mytoes into TLA+ (https://en.wikipedia.org/wiki/TLA%2B<https://en.wikipedia.org/wiki/TLA%2B>) for that purpose. I’m curiousif you’ve considered TLA+ and if so what prompted you to choose PROMELA?
Good luck with your thesis, and I’d very much like to review it whenyou’ve got something suitable.
Regards,

Bill Torpey
On Oct 15, 2020, at 5:23 AM, John Lång <john.l...@mykolab.com<mailto:john.l...@mykolab.com>> wrote:
Hello,
(Hopefully this message didn't get duplicated...) As part of mymaster's thesis, I'm building a formal model for my distributedsystem using PROMELA. The system I'm modelling uses 0MQpublish-subscribe pattern. I have some questions about the pattern.I'd really appreciate all answers to these questions! I'm now lookingat the informal specification at https://rfc.zeromq.org/spec/29/
The specification mostly looks clear. However, I'm a bit confusedabout these two bullet points:
  * SHALL create a queue when initiating an outgoing connection to a
    subscriber, and SHALL maintain the queue whether or not the
    connection is established.
  * SHALL create a queue when a subscriber connects to it. If this
    subscriber disconnects, the PUB socket SHALL destroy its queue
    and SHALL discard any messages it contains.
What is the difference between initiating an outgoing connection to asubscriber and a subscriber connecting to a publisher? In the C++source code of my system, a publisher binds to a PUB socket and asubscriber connects to the address of a publisher. I guess thissounds more like the second point, but I'm not certain.
This is an implementation detail, but can be important in someuse-cases. Basically, when a SUB connects to a PUB, subscriptioninformation is exchanged between the SUB and PUB as part of theconnect handshake — when a PUB connects to a SUB an additional set ofmessages is required to communicate subscription information from theSUB to the PUB. This matters because, for most protocols, filteringis done by the PUB, so the PUB will not send messages until it knowsthat the SUB is interested in them. In practice, this means that whena SUB connects to a PUB, the SUB will immediately start receivingmessages — in the other direction, there is a “window” where somenumber of messages that are published immediately following theconnect may not be sent to SUB.
There’s been a lot of discussion of this in the ZeroMQ community, forinstance https://github.com/zeromq/libzmq/issues/2267<https://github.com/zeromq/libzmq/issues/2267>. Note that thecanonical solution for this (calling zmq_poll after connect, as perhttps://gist.github.com/hintjens/7344533<https://gist.github.com/hintjens/7344533>) is NOT reliable — there isstill a window, although calling zmq_poll does often reduce the windowsize such that things appear to work.
Did I understand correctly that this message queue between apublisher and a subscriber works FIFO? It says that for outgoingmessages, "SHALL silently drop the message if the queue for asubscriber is full." Does this imply that those messages that fit inthe queue are delivered in the order they were sent in? I take itthat the queue being full means that the high water mark has beenexceeded.
Our testing was done not on queue full, but on socketdisconnect/reconnect, but the principle is the same. Any queuedmessages are delivered in order, and any messages dropped are droppedfrom the tail of the queue. So a typical sequence at a subscriberwould look something like this if the queue fills up after message #3and then is drained before message #10 is sent:
1 .. 2 .. 3 .. <queue full> .. 10, 11, 12
What is a binary comparison? Is it the same as bitwise comparison?
Yes.
To me, "binary comparison" sounds like just comparing two things witheach other.
Currently, I'm modelling my 0MQ publish-subscribe connections asarrays in PROMELA. After all, the specification for the publishersays that processing outgoing messages "SHALL NOT block on sending".Another reason for my decision is that I have multiple publishers andsubscribers and a fixed message queue length, so a three-dimensionalarray is handy for accessing the messages. Is there a way forachieving sensible channel semantics for 0MQ publish-subscribe pattern?
I wonder if there's already a formal specification for 0MQpublish-subscribe pattern out there somewhere... I should probably dosome more research to see if I can find related work on this matter.
If you do I’m sure that the community would love to know about it, aswould I.
Best regards,
John Lång
_______________________________________________
zeromq-dev mailing list
zeromq-dev@lists.zeromq.org <mailto:zeromq-dev@lists.zeromq.org>
https://lists.zeromq.org/mailman/listinfo/zeromq-dev
_______________________________________________
zeromq-dev mailing list
zeromq-dev@lists.zeromq.org
https://lists.zeromq.org/mailman/listinfo/zeromq-dev

_______________________________________________
zeromq-dev mailing list
zeromq-dev@lists.zeromq.org
https://lists.zeromq.org/mailman/listinfo/zeromq-dev

Re: [zeromq-dev] Formally modelling the publish-subscribe pattern

Reply via email to