[Twisted-Python] Unhandled exceptions and observability

Svein Seldal Wed, 27 Dec 2017 20:31:39 -0800

Hi

I'm not sure how to write this email, but please let me try. I'd like toaddress something that I see as a limitation in Twisted. It might bethat my use case is odd or that I'm outside the scope of Twisted, butnon the less, I'd hope this could be a relevant topic.


Problem:

Unhandled exceptions can leave the application in a half-working state,and the in-app observability for them is difficult to obtain. Instead ofterminating the whole application, the rest of the app can still keeprunning, and can be completely unaware of the failure.

This applies to unhandled errbacks in Deferred and principally to anyother reactor callbacks. E.g. it can occur in Deferreds being usedinternally in Twisted, where direct access to the object isn't availableto the caller.

As a user of Twisted, I would like to have the option to catch or failmy application completely when these unhandled exceptions occur, aswould be expected in a sequential program.



Background:

I have a larger application using many simultaneous TCP, UDP and UNIXconnections. As with Twisted, the app is grouped in functions, wheremost of the heavy lifting are done in black-box-ish modules. There is ofcourse, no guarantee for everything to work smoothly and if somethingfails, the entire application stops as a clear indication of thefailure. However, there have been some occasions where this applicationis found to be half-dead, due to a failure occurring in a reactor-basedcallback that can only be seen by reading the logs. The main applicationis unfortunately unaware of its own failure.

AFAIK Twisted has no direct mechanism for handling errors that mightoccur when user code is called from the reactor. Or even worse, thecaller does not know about the occurred failure unless the caller hasdirect access to the failing object. I believe this is more dangerous toreliability than the plain failing applications is, due to lowerobservability.


Lets say the following code is used in a running application:

   from twisted.internet.task import LoopingCall
   class Foo:
     def __init__(self):
       self.loop = LoopingCall(self.cb)
       self.loop.start(2, False)
     def cb(self):
       self.count += 1

   # Main app does this:
   try:
     foo = Foo()
   except:
     print "Won't happen"
     raise

The code will fail due to the programmical error in cb, but the callingapplication won't fail and thinks everything is fine. The methodology indebugging errors like this is by looking through the logs.



The 0-solution:

Everywhere a function is being called from the reactor, the user isresponsible to handling all exceptions. As is the current case.

However, this is not completely straight forward. try-expect are greatto catch expected errors, but it's easy to forget and ignore theunexpected ones. Like in the example above. The safeguard for this wouldbe something like:


   def cb(self):
     try:
        self.count += 1
     except:
        print "Whoops. Unexpected"
        signal_main_app()

And in a large application, there are many entrypoints (e.g. methods ina protcol handler), so the code becomes very cluttered. Plus it puts theresponsibility for the user to implement the signal_main_app() framework.



Proposal:

The ideal solution would be if there were a way to configure Twisted toinform about unhandled exceptions. It can be a addSystemEventTrigger(),or a SW signal, or a process signal, or perhaps a globalexecute-last-errback function. Possibly in a debug-context.

With this one could inform the application that one deferred object hasnot handled its errbacks. Then the main application is given a choice torespond appropriately, like shutting down.

Is my concern about the non-observability of unhandled exceptions at allwarranted? Is the thinking wrong? Are there any other types of solutionsto this problem? (I would like to avoid having to patch Twisted to do it.)



Best regards,
Svein

_______________________________________________
Twisted-Python mailing list
[email protected]
https://twistedmatrix.com/cgi-bin/mailman/listinfo/twisted-python

[Twisted-Python] Unhandled exceptions and observability

Reply via email to