On 23Mar2019 11:04, ingo janssen <ingoo...@gmail.com> wrote:
One thing I often struggle with is how to deal with exceptions, especially when I have a chain of functions that use each other's output and/or long running processes. As the answer will probably be "it depends"

Oh yes!

The core rule of thumb is "don't catch an exception which you don't know how to handle", but that is for truly unexpected errors not envisaged by the programmer; in that case your programme aborts with a debugging stack trace.

Your situation below is more nuanced. Discussion below.

take for example this program flow:

open a file and read into BytesIO buffer
get a FTP connection from pool
send buffer to plantuml.jar in memory FTP server
render file to image
get image from FTP server
push the image onto CherryPy bus
push (SSE) the image to web browser

def read_file(input_file):
   try:
       with open(input_file, 'rb') as f:
           buffer = io.BytesIO(f.read())
   except FileNotFoundError as e:
       print(e)
       ....
   return buffer

Assume the file is not found: I cannot just kill the whole process. Catching the exception is one thing, but how do I deal with it properly? I have to inform the client somehow about what went wrong.

Given a function like that, I would be inclined to do one of two things:

A) don't make a policy decision (catching the exception) this close to the failure; instead, let the exception out and let the caller handle it:

   def read_file(input_file):
       with open(input_file, 'rb') as f:
           return io.BytesIO(f.read())

   filename = "foo"
   try:
       buffer = read_file(filename)
   except OSError as e:
       # error() here might be e.g. logging.error
       error("could not load %r: %s", filename, e)
       ... failure action, maybe return from the function ...
   ... proceed with buffer ...

This leaves the policy decision with the calling code, which may have a better idea about what is suitable. For example, you might pass some useful response to your web client here. The low level function read_file() doesn't know that it is part of a web service.

The handy thing about exceptions is that you can push that policy decision quite a long way out. Provided the outer layer where you decide to catch the exception knows that this involved accessing a file, you can put that try/except quite a long way out and still produce a sensible-looking error response.

Also, the further out the policy try/except lives, the simpler the inner functions can be because they don't need to handle failure - they can be written for success provided that failures raise exceptions, making them _much_ simpler and easier to maintain. And with far fewer policy decisions!

The flip side to this is that there is a limit to how far out in the call chain this try/except can sensibly happen: if you're far enough out that the catching code _doesn't_ know that there was a file read involved, the error message becomes more vague (although you still have the exception instance itself with the low level detail).
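
For example, the catch can live at the request handling level while the inner layers stay simple. A sketch (render_image, handle_request and do_render are hypothetical names standing in for your own pipeline steps and web handler):

    def render_image(input_file):
        # mechanism: written purely for success; failures raise
        buffer = read_file(input_file)
        return do_render(buffer)

    def handle_request(input_file):
        # policy: the one place that decides what a failure means
        try:
            image = render_image(input_file)
        except OSError as e:
            error("could not produce an image from %r: %s", input_file, e)
            return None   # or: return an error response to the web client
        return image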

B) return None on failure:

   def read_file(input_file):
       try:
           with open(input_file, 'rb') as f:
               return io.BytesIO(f.read())
       except OSError as e:
           error("read_file(%r): could not read input file: %s", input_file, e)
           return None

None is a useful sentinel value for failure. Note that sometimes you will want something else if None is a meaningful return value in ordinary circumstances (see the sentinel sketch below). Then your calling code can handle this without exceptions:

   buffer = read_file("foo")
   if buffer is None:
       ... return nice message to web client ...
   else:
       ... process the image ...

However, it does mean that this handling has to happen right at the call to read_file. That can be fine, but might be inconvenient.
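
When None is itself a meaningful return value, a distinct module level sentinel does the same job. A minimal sketch, using a plain object() as the sentinel:

    import io

    # a unique sentinel, distinct from any legitimate return value
    MISSING = object()

    def read_file(input_file):
        try:
            with open(input_file, 'rb') as f:
                return io.BytesIO(f.read())
        except OSError as e:
            error("read_file(%r): could not read input file: %s", input_file, e)
            return MISSING

    buffer = read_file("foo")
    if buffer is MISSING:
        ... return nice message to web client ...

Because MISSING is compared with "is", no real return value can ever be mistaken for it.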

Finally, some related points:

I find it useful to distinguish "mechanism" and "policy". In my ideal world a programme is at least 90% mechanism with a thin layer of policy outside it. Here "policy" is what might be termed "business logic" or "application logic" in some circumstances: what to do to achieve the high level goal. The high level is where you decide how to behave in various circumstances.

This has a few advantages: almost all low level code is mechanism: it has a well defined, usually simple, purpose. By having almost all failures raise an exception you can make the low level functions very simple: do A then B then C until success, where you return the result; raise exceptions when things go wrong (failure to open files, invalid input parameters, what have you). This produces what I tend to call "white list" code: code which only returns a result when all the required operations succeed.

This is option (A) above, and makes for very simple inner functions.
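
Applied to your pipeline above, the mechanism layer might read like this (a sketch: get_ftp_connection, send_to_ftp, render_to_image and push_to_bus are hypothetical stand-ins for your FTP pool, plantuml.jar and CherryPy bus steps):

    def produce_and_publish(input_file):
        # pure mechanism: every step either succeeds or raises
        buffer = read_file(input_file)   # may raise OSError
        conn = get_ftp_connection()      # may raise if the pool is exhausted
        send_to_ftp(conn, buffer)
        image = render_to_image(conn)
        push_to_bus(image)
        return image

Only the caller of produce_and_publish() need decide what any of those failures means to the client.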

For option (B) "return None on failure", this is where we decide that specific failures are in fact valid execution paths, and None is a valid function return, indicating some kind of null result. You might still raise exceptions of various types for invalid input in this case; the None is only for a well defined expected non-answer.

Regarding uncaught exceptions:

As you say, you don't want your whole app to abort. So while you may catch specific exception types at some inner layer, you might want to catch _all_ exceptions at the very outermost layer and log them (with a stack trace), but not abort. So:

   try:
       ... process client request ...
   except Exception as e:
       # log exception and stack trace to the application log
       error("handler failed: %s", e, exc_info=True)
       ... return a 500-series web response to the client ...

This is one of those situations where you might use the normally reviled "catch all exceptions" anti-pattern: at the outermost layer of some kind of service programme such as a daemon or web app handling requests: report the exception and carry on with the application. Remember the Zen: errors should never pass silently. Always log something when you catch an exception.

Note that a primary reason to hate "catch all" is that such code often then proceeds to do more work with the bogus results. In a daemon or a web app, you're aborting _that request_. Any further work is shiny and new from a new request, not continuing with nonsensical data left around by a catch-all.

Fortunately web frameworks like Flask or CherryPy usually embed such a catch-everything in their handler logic, outside your own code (after all, what if your own catch-everything was buggy?) So you don't normally need to write one of these things yourself. Which is good really, most of the time - they are a recipe for accidentally hiding errors. Let the framework do that one - it has been debugged for you.

Another issue is the distinction between what to log and what to show the client. You usually DO NOT want to let the nitty gritty of the exception get to the end user: that way lies accidental leaking of credentials or private implementation details. So log details, but return fairly bland information to the client. Try to write your code so that this is the default behaviour. Again, web frameworks generally do just this in their outermost catch-all handler: only if you turn on some kind of DEBUG mode does it splurge private stuff over the web page for ease of debugging in development.
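
As a concrete illustration, in Flask that outermost hook might look like this (a sketch only - Flask installs an equivalent default handler for you, so you rarely write this yourself):

    from flask import Flask

    app = Flask(__name__)

    @app.errorhandler(Exception)
    def handle_uncaught(e):
        # full detail, including the stack trace, goes to the log...
        app.logger.error("unhandled exception: %s", e, exc_info=True)
        # ...while the client sees only something bland
        return "internal server error", 500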

Finally, I'm sure you've thought to yourself: if I catch an exception a long way from where it happened, won't the exception message lack all sorts of useful context about what happened? How useful is a log entry like this (from the outermost "OCR the document" level):

   error("OCR failed: %s", e)

producing:

   OCR failed: permission denied

because of a permission issue on a specific (but here, unnamed) file?

My own solution to this issue is my cs.pfx module (you can install this with "pip install cs.pfx").

This provides a context manager named Pfx which adorns exceptions with call stack information, totally under your control. It also has .error, .warning and similar methods which produce prefixed log messages.

Example:

   from cs.pfx import Pfx

   def read_file(input_file):
       with Pfx("read_file(%r)", input_file):
           with open(input_file, 'rb') as f:
               return io.BytesIO(f.read())

and outer calls might look like:

   def produce_image(image_name):
       with Pfx("produce_image(%r)", image_name):
           filename = path_to_image_file(image_name)
           buffer = read_file(filename)
           ... do stuff with the buffer ...

If the inner open fails, the exception message, which is originally like this:

   [Errno 2] No such file or directory: 'fffff'

becomes:

   produce_image('image_name'): read_file("/path/to/image_name.png"): [Errno 2] No such file or directory: '/path/to/image_name.png'

How much context you get depends on where you put the "with Pfx(...):" statements.

It also keeps your code simple, because you no longer need to pepper your own exceptions with repetitive context - just the core message:

   def read_file(input_file):
       with Pfx("read_file(%r)", input_file):
           if not input_file.startswith('/'):
               raise ValueError("must be an absolute path")
           with open(input_file, 'rb') as f:
               return io.BytesIO(f.read())

Because of the Pfx the ValueError gets the input_file value in question prefixed automatically, so you don't need to include it in your raise statement.
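
With that in place a bad call produces a message along the lines of:

   read_file('relative/path'): must be an absolute path

instead of a bare "must be an absolute path".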

Hoping all this helps.

Short takeaway: decide what's mechanism and what is policy, and try to put policy further out in higher level code.

Cheers,
Cameron Simpson <c...@cskk.id.au>

Go not to the elves for counsel, for they will say both no and yes.
- Frodo, The Fellowship of the Ring