At 03:26 AM 1/8/2011 -0800, Alice Bevan-McGregor wrote:
Warning: this assumes we're running on bizarro-world PEP 444 that mandates applications are generators. Please do not dismiss this idea out of hand but give it a good look and maybe some feedback. ;)

First-glance feedback: I'm impressed. You may have something going here after all. I just wish you'd sent this sooner. ;-)

I can easily see why I didn't think of this myself: I hadn't shifted my thinking to accommodate two important changes in the Python environment since the first WSGI spec, circa 2003-04:

1. Coroutines and decorators are ubiquitous and non-intrusive
2. WSGI has stdlib support, and in any event it is much easier to rely on non-stdlib packages

My major concern about the approach is still that it requires a fair amount of overhead on the part of both app developers and middleware developers, even if that overhead mostly consists of importing and decorating. (More below.)


The second middleware demonstration (using a decorator) makes middleware look a lot more like an application: it yields futures or a response, with the addition of yielding an application callable, something not explored in the first (long, but trivial) example. I believe this should cover 99% of middleware use cases, including interactive debugging, request routing, etc., and the syntax isn't too bad if you don't mind standardized decorators.

If we assume that the implementation would be in a wsgi2ref for Python 3.3 and distributed standalone for 2.x, I think we can make something work. (In the sense of practical to implement, not necessarily *desirable*.)

One of my goals is that it should be possible to write "async-naive" applications and middleware, so that people who don't care about async can ignore it.

On the application side, this is easy: a trivial decorator suffices to translate a return into a yield.
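
For example (just a sketch, and the decorator name is made up, not part of any spec), assuming an async-naive app is a plain callable returning the (status, headers, body) triple:

    def async_naive(app):
        # Hypothetical decorator: the wrapped callable *returns* its
        # (status, headers, body) triple; the wrapper turns that single
        # return into the single yield the generator protocol expects.
        def wrapper(environ):
            yield app(environ)
        return wrapper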

For middleware, it's not quite as simple, unless you have a pure ingress or egress filter, since you can't simply "call" the application. However, a "context manager"-like pattern applies, in which you simply yield the call to the wrapped application and get its response back.

Hm. This seems to pretty much generalize to a standard coroutine/trampoline pattern, where the server provides the trampoline, and can provide APIs in the environ to create waitable objects that can be yielded upward.

Actually, this is kind of like what I really wanted the futures PEP to be about. And it also preserves composability nicely.

In fact, it doesn't actually need any middleware decorators, if the server provides the trampoline.

We would leave your "my_awesome_application" example intact (possibly apart from having a friendlier API for reading from wsgi.input), but change my_middleware as follows:

   def my_middleware(app):
       def wrapper(environ):
           # pre-response code here
           response = yield app(environ)
           # post-response code here
           yield altered_response
       return wrapper

That's it.  No decorators, no nothing.

The server-level trampoline is then just a function that looks something like this:

    def app_trampoline(coroutine, yielded):
        if [yielded is a future of some sort]:
            [arrange to invoke 'coroutine(result)' upon completion]
            [arrange to invoke 'coroutine(None, exc_info)' upon error]
            return "pause"
        elif [yielded is a response]:
            return "return"
        elif [yielded has send/throw methods]:
            return "call"  # tell the coroutine to call it
        else:
            raise TypeError
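
For concreteness, here is one way the bracketed parts might be filled in by a server that hands out concurrent.futures-style future objects (PEP 3148); the isinstance checks, the Future type, and the Python 3 syntax are assumptions about that particular server, not part of the proposal:

    from concurrent.futures import Future

    def app_trampoline(coroutine, yielded):
        if isinstance(yielded, Future):
            def _done(future):
                # resume the coroutine with either the result or the error
                exc = future.exception()
                if exc is not None:
                    coroutine(None, (type(exc), exc, exc.__traceback__))
                else:
                    coroutine(future.result())
            yielded.add_done_callback(_done)
            return "pause"
        elif isinstance(yielded, tuple) and len(yielded) == 3:
            return "return"   # a (status, headers, body) response triple
        elif hasattr(yielded, 'send') and hasattr(yielded, 'throw'):
            return "call"     # tell the coroutine to call the subgenerator
        else:
            raise TypeError("don't know how to handle %r" % (yielded,))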

The trampoline function is used with a coroutine class like this:

    import sys   # needed for sys.exc_info() below

    class Coroutine:

        def __init__(self, iterator, trampoline, callback):
            self.stack = [iterator]
            self.trampoline = trampoline
            self.callback = callback
            self()   # start running immediately

        def __call__(self, value=None, exc_info=()):
            stack = self.stack
            while stack:
                try:
                    it = stack[-1]
                    if exc_info:
                        try:
                            rv = it.throw(*exc_info)
                        finally:
                            exc_info = ()
                    else:
                        rv = it.send(value)
                except BaseException:
                    value = None
                    exc_info = sys.exc_info()
                    if exc_info[0] is StopIteration:
                        # pass return value up the stack
                        value, = exc_info[1].args or (None,)
                        exc_info = ()   # but not the error
                    stack.pop()
                else:
                    switch = self.trampoline(self, rv)
                    if switch=="pause":
                        return
                    elif switch=="call":
                        stack.append(rv)  # Call subgenerator
                        value, exc_info = None, ()
                    elif switch=="return":
                        value, exc_info = rv, ()
                        stack.pop()

            # Coroutine is entirely finished
            self.callback(value)

And run by simply calling:

    Coroutine(app(environ), app_trampoline, process_response)

Where process_response() is a function that receives the final (status, headers, body) three-tuple and processes the actual result.

That's basically it. The Coroutine class is server/framework-independent; the minimal trampoline function is the part the server author has to write.
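
As a sanity check, here's a toy end-to-end run (Python 3) using the concrete app_trampoline sketch above, with a thread pool supplying real futures; demo_app, iter_body, and the executor are illustrative only, and the reentrancy issues a real server would have to care about are ignored:

    from concurrent.futures import ThreadPoolExecutor

    executor = ThreadPoolExecutor(max_workers=1)

    def iter_body(data):
        yield data    # body generator: yields bytes chunks

    def demo_app(environ):
        # pause on a future, then yield the response triple
        data = yield executor.submit(lambda: b"Hello, world!")
        yield '200 OK', [('Content-Type', 'text/plain')], iter_body(data)

    Coroutine(demo_app({}), app_trampoline, print)   # prints the response triple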

The body iterator can follow a similar protocol, but the trampoline function is different:

    def body_trampoline(coroutine, yielded):
        if type(yielded) is bytes:
            if len(coroutine.stack) == 1:  # only accept from outermost middleware
                [send the bytes out]
                [arrange to invoke coroutine() when send is completed]
                return "pause"
            else:
                return "return"
        elif [yielded is a future of some sort]:
            [arrange to invoke 'coroutine(result)' upon completion]
            [arrange to invoke 'coroutine(None, exc_info)' upon error]
            return "pause"
        elif [yielded has send/throw methods]:
            return "call"  # tell the coroutine to call it
        else:
            raise TypeError
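
Again, the bracketed parts are server-specific. A minimal concrete rendering, reusing the app_trampoline sketch from above for the non-bytes cases and a synchronous stand-in for the server's write primitive, might look like:

    import sys

    def write_chunk(data, when_done):
        # Stand-in for the server's real write primitive; a real server would
        # write asynchronously and schedule when_done() once the data is
        # flushed, rather than calling it inline like this.
        sys.stdout.buffer.write(data)
        when_done()

    def body_trampoline(coroutine, yielded):
        if isinstance(yielded, bytes):
            if len(coroutine.stack) == 1:    # outermost generator: transmit it
                write_chunk(yielded, when_done=coroutine)
                return "pause"
            else:                            # inner generator: hand the chunk upward
                return "return"
        # futures and subgenerators are handled exactly as in app_trampoline
        return app_trampoline(coroutine, yielded)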

So, part of the server's "process_response" callback would look like:

    Coroutine(body_iter, body_trampoline, finish_response)
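
Spelled out a little more (still only a sketch; send_headers() and close_connection() are stand-ins for whatever the server really does at those points):

    def send_headers(status, headers):
        print(status, headers)             # stand-in for the real header output step

    def close_connection():
        print("response complete")         # stand-in for connection teardown

    def process_response(result):
        status, headers, body = result     # the triple the app coroutine ended with
        send_headers(status, headers)
        Coroutine(body, body_trampoline, finish_response)

    def finish_response(_):
        close_connection()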


You can then implement response-processing middleware like this:

    def latinize_body(body_iter):
        while True:
            chunk = yield body_iter      # ask the trampoline to run the wrapped body
            if chunk is None:
                break                    # wrapped body iterator is exhausted
            else:
                yield piglatin(chunk)    # pass the transformed chunk outward

    def latinize(app):
        def wrapper(environ):
            s, h, b = yield app(environ)
            if [suitable for processing]:
                yield s, h, latinize_body(b)
            else:
                yield s, h, b  # skip body processing
        return wrapper
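
For completeness, the piglatin() helper used by latinize_body is assumed to be an ordinary bytes-to-bytes transform, something like the toy below (which conveniently ignores words split across chunk boundaries):

    def piglatin(chunk):
        # Toy transform: move each word's first letter to the end and add "ay".
        # Real code would have to buffer partial words across chunks.
        words = chunk.decode('latin-1').split(' ')
        out = [w[1:] + w[0] + 'ay' if w.isalpha() else w for w in words]
        return ' '.join(out).encode('latin-1')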


My overall impression is that there's something worth considering here, but there is still some ugly mental overhead involved for body-processing middleware, if we want to support pausing during the body iteration. The latinize_body function above isn't exactly intuitively obvious, compared to a for loop, and it can't be replaced by one without using greenlets.

On the plus side, it can actually all be done without any decorators at all.

(The next interesting challenge would be to integrate this with Graham's proposal for adding cleanup handlers...)
