Re: [Web-SIG] PEP 444 / WSGI 2 Async

Alice Bevan–McGregor Wed, 05 Jan 2011 20:03:53 -0800

[Apologies if this is a double- or triple-post; I seem to be having a stupid number of connectivity problems today.]


Howdy!

Apologies for the delay in responding; it’s been a hectic start to the new year. :)


On 2011-01-03, at 6:22 AM, Timothy Farrell wrote:

You don't know me but I'm the author of the Rocket Web Server (http://pypi.python.org/pypi/rocket) and have, in the past, been involved in the web2py community. Like you, I'm interested in seeing web development come to Python3. I'm glad you're taking up WSGI2. I have a feature-request for it that perhaps we could work in.

Of course; in fact, I hope you don’t mind that I’ve re-posted this response to the web-sig mailing list. Async needs significantly broader discussion. I would appreciate it if you could reply to the mailing list thread.

I would like to see futures added as a server option. This way, controllers could dispatch emails (or run some other blocking or long-running task) that would not block the web-response. WSGI2 Servers could provide a futures executor as environ['wsgi.executor'] that the app could use to offload processes that need not complete before the web-request is served to the client.

E-mail dispatch is one of the things I solved a long time ago with TurboMail; it uses a dedicated thread pool and can deliver > 100 unique messages per second (more if you use BCC) in the default configuration, so I don’t really see that one use case as one that can benefit from the futures module. Updating TurboMail to use futures would be an interesting exercise. ;)

I was thinking of exposing the executor as environ['wsgi.async.executor'], with 'wsgi.async' being a boolean value indicating support.
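A minimal sketch of how an application might use those two keys, assuming a PEP 444-style callable that returns a (status, headers, body) 3-tuple. The send_welcome_email function, the 'example.pending' key, and the hand-built environ are hypothetical stand-ins for illustration, not part of any proposal:

```python
from concurrent.futures import ThreadPoolExecutor


def send_welcome_email(address):
    # Hypothetical stand-in for a slow, blocking task.
    return "sent to " + address


def application(environ):
    # Only offload work if the server advertises async support.
    if environ.get('wsgi.async'):
        executor = environ['wsgi.async.executor']
        # submit() returns a Future immediately; the response is not delayed.
        environ['example.pending'] = executor.submit(
            send_welcome_email, 'user@example.com')
    return b'200 OK', [(b'Content-Type', b'text/plain')], [b'Welcome!']


# A real server would build environ; here the two relevant keys are faked.
with ThreadPoolExecutor(max_workers=2) as pool:
    environ = {'wsgi.async': True, 'wsgi.async.executor': pool}
    status, headers, body = application(environ)
    result = environ['example.pending'].result(timeout=5)
```

The application never constructs a Future itself; it only receives one back from submit().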

What should the server do with the future instances?

The executor returns future instances when running executor.submit/map; the application never generates its own Future instances. The application may, however, use whatever executor it sees fit; it can, for example, have one thread pool executor and one process pool, used for different tasks.

The server itself can utilize any combination of single-threaded IO-based async (see further on in this message), and multi-threaded or multi-process management of WSGI requests. Resuming suspended applications (ones pending future results) is an implementation detail of the server.

Should future.add_done_callback() be allowed? I'm not sure how practical/reliable this would be. (By the time the callback is called, the calling environment could be gone. Is this undefined behavior?)

If you wrap your callback in a partial(my_callback, environ), the environ will survive the end of the request/response cycle (due to the incremented reference count), so add_done_callback() should be allowed in order to enable intelligent behaviour in the callbacks. (Obviously the callbacks will not be able to deliver a response to the client at the time they are called; the body iterator can, however, wait for the future instance to complete and/or time out.)
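To illustrate the partial() trick, here is a small sketch using the standard concurrent.futures and functools modules. The names long_running, my_callback, and log are made up for the example, and the environ is a plain dict standing in for a real request environment:

```python
from concurrent.futures import ThreadPoolExecutor
from functools import partial

log = []


def long_running():
    # Hypothetical stand-in for a blocking task.
    return 42


def my_callback(environ, future):
    # partial() supplies environ; the futures machinery supplies the future.
    log.append((environ['PATH_INFO'], future.result()))


with ThreadPoolExecutor(max_workers=1) as pool:
    environ = {'PATH_INFO': '/report'}
    future = pool.submit(long_running)
    future.add_done_callback(partial(my_callback, environ))
    del environ  # the request's own reference goes away here...
# ...but the partial still holds the dict, so the callback can read it.
```

Leaving the with block waits for the pool to finish, so by then the callback has run with the environ it captured.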

A little bit later in this message I describe a better solution than the application registering its own callbacks.

Do we need to also specify what type of executor is provided (threaded vs. separate process)?

I think that’s an application-specific configuration issue, not really the concern of the PEP.

Do you have any thoughts about this?

I believe that intelligent servers need some way to ‘pause’ a WSGI worker rather than relying on the worker executing in a thread and blocking while waiting for the return value of a future. Using generator syntax (yield) with the following rules is my initial idea:

* The application may yield None. This is a polite way to have the async reactor (in the WSGI server/gateway) reschedule the worker for the next reactor cycle. Useful as a hint that “I’m about to do something that may take a moment”, allowing other workers to get a chance to perform work. (Cooperative multi-tasking on single-threaded async servers.)

* The application must yield one 3-tuple WSGI response, and must not yield additional data afterwards. This is usually the last thing the WSGI application would do, with possible cleanup code afterwards (before falling off the bottom / raising StopIteration / returning None).

* The application may yield Future instances returned by environ['wsgi.executor'].submit/map; the worker will then be paused pending execution of the future; the return value of the future will be returned from the yield statement. Exceptions raised by the future will be re-raised from the yield statement and can thus be captured in a natural way. E.g.:


        try:
            complex_value = yield environ['wsgi.executor'].submit(long_running)
        except Exception:
            pass  # handle exceptions generated from within long_running

Similar rules apply to the response body iterator: it yields bytestrings, may yield unicode strings where native strings are unicode strings, and may yield Future instances which will pause the body iterator as per the application callable.
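The yield rules for the application callable could be driven by a loop along these lines. This is only a sketch under simplifying assumptions: run_application is a hypothetical name, it blocks on each future where a real server would suspend the worker and resume it from its reactor, and it does not enforce the "nothing after the response" rule:

```python
from concurrent.futures import Future, ThreadPoolExecutor


def run_application(app, environ):
    """Drive a generator-style app until it finishes; return its response."""
    gen = app(environ)
    response = None
    to_send, to_throw = None, None
    while True:
        try:
            if to_throw is not None:
                item = gen.throw(to_throw)  # re-raise inside the app
            else:
                item = gen.send(to_send)    # resume with the future's value
        except StopIteration:
            return response
        to_send, to_throw = None, None
        if item is None:
            continue  # a real reactor would reschedule the worker here
        if isinstance(item, Future):
            try:
                to_send = item.result()  # blocking stand-in for a pause
            except Exception as exc:
                to_throw = exc
        else:
            response = item  # the single 3-tuple response


def long_running(x):
    return x * 2


def failing():
    raise ValueError("boom")


def application(environ):
    executor = environ['wsgi.executor']
    yield None  # polite hint: let other workers run
    value = yield executor.submit(long_running, 21)
    try:
        yield executor.submit(failing)
    except ValueError:
        value += 1  # the exception surfaced at the yield statement
    yield b'200 OK', [(b'Content-Type', b'text/plain')], [str(value).encode('ascii')]


with ThreadPoolExecutor(max_workers=1) as pool:
    status, headers, body = run_application(application, {'wsgi.executor': pool})
```

A body iterator could be driven the same way, with bytestrings passed through to the client instead of being treated as the response.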


Servers must:

* Allow configuration of the future implementation for options like threading / processes.


* Allow developers to override the executor completely.

* Provide additional attributes on wsgi.input: async_ prefixed versions of the read methods, which are factories returning server-specific Future instances. (Allowing a single-threaded async server to handle socket IO intelligently with select/epoll/etc.)
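As a rough illustration of the last requirement, the sketch below wraps a stream and adds an async_read method that returns a Future. AsyncInput is a hypothetical class, and because the data here is already buffered it resolves the future immediately; a real single-threaded server would instead register the socket with select/epoll and resolve the future once data arrives:

```python
import io
from concurrent.futures import Future


class AsyncInput:
    """Hypothetical wsgi.input wrapper; only read/async_read are sketched."""

    def __init__(self, stream):
        self._stream = stream

    def read(self, size=-1):
        # Ordinary blocking read, unchanged.
        return self._stream.read(size)

    def async_read(self, size=-1):
        # The server, not the application, creates this Future.
        future = Future()
        try:
            future.set_result(self._stream.read(size))
        except Exception as exc:
            future.set_exception(exc)
        return future


wsgi_input = AsyncInput(io.BytesIO(b'name=alice'))
chunk = wsgi_input.async_read(5).result()
rest = wsgi_input.async_read().result()
```

An application following the yield rules above would write `data = yield environ['wsgi.input'].async_read()` rather than calling result() itself.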

To the libraries you use, futures make async pretty much transparent. E.g., libraries (such as a DB layer) must not create their own Future objects, but must instead utilize an executor passed to them explicitly by the application.
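A small sketch of what that could look like for a library. UserStore and its in-memory rows are invented for the example; the point is only that the executor is injected by the application and every Future comes from that executor's submit():

```python
from concurrent.futures import ThreadPoolExecutor


class UserStore:
    """Hypothetical DB-layer class; it never creates futures or executors."""

    def __init__(self, executor):
        self._executor = executor  # supplied explicitly by the application
        self._rows = {1: 'alice', 2: 'bob'}

    def _fetch(self, user_id):
        # Stand-in for a blocking database query.
        return self._rows[user_id]

    def fetch(self, user_id):
        # The Future is produced by the application's executor.
        return self._executor.submit(self._fetch, user_id)


with ThreadPoolExecutor(max_workers=1) as pool:
    store = UserStore(pool)
    name = store.fetch(2).result()
```

Because the library only hands back whatever the executor returns, the application stays free to pass in a thread pool, a process pool, or a server-provided executor.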


My ideas thus far,

        — Alice.

P.s. a number of these ideas (wsgi.executor, wsgi.async, some of the yield syntax described above) have been soundly argued against by a co-conspirator over IRC. I’ll re-read my IRC logs and reply with those considerations in mind (and transcribed logs) shortly.

P.p.s. my kernel panicked while I was translating my rewrite into ReST; I'll re-do the conversion tonight or tomorrow morning and submit it downstream ASAP.



