Re: [Web-SIG] Server-side async API implementation sketches

Alice Bevan–McGregor Sun, 09 Jan 2011 03:36:39 -0800

On 2011-01-08 19:34:41 -0800, P.J. Eby said:

At 04:40 AM 1/9/2011 +0200, Alex Grönholm wrote:
09.01.2011 04:15, Alice BevanMcGregor kirjoitti:
I hope that clearly identifies my idea on the subject. Sinceasync>>servers will /already/ be implementing their own executors, Idon't>>see this as too crazy.
-1 on this. Those executors are meant for executing code in athread>pool. Mandating a magical socket operation filter herewould>considerably complicate server implementation.
Actually, the *reverse* is true. If you do it the way Alice proposes,my sketches don't get any more complex, because the filtering goes inthe executor facade or submit function.

Indeed; the executor is what then adds the file descriptor to theunderlying server async reactor (select/epoll/kqueue/other). In thecase of the Marrow server, this would utilize a reactor callback (somemight say "deferred") to update the Future instance with the data,setting completion status, executing callbacks, etc. One might even beable to use a threading.Event (or whatever is the opposite of a lock)to wake up blocking .result() calls, even if not multi-threaded(greenthreads, etc.).

Of course, adding the file descriptor to a pure async reactor then.result() blocking on it from your application would result in adeadlock; the .result() would never complete as the reactor would neverget a chance to perform the pending request. (This is why Marrowrequires threading be enabled globally before adding an executor to theenvironment; this requires rather explicit documentation.) Thisproblem is solved completely by yielding the future instance (pausingthe application) to let the reactor do its thing. (Yielding the futurebecomes a replacement for the blocking behaviour of future.result().)

Effectively what I propose adds emulation of threading on top of asyncby mutating an Executor. (The Executor would be a mixedthreading+async executor.)

I suggest bubbling a future back up the yield stack instead of theactual result to allow the application (or middleware, or whateverhappened to yield the future) to capture exceptions generated by thefuture'd request. Bubbling the future instance avoids excessiveexception handling cruft in each middleware layer; and I see no realissue with this. AFIK, you can use a shorthand (possibly wrapped in atry: block) if all you care about is the result:


   data = (yield my_future).result()

Truthfully, I don't really see the point of exposing the map() method(which is the only other executor method we'd expose), so it probablymakes more sense to just offer a 'wsgi.submit' key... which can be afunction as follows: [snip]

True; the executor itself could easily be hidden behind the filter. Ina multi-threaded environment, however, the map call poses no problem,and can be quite useful. (E.g. with one of my use cases for inclusionof an executor in the environment: image scaling.)

Granted, this might be a rather long function. However, since it'sessentially an optimization, a given server can decide how manyfunctions can be shortcut in this way. The spec may wish to offer aguarantee or recommendation for specific methods of certainstdlib-provided types (sockets in particular) and wsgi.input.

+1

Personally, I do think it might be *better* to offer extendedoperations on wsgi.input that could be used via yield, e.g. "yieldinput.nb_read()". But of course then the trampoline code hastorecognize those values instead of futures.

Because wsgi.input is provided by the server, and the executor isprovided by the server, is there a reason why these extended functionscouldn't return... futures? :)

Note, too, that this complexity also only affects servers that want tooffer a truly async API. A synchronous server has no reason to payparticular attention to what's in a future, since it can't offer anyperformance improvement.

I feel a sync server and async server should provide the same API foraccessing the input. E.g. the application/middleware must be agnosticto the server in this regard. This is why a little bit of magic goes along way. The following code would work on any WSGI2 stack that offersan executor (sync, async, or provided by middleware):


   data = (yield env['wsgi.submit'](env['wsgi.input'].read, 4096)).result()

In a sync server, the blocking read would execute in another thread.In an async one appropriate actions would be taken to request a socketread from the client. Both cases pause the application pending theresult. (If you don't immediately yield the future the behaviourbetween servers is the same!)

I do think that this sort of API discussion, though, is the mostdangerous part of trying to do an async spec. That is, I don'texpectthat everyone will spontaneously agree on the exact same API. Alice'sproposal (simply submitting object methods) has theadvantage ofseverely limiting the scope of API discussions. ;-)

Since each async server will either implement or utilize a specificasync framework, each will offer its own "async-supported" featureset.What I mean is that all servers should make wsgi.input callsasync-able, some would go further to make all socket calls async. Somemight go even further than that and define an API for externallibraries (e.g. DBs) to be truly cooperatively async. I do believe mysolution is flexible enough for the majority of use cases, and where itisn't (i.e. would block) "abusing" futures in this way will allow anapplication to reasonalby fake async without killing async server (whoare internally single-threaded) performance by delegating blockingcalls.

I will have to experiment with determining the type of the classinstance a method is bound to from the bound method itself; this is thecrux of the implementation I suggest. If you can't get that, the ideais pooched for anything but wsgi.input which the server would have adirect reference to anyway.

I hope the clarity of this post didn't degenerate too much over the fewhours I had it open and noodling around.


        - Alice.


_______________________________________________
Web-SIG mailing list
Web-SIG@python.org
Web SIG: http://www.python.org/sigs/web-sig
Unsubscribe: 
http://mail.python.org/mailman/options/web-sig/archive%40mail-archive.com

Re: [Web-SIG] Server-side async API implementation sketches

Reply via email to