[Twisted-Python] Waiting for a contended resource

Richard van der Hoff Mon, 12 Mar 2018 11:50:24 -0700

Hi folks,

I thought I'd poll the list on the best way to approach a problem inTwisted.

The background is that we have a number of resources which can berequested by a REST client, and which are calculated on demand. Thecalculation is moderately expensive (can take multiple seconds), so theresults of the calculation are cached so multiple lookups of the sameresource are more efficient.

The problem comes in trying to handle multiple clients requesting thesame resource at once. Obviously if 200 clients all request the sameresource at the same time, we don't want to fire off 200 calculationrequests.

The approach we adopted was, effectively, to maintain a lock for eachresource:

lock = defer.DeferredLock()
cached_result = None

@defer.inlineCallbacks
def getResource():
     yield lock.acquire()
     try:
         if cached_result is None:
             cached_result = yield do_expensive_calculation()
         defer.returnValue(cached_result)
     finally:
         lock.release()

(Of course one can optimise the above to avoid getting the lock if wealready have the cached result - I've omitted that for simplicity.)

That's all very well, but it falls down when we get more than about 200requests for the same resource: once the calculation completes, we cansuddenly serve all the requests, and the Deferreds returned byDeferredLock end up chaining together in a way that overflows the stack.

I reported this as http://twistedmatrix.com/trac/ticket/9304 and, at thetime, worked around it by adding a call to reactor.callLater(0) into ourimplementation. However, Jean-Paul's comments on that bug implied thatwe were approaching the problem in completely the wrong way, and insteadwe should be avoiding queuing up work like this in the first place.

It's worth reiterating that the requests arrive from REST clients whichwe have no direct control over. We *could* keep track of the number ofwaiting clients, and make the API respond with a 5xx error or similar ifthat number gets too high, with the expectation that the client retries- but one concern would be that the load from the additional HTTPtraffic would outweigh any efficiency gained by not stacking up Deferreds.


So, I'd welcome any advice on better ways to approach the problem.

Richard

_______________________________________________
Twisted-Python mailing list
Twisted-Python@twistedmatrix.com
https://twistedmatrix.com/cgi-bin/mailman/listinfo/twisted-python

[Twisted-Python] Waiting for a contended resource

Reply via email to