On 03 Sep 2010, at 4:25 PM, Niklas Edmundsson wrote:

> This could even go a bit further with providing the cache implementation with a hint of when it would be polite of it to return. I think it would probably be easier if the cache implementation knows what's expected of it. Or?

That I've covered separately in the email about atomic commits.

> Also, if the client hangs up, will the cache implementation get the chance to finish its job (i.e. completing the caching of a file instead of starting over later on)?

That is a decision made by mod_cache itself rather than the implementation, but it's definitely possible. In theory, if mod_cache kept track of a downstream failure, it could respond by continuing to read from the backend and caching until done before returning the error; this would definitely work.
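The idea above can be sketched roughly as follows. This is a hypothetical illustration of the control flow, not mod_cache's actual code; the `backend`, `cache`, and `client` objects and their methods are assumptions for the sketch:

```python
def stream_response(backend, cache, client, chunk_size=8192):
    """Stream a backend response to both the cache and the client.

    If the write to the client fails, keep draining the backend into
    the cache so the entry is completed rather than abandoned, then
    report the failure. (Illustrative sketch only.)
    """
    client_alive = True
    while True:
        chunk = backend.read(chunk_size)
        if not chunk:
            break
        cache.write(chunk)
        if client_alive:
            try:
                client.write(chunk)
            except ConnectionError:
                client_alive = False  # client hung up: keep caching, stop sending
    cache.commit()  # entry is complete even if the client went away
    return client_alive
```

The key design point is that the backend read loop is decoupled from the client's fate: a downstream error only stops the client writes, not the caching.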

> A side-step from this: how would this interact with the thundering herd lock when a slow client is the first to access a large file while other fast clients also want to access it? Wouldn't this just be another variety of the "client gets bored before reply" scenario?

The thundering herd lock never holds back a client or makes a client wait. When the URL is completely uncached, the lock allows the first hit to start caching and passes all subsequent requests through without caching; this stops the huge race that used to occur when many requests attempted to cache the same file over and over until at least one response completed successfully. When the URL is already cached but has recently gone stale, the first hit is allowed to go to the backend and (hopefully) refresh the entry, while subsequent requests are served stale content with a Warning header (as per the RFC). There is a safety valve on the lock: it only lives for a few seconds, so if the request to the backend breaks, a fresh attempt to refresh the cache will be made a few seconds later.
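The decision logic described above can be sketched like this. The class and state names are assumptions for illustration, not the httpd implementation; note that `try_acquire` never blocks, it only decides which request gets to talk to the backend:

```python
import time

class HerdLock:
    """Non-blocking, time-limited lock: the safety valve is that a held
    lock simply expires after `ttl` seconds. (Illustrative sketch.)"""

    def __init__(self, ttl=5.0):
        self.ttl = ttl
        self.taken_at = None

    def try_acquire(self):
        now = time.monotonic()
        # Free, or the previous holder's backend request broke and
        # the lock has since expired.
        if self.taken_at is None or now - self.taken_at > self.ttl:
            self.taken_at = now
            return True
        return False

    def release(self):
        self.taken_at = None

def handle(lock, cached_entry, is_fresh):
    """Decide how to serve one request for a URL; never waits."""
    if cached_entry is None:
        if lock.try_acquire():
            return "fetch-and-cache"        # first hit starts caching
        return "fetch-without-caching"      # the rest pass straight through
    if is_fresh:
        return "serve-cached"
    if lock.try_acquire():
        return "refresh-from-backend"       # first hit refreshes the entry
    return "serve-stale-with-warning"       # others get stale content + Warning
```

Every request gets an answer immediately; the lock only arbitrates who refreshes.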

> Regarding the issue of the disk cache cramming the entire file into memory/address space, an alternative solution could be for the cache to return buckets pointing to the cached file, i.e. the cache itself consumes those pesky mmapped buckets. This way the cache could cache the file rather quickly, independent of the speed of the client, so that the caching finishes in a sane time and others can benefit from it even though the cache-initiating request is running at snail speed...

I've been keen to do this for a while. It would definitely solve the RAM problem, but not the time problem: copying 4GB of data from a slow disk can easily take minutes, and when Blu-ray images start becoming common the problem will get worse.
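A minimal sketch of the file-backed bucket idea, using assumed names rather than APR's actual bucket API: the bucket holds only a reference to a region of the cached file, and the bytes are read on demand at the client's pace, so a slow client never forces the file's contents to stay in memory:

```python
class FileBucket:
    """Reference to a byte range of a cached file on disk.

    Holds (path, offset, length) rather than the data itself;
    serving costs O(chunk) memory regardless of file size.
    (Illustrative sketch, not the APR bucket API.)
    """

    def __init__(self, path, offset, length):
        self.path = path
        self.offset = offset
        self.length = length

    def read(self, chunk_size=8192):
        """Yield the referenced region in small chunks, on demand."""
        with open(self.path, "rb") as f:
            f.seek(self.offset)
            remaining = self.length
            while remaining > 0:
                chunk = f.read(min(chunk_size, remaining))
                if not chunk:
                    break
                remaining -= len(chunk)
                yield chunk
```

The cache write can then complete at disk speed while each client drains its own `FileBucket` at whatever rate it manages.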

Regards,
Graham