Jonathan Zuckerman wrote:
On Tue, Jul 28, 2009 at 3:37 AM, S.A.<qmt...@yahoo.com> wrote:
...
Concurring with Jonathan about the free advice and the tenuous relevance to the main list topic, I'd nevertheless like to try to contribute.

My summary of the issue:
- there are N clients accessing the site
- each client is authenticated, with a client-id of some kind
- they all originally request the same URL
- the server however returns a page that can be different for each client, based on a server-side client profile selected as per the client-id
- the returned page is different because it includes, for each client, a different mixture of "items", based on the client profile
- each client gets a different selection of i items, but these i items are picked among a grand total of I items, which are themselves always the same
- you would like to cache at least part of these I items in memory, to speed up the responses to the clients

You haven't given us any hard numbers: how many clients there are, how many of them access the server concurrently, how many I items there really are, how large each I item is, how fast the server is, how much memory it has, or anything of the kind. You have mentioned that some of the I items were "media", which I personally tend to associate with "large", byte-wise.

My very first reaction would be to ask myself if it is all really worth it. Caching in memory, no matter how it's done, has a cost: a cost in design, in complexity, and in pure cache management. Modern operating systems already cache disk data, so if the same "object" is accessed frequently in a short period of time, it will in practice already be sitting in the OS's memory buffers. Below the OS level, good disk controllers also cache frequently accessed data. Below the controllers, the disks themselves have their own onboard cache. Caching it yet again, with a different piece of software, may just add overhead.
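To see that OS cache at work on a typical Linux box, look at the "cached" column reported by free (the figures below are invented, just to show the shape of the output):

    $ free -m
                 total       used       free     shared    buffers     cached
    Mem:          4096       3900        196          0        220       2800
    -/+ buffers/cache:        880       3216
    Swap:         2048          0       2048

Most of the "used" memory on a busy file server is really the OS holding recently read files in its page cache, ready to be served again without touching the disk.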

An additional aspect is that, if some of the objects are large and your server has limited memory, caching many such objects may fill up physical memory and cause the system to start swapping, which would have exactly the opposite effect to the one you're looking for.
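A quick way to check whether that is happening, again assuming a Linux server (the figures below are invented), is to watch the si/so columns of vmstat while the server is busy:

    $ vmstat 5
    procs -----------memory---------- ---swap-- -----io---- --system-- ----cpu----
     r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa
     1  0      0 200704 225280 2867200    0    0     5    12  150  300  4  1 94  1

If si and so stay at or near zero, the box is not swapping; sustained non-zero values there mean your cache is fighting the rest of the system for memory, and you've made things worse.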

On the other hand, for Apache to fetch an object from disk requires quite a bit of work, and all the more the deeper the object resides in the "document space", because Apache needs to "walk" the directory hierarchy, checking access and other rules at each level. So by organising your objects smartly on disk, to minimise the work Apache has to do to find and return each one, you may gain a whole lot of processing time.
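As a sketch of what I mean (the path /var/www/items is just an invented example, and this is Apache 2.2 syntax; adapt it to your own layout and version):

    # Keep the item tree shallow, and tell Apache not to look for
    # per-directory overrides at any level of the path.
    <Directory /var/www/items>
        # Plain FollowSymLinks (not SymLinksIfOwnerMatch) spares an
        # lstat() per path component.
        Options FollowSymLinks
        # AllowOverride None stops the search for .htaccess files in
        # /, /var, /var/www and /var/www/items on every request.
        AllowOverride None
        Order allow,deny
        Allow from all
    </Directory>

Each .htaccess lookup and each symlink-ownership check is an extra filesystem operation per request, so on a deep hierarchy these two directives alone can save a surprising number of system calls.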

And servers nowadays are cheap. For the time and money you'd spend studying the best caching scheme, you could easily buy an extra server with terabytes of disk space and gigabytes of RAM to use as I/O cache.

So basically, what I am saying is: try it, without any clever caching scheme, but with a clever organisation of your data and an efficient Apache configuration. That /may/ show a problem and a bottleneck, which you can then tackle on its own merits. On the other hand, it may show no problem at all.
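ApacheBench (ab), which ships with Apache, is enough for a first measurement. Something like this (the URL, cookie name and numbers are invented; tune them to your expected load) gives you requests per second and a latency distribution before you invest in anything cleverer:

    $ ab -n 10000 -c 50 -C client-id=test123 http://your.server.example/some/item

-n is the total number of requests and -c the concurrency; -C sends a cookie, in case that is how your client-id travels. If the resulting numbers are already comfortably above your expected load, you're done.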

A lot of work has gone into Apache to make it as efficient as possible at serving content of all kinds. There are thousands of Apache sites handling thousands of clients and a lot of content. Do not spend a lot of time up front solving what may well be a non-existent problem. As Donald Knuth said a long time ago: premature optimisation is the root of all evil.
