On 22/01/2013 2:00 a.m., babajaga wrote:
>> Rock and COSS storage types however are far more optimized for speed,
>> using both disk and RAM storage in their normal "disk" configuration.

> Amos,

> haven't you been a little bit too "generous" in your comments, especially
> this quoted one?

I don't think so. They *have* been optimized for speed and are measurably so. I made no comment about the bug-free state of any of the disk I/O modules, just about speed versus a RAM disk.



> I looked at the docs both for COSS and Rock, and the following excerpts made
> me a bit skeptical:

> 1) COSS:
> Changes in 3.3 cache_dir
>      COSS storage type is lacking stability fixes from 2.6

> When I read such a statement, I refuse to use this feature in a production
> environment, even if it has a lot of speed advantages. One crash might
> wipe out all the speed advantages.

As it was intended. Until somebody wants to do the portage it is unlikely to change either. We have debated both removing COSS entirely and expending the effort to debug it fully; neither debate has come to a satisfactory conclusion yet. The developers do agree that Rock was designed to do the same things as COSS and does them a bit better, and that COSS is not worth our time fixing. If you or someone else has a different opinion, patches are still welcome (which is why we are required to leave the COSS code present in 3.2+).

Note also that it is referring to the squid-3 version of COSS. There were some bug fixes that went into squid-2.6, and COSS in 2.7 has a proven track record for high performance now. Rock was built on that 2.7 track record, with a few design fixes for lessons learned since COSS was created, plus SMP support.
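
For completeness, a 2.7-era COSS line looked something like the one below. The path and sizes are placeholders, not tuning advice; check the 2.7 squid.conf documentation for the authoritative option list:

    # squid-2.7 COSS example (placeholder values)
    cache_dir coss /var/spool/squid/coss 1024 max-size=524288 block-size=512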

> 2) Rock:
> http://wiki.squid-cache.org/Features/RockStore#limitations
> 2a) Rock store is available since Squid version 3.2.0.13. It has received
> some lab and limited deployment testing. It needs more work to perform well
> in a variety of environments, but appears to be usable in some of them.
> 2b) Objects larger than 32,000 bytes cannot be cached when cache_dirs are
> shared among workers.
> 2c) Current implementation uses OS buffers for simplicity.
>
> When reading 2a) I start to be cautious again :-)

Good. It is a new feature; the small number of people using it so far gives us enough confidence to promote it, but not to say it is bug-free. Problems may occur in a situation where nobody has tried using it. Also, we are aware that startup time is slower with Rock than we would like. That is all 2a means.

By all means be cautious. But please do not let that stop you testing or using it. The more people we have using it, the more confident we can be that it is bug-free.
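
If you do want to experiment, a minimal squid.conf sketch for a Rock test setup might look like the lines below; the path, sizes and worker count are placeholders for illustration only:

    # hypothetical Rock test setup on 3.2+ (placeholder values)
    workers 2
    # rock cache_dir shared between workers; max-size must stay under
    # the 32,000 byte limit mentioned in 2b) above
    cache_dir rock /var/spool/squid/rock 1024 max-size=31000

Squid builds the rock database file at that path before it can serve from it, which is part of why startup is slower than we would like.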


> 2b) tells me it very much depends upon the mean size/standard deviation of
> the cached objects whether using Rock really has an advantage. Might change
> in the future with Rock-large, though.
> 2c) makes the theoretical approach to evaluating the performance advantages
> of Rock almost impossible, because you always have to consider the
> filesystem used, with its respective options, which has a huge impact on
> performance. So the only serious approach right now to advocate possible
> performance advantages would be after quite some benchmarking, using real
> workloads, which certainly are very site specific.
> Because of the basic principle of Rock and Rock-large (which are like
> filesystems themselves), using raw disk I/O is possible in the future, at
> least, which MIGHT THEN justify a general statement "much more optimized
> for speed".

The COSS model is a slice model, in the same way that a disk-backed RAM-disk operates its swap pages. In both designs large chunks of memory are swapped in and out to fetch items stored somewhere within that chunk. Under the UFS-on-RAM-disk model those items would be allocated random disk locations by the generic disk manager, and each is swapped in individually only after being requested by the client. Under Rock/COSS, requests within a certain time range of each other are assigned slots within one memory page/chunk, such that a client loading a page causes, with high probability, the related objects, images and scripts to be swapped in and ready to be served directly from the RAM slice before the client requests them. Overall this means the latency of a first request is either the same as RAM or the same as disk I/O, PLUS the latency of followup related items is that of RAM *instead* of disk I/O, for a total net reduction in latency / gain in speed when loading a web page.
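
If it helps, here is a toy C++ sketch of that slot-within-chunk idea. This is illustration only, not Squid code, and every name in it is invented: objects stored close together in time land in the same chunk, and the first read of any of them swaps the whole chunk into RAM:

    // Toy model of slice/stripe storage (NOT Squid code; names invented).
    // Objects stored close together in time share a chunk; fetching any one
    // of them swaps the whole chunk in, making related objects RAM hits.
    #include <initializer_list>
    #include <iostream>
    #include <map>
    #include <string>
    #include <vector>

    struct Chunk {
        std::vector<std::string> objects; // objects packed into this stripe
        bool inRam = false;               // whether the stripe is swapped in
    };

    class SliceStore {
        std::vector<Chunk> chunks;
        std::map<std::string, size_t> index; // object key -> chunk number
        const size_t capacity;               // objects per chunk
    public:
        explicit SliceStore(size_t cap) : capacity(cap) {}

        // Writes always go to the newest chunk, so temporal neighbours
        // end up as spatial neighbours on disk.
        void store(const std::string &key) {
            if (chunks.empty() || chunks.back().objects.size() >= capacity)
                chunks.push_back(Chunk{});
            chunks.back().objects.push_back(key);
            index[key] = chunks.size() - 1;
        }

        // A read swaps the whole stripe in (one disk I/O); every other
        // object in the same stripe is then served from RAM.
        void fetch(const std::string &key) {
            Chunk &c = chunks[index.at(key)];
            std::cout << key << ": "
                      << (c.inRam ? "RAM hit" : "disk read, stripe now in RAM")
                      << "\n";
            c.inRam = true;
        }
    };

    int main() {
        SliceStore store(4);
        // a page and its related objects arrive close together in time
        for (const char *k : {"page.html", "style.css", "logo.png", "app.js"})
            store.store(k);
        store.fetch("page.html"); // disk I/O; swaps the whole stripe in
        store.fetch("style.css"); // RAM hit: same stripe
        store.fetch("logo.png");  // RAM hit: same stripe
    }

The UFS-on-RAM-disk model would instead pull each of those four objects in separately, on demand.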

As you can see, this is also very page-centric. If you are using Squid as a gateway for a web app which does not have that type of page-centric temporal linkage between its requests, the storage types become much closer in latency.

Yes, it is *complicated*, with a great many factors which we have not measured, or cannot measure, with any accuracy.

Amos
