Thanks, Andy, for this analysis, but unfortunately it doesn't come close
to the scale I'm dealing with. I have a couple of thousand Apache and
thttpd processes constantly hitting NFS shares for file stats on over a
terabyte of content (only a very small fraction of which is web
templates). We already have projects slated to migrate other sites that
will double the traffic; that much is definite, and we need to be ready
for the traffic to triple within the next 2-3 years. We are, however,
due for fresh benchmarking, which will have to be done anyway as
development proceeds on our rewrite. The previous benchmarking was
performed a few years ago by the CTO at the time and is no longer
available.
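
(Along the lines of Andy's one-liner below, something like the following
is probably where I'll start for the new numbers; it just compares what
a stat costs us on the NFS mount versus a locally replicated copy. The
paths are placeholders, not our real mounts.)

    #!/usr/bin/perl
    # Rough sketch for the upcoming benchmarking: NFS stat vs. local stat.
    use strict;
    use warnings;
    use Benchmark qw( cmpthese );

    my $nfs_copy   = '/nfs/content/templates/header.tt';  # placeholder NFS path
    my $local_copy = '/var/www/templates/header.tt';      # placeholder local replica

    cmpthese( 500_000, {
        nfs   => sub { my @s = stat $nfs_copy },
        local => sub { my @s = stat $local_copy },
    });
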
Certainly I can see how this thread could be read as preemptive
optimization. That's my fault for not giving the full scope of the issue
and just leaving it at "too much NFS activity." But I don't see it as
preemptive optimization. I see it as an unnecessary call beyond the
initial page load, not much different from frequently re-checking that
my chair still exists after I've sat down in it.
Once it's there and it's in use, it does not require re-validation. If
the chair were to break while I'm sitting in it, the entire process of
sitting down must be restarted: I get up, find a replacement chair and
then sit down again. It's the same thing with templates: if an error is
found in a template, a revision is made, which must be approved; the
revision then replaces the template and the servers are restarted. Think
of the templates less in the light of traditional web pages and more in
the light of Perl modules. Perl doesn't care if a module has changed, or
even if it has been deleted from disk, after it has been loaded. If you
want to pick up a changed library, you (typically) must bounce the
process.
This may sound a bit over the top, but it helps to ensure the integrity
of any code that could be used for processing credit cards. Only a few
people can approve these types of changes, while many people may have
their hands in the development of templates. As a further complication,
those who can approve changes cannot be involved in them beyond
reviewing the revision.

I've really been trying to avoid getting into much detail here; it's
time-consuming and borders on disclosing company policy. I was hoping
that simply stating that this is my need and asking "what is the
accepted approach with TT" would suffice, but it seems there isn't an
"accepted" approach.

For whatever reason, and whether it's accepted by the community or not,
I have a few goals in mind for our redesign that I'm hoping to come
close to meeting with TT. Here are a couple that are relevant to this
topic:
* Mark certain templates as "protected" so they cannot be modified after
being loaded, and reinstate the ability to modify non-sensitive pages
(which mostly eliminates this whole stat issue from my perspective;
statting would once again be needed only for the pages that remain
modifiable).
* Preload selected (primarily the protected) templates in the parent
Apache (1.3) process to ensure that changes can't sneak through as new
Apache children are spawned (a rough sketch of what I have in mind
follows this list).
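
To make that second point concrete, here's a minimal sketch of the kind
of startup.pl I have in mind, assuming mod_perl under Apache 1.3; the
paths and template names are placeholders rather than anything real.
The idea is simply to compile the sensitive templates in the parent
httpd before any children are forked, so every child starts from the
same approved, compiled copies:

    # startup.pl -- sketch only; paths and template names are made up
    use strict;
    use warnings;
    use Template;

    use vars qw( $TT );    # shared with the content handlers

    $TT = Template->new({
        INCLUDE_PATH => '/nfs/content/templates', # placeholder for the NFS mount
        COMPILE_DIR  => '/var/cache/tt',          # keep compiled forms on local disk
        COMPILE_EXT  => '.ttc',
    }) or die Template->error();

    # Force compilation now, in the parent, before any children exist.
    for my $name (qw( checkout/payment.tt checkout/confirm.tt )) {
        $TT->context->template($name);  # dies if the template can't be loaded
    }

    1;

Whether the compiled templates actually stay shared across the children
via copy-on-write is something I'd have to verify as part of the
benchmarking, but at minimum it guarantees the parent loaded an approved
copy before forking.
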
If these two goals in particular can be met with TT, then this issue is
resolved for me as soon as I find out how. Otherwise, I'm left with
locking down *all* template revisions until I come up with an
alternative. From what little I know about TT at this point, it might
mean subclassing Template::Provider, but as I mentioned, I'm new to TT
and I'd really prefer to keep my hands out of there until I become more
familiar with it.
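
For what it's worth, here's my rough guess at what such a subclass might
look like; it's only a sketch, the package and template names are mine,
and I'd welcome corrections from anyone who knows the Provider internals
better. The idea is to let a protected template go through the normal
fetch() once, then pin the compiled result so it is never statted or
reloaded again:

    package My::Provider::Protected;
    # Sketch only: a Template::Provider subclass that loads a "protected"
    # template once and never goes back to disk for it afterwards.
    use strict;
    use warnings;
    use base 'Template::Provider';

    # Template names that must never be re-read after the first load.
    my %PROTECTED = map { $_ => 1 }
                    qw( checkout/payment.tt checkout/confirm.tt );
    my %PINNED;    # name => compiled document, filled in on first fetch

    sub fetch {
        my ($self, $name, @args) = @_;

        # Anything not protected follows the normal stat/reload rules.
        return $self->SUPER::fetch($name, @args)
            if ref $name or not $PROTECTED{$name};

        # Already pinned: hand back the compiled copy without touching disk.
        return ($PINNED{$name}, undef) if exists $PINNED{$name};

        # First request: load it normally, then pin the result.
        my ($data, $error) = $self->SUPER::fetch($name, @args);
        $PINNED{$name} = $data unless $error;
        return ($data, $error);
    }

    1;

It would then be handed to Template via LOAD_TEMPLATES, along the lines
of Template->new({ LOAD_TEMPLATES => [ My::Provider::Protected->new({
INCLUDE_PATH => '/nfs/content/templates' }) ] }). Again, this is just
where my head is at right now, not something I've tested.
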
Locking down template revisions (in part or in whole) is a tiny detail
in the big picture, and it's not being done because I *want* to do it or
because I think it's the best approach (it's certainly not the easiest);
it's being done because I *must* do it to demonstrate a strict auditing
policy over any piece of code involved in a point of sale. We've not
yet settled on our final templating solution; I'm still working through
discovery, and so far TT is the front-runner. This entire issue may come
down to my head being stuck in previous solutions that I really need to
rethink. But I was just looking for a response on how this has been
dealt with previously by experienced TT users (which I think was the
point of the original post on this thread). I joined this thread simply
because it sounded similar to what I will be dealing with.
--
Tim
Andy Wardley wrote:
Andy Lester wrote:
Actually, you'll only have half a million stat calls, which according to my
test below is less than a second of machine overhead per day.
perl -MBenchmark -e 'timethis(432_000, sub { stat $0 })'
timethis 432000: -1 wallclock secs ( 0.18 usr + 0.28 sys = 0.46 CPU)
@ 939130.43/s (n=432000)
Why? $Template::Provider::STAT_TTL is set to 1 (second) by default. That
means that each file is checked once a second, at most, regardless of how
many page impressions you're getting. That's 86k stat() calls per day
(60*60*24), per template used (which I assumed to be 5 in the calculation
above) = 432,000
And even if you were hitting stat() for every template, for every page, 20
million stat() calls is still only approx. 20 seconds of processor overhead
per day. That's pretty cheap.
You mention that you're mounted across NFS, which will certainly make things
a little slower. But if you're looking to speed things up, then replicating
the templates to a local filesystem is going to have a much greater impact
than trying to optimise away stat() calls.
So I think Andy's advice is sound: measure what you're doing, and be
sure that you're optimising the right thing.
I personally suspect that tuning out the stat() calls isn't going to save
you a great deal of time, but I could be wrong. So if you want to reduce the
number of stat calls, simply set STAT_TTL to a higher value.
$Template::Provider::STAT_TTL = 60;
HTH
A