(Repost plus Rasmus feedback -- as the original got chucked into the bin
-- ezmlm issues)
Hi,
I have a particular interest in performance optimisation of the
infrastructure stack used in shared hosting services, and this topic is
raised regularly at various application forums as well as on lists such
as this one. The last instance was the thread "APC and CGI" [1], and
I've CCed Marten and Rasmus, who had this discussion. The nub of it was
the points raised (i) by Marten, that there is a genuine need for
performance acceleration of LAMP stacks used in shared hosting
offerings; and (ii) by Rasmus, that there are technical difficulties
with doing this, so the work involved would be non-trivial, and "if you
are using a fork-per-request CGI model you are obviously not that
concerned about performance".
Can I offer an alternative perspective: developers and implementers use
shared hosting offerings for a number of reasons, and perhaps the two
major ones are that (i) most such implementers simply don't have the
administration skills to build and administer a dedicated/VM host
offering a LAMP stack, and (ii) yes, shared services are cheaper.
However, the main discriminant in choosing between shared and dedicated
hosting service architectures is request *volume*, not responsiveness.
I have yet to meet an admin or implementer who isn't concerned about
their application's or service's responsiveness. Also, despite the fall
in the relative price of VM offerings, the number of shared hosting
accounts offered by hosting providers exceeds the number of VM and
dedicated accounts by *more than an order of magnitude*, and such
providers now routinely offer "one-click" installation of heavyweight
applications such as WordPress, MediaWiki and phpBB.
Such applications usually have poor responsiveness on a typical shared
service. The main reason for this is NOT the php-cgi image activation
time (<100 msec on a current server) [2], but the infrastructure
architecture adopted by most hosts to achieve scaling. This is normally
achieved by using a farm of servers with separate dedicated tiers
providing web services, back-end D/Bs, and user storage. Modern servers
arranged in such a farm architecture can deliver ample MIPS to support
shared hosting solutions, so processing delays aren't usually a major
factor in responsiveness.
The main responsiveness killer is I/O delay. A single web request to
one of the MW-class applications can require loading roughly 100 PHP
modules and compiling perhaps 500K lines of source. The roughly 1 CPU
second needed to compile such a script set is non-trivial, but the
issue driving script response is normally the I/O delay associated with
the RPCs to the back-end NAS infrastructure needed to read the source
files. (The NAS filesystems are usually NFS-mounted with a short
acregmin, say 15s, so the in-server VFS attribute cache is usually
flushed between requests, and this I/O can therefore generate ~300
off-server RPCs as well as cache-miss physical I/O within the NAS
itself.)
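To put a rough number on that RPC cost, here is a back-of-envelope
sketch (the per-module RPC count and round-trip latency are my own
assumptions for illustration, not measurements from the thread):

```python
# Back-of-envelope for the off-server RPC cost described above.
# Assumptions (mine): ~3 RPCs per module (lookup + getattr + read)
# and a ~0.5 ms NAS round-trip on a typical hosting LAN.
modules = 100            # PHP modules loaded per request
rpcs_per_module = 3      # lookup + getattr + read, roughly
rtt_ms = 0.5             # assumed per-RPC round-trip time

total_rpcs = modules * rpcs_per_module       # ~300, as above
io_wait_ms = total_rpcs * rtt_ms             # serialised I/O wait
print(total_rpcs, io_wait_ms)
```

Even with these optimistic figures, the serialised attribute and read
traffic alone adds well over 100 ms before any compilation starts.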
This I/O and compilation delay is largely avoidable by using a
per-script, cdb-style, file-based opcode cache. This would in effect
replace the compilation overhead and the assembly/input of the ~100 PHP
modules with a largely serial read of a single (compressed) opcode
file, and since NFSv4 effectively does bulk read-ahead for serial
access to files, the I/O overheads can be reduced by well over an order
of magnitude.
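The shape of the idea can be sketched in a few lines (this is an
illustration of the single-file principle only -- the function names and
the pickle/zlib format are mine, not the proposed cache format):

```python
import os
import pickle
import zlib

# Hypothetical sketch: pack the compiled form of many modules into one
# compressed cache file, so a later request does a single serial read
# (which NFS read-ahead handles well) instead of ~100 per-module reads.

def write_cache(cache_path, modules):
    """modules: dict of module name -> (stand-in) opcode bytes."""
    blob = zlib.compress(pickle.dumps(modules))
    tmp = cache_path + ".tmp"
    with open(tmp, "wb") as f:      # write the whole cache in one pass
        f.write(blob)
    os.replace(tmp, cache_path)     # atomic swap keeps readers safe

def read_cache(cache_path):
    """One serial read + decompress replaces many small file opens."""
    with open(cache_path, "rb") as f:
        return pickle.loads(zlib.decompress(f.read()))
```

The write-to-temp-then-rename step matters on shared storage: a reader
never sees a half-written cache, only the old file or the new one.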
I've demonstrated that this can be achieved at an application level
with phpBB by processing script hierarchies to marshal them into
condensed glob sets [3], and even this gives a ~3x speed-up, *but* such
processing is application-specific and deeply unpopular with the
application maintainers, whose usual answer is "use a dedicated LAMP VM
and APC or Xcache". This really needs to be supported at the PHP
extension level.
So what I am proposing is:
1) We develop an APC-lite extension offering code caching (but not
variable caching) in a manner that is transparent to the applications
sitting over it.
2) The main use case is to deliver performance acceleration for complex
applications such as MediaWiki, WordPress and phpBB implemented in a
shared hosting environment; it is targeted specifically at php-cgi/cli
environments.
3) Any cache strategy should be thread and process-safe, UID-specific
and normally SCRIPT_NAME-specific.
4) The file caches are piece-wise constant and are extended when
necessary using an incremental approach that overwrites the existing
cache with the extended version. No LRU or other cache-pruning schemes
are supported; the only refresh/prune option is to invalidate an
existing cache so that it is recreated on a subsequent script
execution.
5) The default mode of operation is "stat=0": the cache associated with
the SCRIPT_NAME is opened if it exists, and no source files are statted
or read. This means that a single cache file can serve an application
such as MW or WP.
6) The starting point for this Lite Program Cache (LPC) is a set of
specific and relevant modules from the existing APC code base (albeit
stripped of all code redundant in this use case).
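The request path implied by points 3 and 5 can be sketched as follows
(again purely illustrative -- the function names, key scheme and
callbacks are mine, not the extension's API):

```python
import hashlib
import os

# Sketch of the "stat=0" fast path: the cache is keyed per UID and
# SCRIPT_NAME (point 3), and when a cache file exists it is used
# without statting or reading any source file (point 5).

def cache_path(cache_dir, uid, script_name):
    # UID- and SCRIPT_NAME-specific key; sha1 here is just a stand-in
    key = hashlib.sha1(f"{uid}:{script_name}".encode()).hexdigest()
    return os.path.join(cache_dir, f"{uid}-{key}.lpc")

def activate(cache_dir, uid, script_name, compile_and_cache, run_cached):
    path = cache_path(cache_dir, uid, script_name)
    if os.path.exists(path):        # stat=0: one existence check only
        return run_cached(path)
    # Miss: fall back to a normal compile, then create the cache so
    # the next request for this SCRIPT_NAME takes the fast path.
    compile_and_cache(path)
    return run_cached(path)
```

Note that only the first request for a script pays the compile cost;
every subsequent request touches the single cache file and nothing
else, which is what makes the scheme NFS-friendly.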
I am proposing that *I* do this development, at least to a working
proof-of-concept extension. (I am not asking the current APC
maintainers to do any material work here, since this is a fork, though
I would very much welcome access and feedback for review of design docs
and code, and possibly advice on specific issues.) I've already
implemented a suitable DBA(cdb) replacement extension, since cdb
performs poorly in this use case, but I'll release the two extensions
as a set, and I am largely through reengineering the APC design from
its code base. It will take me a couple of months to complete this
first release. I can also push the code base from my local git
repository to GitHub in a few weeks if anyone wants review access.
This is really just an FYI to the community, but any comments and
feedback would be welcome. Failing this, I will post back here when I
have a working extension and hard performance data.
Regards Terry Ellison
PS. Since this is my first post to this DL, a short intro on myself: I
am an ex-IT dev with ~10 yrs of C/C++ dev experience -- mostly
infrastructure and realtime, as well as some more senior roles such as
systems architect. I am now early-retired, a "gentleman contributor"
keeping my hand in with code contributions to a few FLOSS projects. I
was also the sysadmin and maintainer of the OpenOffice.org user forums
and wiki for over 5 years, and lately a member of the Apache
Infrastructure team (though this has now lapsed, as I was unhappy with
the shift from the previous, more friendly Sun project ethos, so I am
now looking for another project to get my teeth into). I also answer Qs
on StackOverflow on PHP, Apache, using shared hosting services, etc. [3]
T
[1] http://news.php.net/php.pecl.dev/start/9807
[2] http://blog.ellisons.org.uk/article-44 also
http://blog.ellisons.org.uk/search-PHP for b/g this and related articles
[3] http://stackoverflow.com/users/1142045/terrye
On 27/09/12 16:44, Rasmus Lerdorf wrote:
That was quite a long message. I still don't believe in CGI-based
setups. It will be extremely painful to build what you propose. I think
a better option would be to look at php-fpm and figure out what is
missing from that to use it in a shared hosting environment. It already
supports APC and per-user process pools. The only missing piece is
probably related to spinning up pools on demand as opposed to starting
them all on server start.
-Rasmus