[gentoo-dev] Re: [gentoo-osx] Package testing -- Automated initiative

Kito Sun, 14 Aug 2005 07:04:29 -0700

Adding -dev to CC: in case someone has any meaningful input or hasalready tackled this problem...


On Aug 14, 2005, at 8:21 AM, Grobian wrote:

Introduction
============

Recently, once again we were confronted with a package marked as
ppc-macos stable, while it didn't compile at all, let alone run.It isbelieved more of these packages are in portage, and need to befound and
fixed.  Keeping the cause of why they are marked stable up to another
discussion, and out of the scope of this discussion, I will focuson how
to track these packages down and report them to us.
In the secondary line, all 'unstable' packages, marked ~ppc-macosshouldbe tested as well, since they can be faulty as well. Since for OSXmuchis in ~ppc-macos, many users consider it a normal procedure toswitch tothe unstable side of portage, hence some extra need for carefultesting
of ~ppc-macos also.


Proposed Global Structure
=========================

Testing should be done on a regular basis, both push and pull based.
This means that the testing machine would start testing packagesitself
if it is out of work, and on the other hand starts testing packages as
soon as they are being added/changed in CVS.  It may need no great
imagination to see that the latter 'push-based' activity has priority
over the 'pull-based' work.

I'm not sure direct interaction with CVS would be needed, usuallyonly takes a short time for cvs commits to hit the rsync mirrors(hence the volatile nature of the tree)

Starting over, will for the test machine mean that it starts cleaning
out its world file.  Cleaning this file out to a bare minimum is an
important aspect of getting a test environment that reflects the
situation on new user's machines.  If an ebuild uses a package without
having it in it's DEPEND, this may get noticed only when starting on a
clean machine.  This, however, will add a big delay in testing as many
packages will need to be built prior the right package can beinstalled.
The testing machine will have a queue file, which it reads packages to
emerge from.  If the queue file is empty, i.e. when there is no push
based work, the machine will generate work by starting to compile
uncompiled packages, or emptying the tree.
Because ~ppc-macos and ppc-macos packages interfere with each other-- a ~ppc-macos package overwrites a ppc-macos package -- bothstable and unstable have to be dealt with separately, i.e. theyshould both have their own environment either via two separatemachines, or through the use of a chroot jail.

I think seperate chroots are definitely the way to go. We can juststore a 'pristine' chroot in iso or dmg or whatever on the buildserver and copy when needed.

Queues
------
In order not to drag in a full DBMS (in the end Portage already isone) queues are just simple flat files consisting of absolutepackage names, one per line. Table wise locking granularity ishandled by the OS as one process opens the file in write mode.Consumers -- the testing box in this case -- read the first lineand delete it, while producers simply add one line (or more) to theend of the file.
The queue itself, is more a set than a list. This means thatpackages that are in the queue, should be unique. If a package isadded that is already in the queue, it is dropped such that theoriginal queue position of the package is maintained.

Maybe a 'proper' dbms wouldn't be such a bad idea, could also storebuild logs, timestamps, etc. there and make it easier for multiplebuild hosts to push/pull from a centralized server.

CVS Producer
------------
To catch up automatically with changes made to the tree, it isnecessary to act upon any commit to the tree for an ebuild file. Apossibility to do this would be via processing of CVS commitmessages, sent out as email by the CVS server. It is a task of theproducer to find out whether the ebuild found applies to thetesting machine (ppc-macos) and add the package/ebuild to the queue.
Consumer (testing process)
--------------------------
The test machine reads a line from the queue, and basicallyexecutes 'emerge ${PACKAGE}'. However, before doing this, first itfigures out which use flags can be used (emerge -pv) and whichdependencies will be pulled (emerge -pt). If portage returns themessage all ebuilds that could satisfy X have been masked, theemerge is cancelled, the line is removed from the queue and anemail message will be sent out.
All dependencies are put in the right order and emerged as normalpackages, that is: all dependencies are pushed at the front of thequeue, thereby keeping uniqueness of the queue and removingduplicates that appear later on in the queue. After this, theconsumer is restarted and reads again from the queue. This shouldresult in usually merging only one package at a time, and as suchquite isolated cases, which should improve the error emailnotification service.
Compile testing a package is supposed to be a thorough test thattries all possible combinations of the package's USE flags. Asthis might be somewhat endless as some packages are rather big andhave zillions of USE flags, it may be necessary to have a special"don't do it" file.Since all dependencies were put at the front of the queue, thereshould normally be no dependencies that the package pulls.If compilation fails for a certain USE-flag combination this isreported by sending out an email, and compilation of the next USE-flag combination is attempted.
When everything goes fine, no email notification is being sentout. A convenient log structure would, however, make it possibleto see which packages and USE-flag combinations successfully passedthrough. Providing this log via a web-page would be a usefulthing. Again backing this with a DBMS to allow easy searching,versioning and stuff is considered to be overhead, though craftinglogs in SQL's "INSERT INTO" format might enable another machine todisplay the output data. Perhaps the communication methods needs asection on itself.
Recap and Conclusion
====================
By setting up a testing system, it is possible to greatly improvethe Quality of Service of the portage tree for an architecture byexhaustive testing of both packages already in there, as well aspackages added or modified. Automated testing should not releasedevelopers from testing themselves, but should help in pointing outproblems that may arise on moving grounds such as portage wherepackages are constantly updated and dependencies might get broken.
ToDo
====
- Not only check dependencies of the respective package, but alsoconsider packages that depend on the respective package, thusrebuilding all packages that depend on the package to check ifanything is broken by the update.- Is there a gleptomaniac in the room? This would be useful forx86 also, of course. In that case it may be necessary to make surethe packages are split over multiple machines.- The message system needs more customisation options, especiallybacking things by a DBMS would allow for many nice bugzilla-likepreferences for email generation as well as web-based versionedinfo/report pages- To make the system even bigger, a central DBMS powered servermight take a leading role and ... {editor note: wait, stop it rightnow, you're going too fast right now}
By The Way
==========
- Kito offers his lil' chico as machine for this automated testinginitiative.- Comments are welcome, as well as expressions of worry on mymental state.- Implementation of described system will need some betterspecified system and needs some coding (the dirty work) in somelanguage...
--
Fabian Groffen
eBuild && Porting
Gentoo for Mac OS X

--
gentoo-osx@gentoo.org mailing list


--
gentoo-dev@gentoo.org mailing list

[gentoo-dev] Re: [gentoo-osx] Package testing -- Automated initiative

Reply via email to