On 04/03/2013 09:44 PM, Joern Rennecke wrote:
> Quoting Jeff Law <l...@redhat.com>:
>>>> Using distcc and ccache is trivial; I spread my builds across ~20
>>>> processors around the house...
>>>>
>>>> CC="distcc"
>>>> CXX="distcc g++"
>>>> CC_FOR_BUILD="distcc"
>>>> CXX_FOR_BUILD="distcc"
>>> It's not quite that simple if you want bootstraps and/or Canadian crosses.
>> It is for bootstraps.  Been using it for years.
>>
>> STAGE_CC_WRAPPER=distcc
>> STAGE_CXX_WRAPPER=distcc
> How does that work?
> The binaries have to get to all the machines of the cluster somehow.
NFS with wired connections. A mix of 100M and 1000M interfaces on the
boxes.
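Concretely, that kind of setup might look something like this on the NFS master; the host names, slot counts, and paths below are invented for illustration:

```shell
# Hypothetical sketch of a distributed GCC bootstrap from an NFS-shared
# build tree.  "fast1"/"fast2" and the slot counts are made-up examples.
export DISTCC_HOSTS="localhost/2 fast1/8 fast2/8"

cd /nfs/build/gcc-obj            # build directory visible to every box
../gcc/configure --disable-multilib

# The stage wrappers farm the stage2/stage3 compiles out through distcc;
# the stage1 compiles use whatever CC/CXX configure found.
make -j20 STAGE_CC_WRAPPER=distcc STAGE_CXX_WRAPPER=distcc
```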
> Does this assume you are using NFS or similar for your build directory?
> Won't the overhead of using that instead of local disk kill most of the
> parallelization benefit of a cluster over a single SMP machine?
What I've found works best is to have the machine with the disks handle
the configury, preprocessing, linking & java nonsense and farm all the
actual compilations to the rest of the cluster. I've manually
distributed testing through the years with varying degrees of success.
The net result is the gcc bootstrap itself can be parallelized well.
We're left with the configury serialization, java which isn't handled by
distcc, lameness in the multilib builds & testing as the big holdups.
We probably lose a little trying to distribute stuff like libgcc where
each file is so trivial.  Not surprisingly, those are the areas I
suggested for improvement.
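That split is mostly just a matter of how the distcc hosts list is weighted; something along these lines, where the machine names and slot counts are hypothetical:

```shell
# Hypothetical ~/.distcc/hosts: keep only a couple of compile slots on the
# master (it already handles the configury, preprocessing and linking
# locally) and give the rest of the cluster the bulk of the slots.
localhost/2 fast1/8 fast2/8 slow1/4 slow2/4
```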
I played with pump mode, which basically moves preprocessing to the
clients (by shipping them the headers).  That would push a significant
amount of the load off the master to the rest of the cluster, but I never
got it to work with bootstraps.
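For reference, pump mode is driven by wrapping the build in distcc's pump script, with each host tagged ",cpp,lzo"; a sketch with invented host names, not something I'm claiming works for bootstrap:

```shell
# Pump mode ships raw sources plus the needed headers to the clients, so
# each host entry needs the ",cpp,lzo" options and the whole build runs
# under the pump wrapper.  Host names here are hypothetical.
export DISTCC_HOSTS="fast1/8,cpp,lzo fast2/8,cpp,lzo"
pump make -j16 CC="distcc gcc" CXX="distcc g++"
```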
Jeff