I dont have the figures in front of me, but reliably 4 and 8 create a slowdown, 16 is about even, whilst 32 gives a slight gain. This is on an athlon-tbird 1.4g. On a p4, no flag, and 4 were the similar - havent got as far as the rest yet. As well as being processor dependent, I would expect it to be application dependent.
Makes sense when you read what man gcc says. BillK On Wed, 2003-10-22 at 03:49, [EMAIL PROTECTED] wrote: > If you could please clarify for me, your saying compiling with -falign-functions=32 > gives you gains? > > > On all the machines (athlon t-bird, p4) I have tried so far, -O3 always > > decreases performance - O2 is best (dramatically so on something like a > > celery). Most of the other options you see mentioned on the list show a > > small incremental advantage, except for -falign-functions=4 which also > > slows things down. Making this 32 gives a small advantage though. -- [EMAIL PROTECTED] mailing list