array access by constant mat tweak)

Jeff Law Thu, 21 Mar 2024 12:56:32 -0700



On 3/21/24 11:19 AM, Vineet Gupta wrote:


So if we go back to Robin's observation that scheduling dramatically
increases the instruction count, perhaps we try a run with
-fno-schedule-insns -fno-schedule-insns2 and see how the instruction
counts compare.


Oh yeah ! Robin hinted to this in Tues patchworks meeting too

default     : 2,565,319,368,591
128         : 2,509,741,035,068
256         : 2,527,817,813,612
no-sched{,2}: 1,295,520,567,376

Now we're getting somewhere.  That's in line with expectations.

I would strongly suspect it's -fno-schedule-insns rather than-fno-schedule-insns2. The former turns off scheduling before registerallocation, the second turns it off after register allocation. So ifour theory about spilling is correct, then it must be the first sincethe second won't affect register allocation. While I can speculateabout other potential scheduler impacts, spilling due to sched1'sactions is by far the most likely.

Given the magnitude here, I would bet we can see this pretty clearly ifyou've got function level or block level count data for those runs. I'dstart with that, ideally narrowing things down to a function or hot loopwithin a function which shows a huge delta.

From that we can then look at the IRA and LRA dumps and correlate whatwe see there with the before/after scheduling dumps to see how we'velengthened lifetimes in critical locations.

I'd probably start with the IRA dump. It's going to have annotations inits dump output like "Potential Spill" which may guide us. In simplestterms a pseudo is trivially allocatable when it has fewer neighbors inthe conflict graph than available hard registers. If it has moreneighbors in the conflict graph than available hard registers, then it'spotentially going to be spilled -- we can't know during this phase ofallocation.

As we pop registers off the coloring stack, some neighbors of the pseudoin question may end up allocated into the same hard register. That cansometimes result in a hard register being available. It might be easierto see with a graph


    a--b--c
       |
       d

Where a..d are pseudo registers. If two pseudos are connected by anedge, then they have overlapping lifetimes and can't be allocated to thesame hard register. So as we can see b conflicts with a, c & d. If weonly have two hard registers, then b is not trivially colorable and willbe marked as a potential spill.

During coloring we may end up allocating a, c & d to the same hardregister (they don't conflict, so its safe). If that happens, thenthere would be a register available for b.

Anyway, that should explain why b would be marked as a potential spilland how it might end up getting a hard register anyway.

The hope is we can see the potential spills increasing. At which pointwe can walk backwards to sched1 and dive into its scheduling decisions.


Jeff

Re: scheduler queue flush (was Re: [gcc-15 0/3] RISC-V improve stack/array access by constant mat tweak)

Reply via email to