Re: More aggressive threading causing loop-interchange-9.c regression

Aldy Hernandez via Gcc Fri, 10 Sep 2021 09:39:09 -0700



On 9/10/21 6:21 PM, Jeff Law wrote:

On 9/10/2021 10:05 AM, Aldy Hernandez wrote:
On 9/10/21 5:43 PM, Jeff Law wrote:
On 9/9/2021 3:21 AM, Aldy Hernandez wrote:
   /* If this path does not thread through the loop latch, then we are
      using the FSM threader to find old style jump threads. This
      is good, except the FSM threader does not re-use an existing
      threading path to reduce code duplication.

      So for that case, drastically reduce the number of statements
      we are allowed to copy.  */
*blink*
Woah. The backward threader has been using FSM threads indiscriminately as far as I can remember. I wonder what would break if we "fixed it".
?!? I'm not sure what you're suggesting here. If you s/FSM threader/backwards threader/ in the comment does it make more sense? The term FSM really should largely have been dropped as the backwards threader was improved to handle more cases.
back_threader_registry::register_path() uses EDGE_FSM_THREAD as the thread type to register threads. I was wondering if it should have been some combination of EDGE_START_JUMP_THREAD / EDGE_*_COPY_SRC_BLOCK, etc. I (purposely) know nothing about the underlying threading types ;-). But if the backwards threader has been improved, then perhaps we should just remove the confusing FSM references.
No we shouldn't change it to any of the other types. EDGE_FSM_THREAD means a thread found by the backwards threader and it's the key used to determine which of the two CFG updating mechanisms should be used, the generic copier in the case of EDGE_FSM_THREAD.
Changing the name, yes, absolutely. I probably should have done that when the original FSM threader was tweaked to handle generic threading.
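
(A toy sketch of the dispatch described above, for readers unfamiliar with the registry. This is not GCC's code: the enum, its values, and choose_cfg_updater() are hypothetical stand-ins, and the name used for the non-FSM mechanism is a placeholder since the thread does not name it. It only illustrates that the thread type acts as the key selecting one of two CFG updating mechanisms.)

```cpp
#include <string>

// Hypothetical stand-ins for the jump thread edge types mentioned in the
// thread; these are NOT the real GCC definitions.
enum toy_edge_type
{
  TOY_START_JUMP_THREAD,
  TOY_COPY_SRC_BLOCK,
  TOY_FSM_THREAD
};

// Pick a CFG updating mechanism from the thread type. Per the discussion,
// a thread found by the backwards threader (tagged FSM) goes to the
// generic copier; everything else goes to the other mechanism, which is
// given a placeholder name here.
std::string
choose_cfg_updater (toy_edge_type type)
{
  if (type == TOY_FSM_THREAD)
    return "generic copier";
  return "other updater";
}
```

Under these assumptions, renaming the tag would change only the enumerator's spelling, not the dispatch itself.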


I'll put that on my list.

As you've probably heard me mention before, all the EDGE_FSM_THREAD stuff in the registry really could be pulled out. The registry's purpose is to deal with the two stage nature of jump threading in DOM (find threads, then optimize them later). I don't think any of the backwards threading bits need that two stage handling.

Yeah yeah. But you said that a year ago, and all I heard was *mumble mumble/complicated things*. That was before I had much exposure to the code. Now I feel a lot more comfortable ;-).

I'll also put this on my list, but it may not get done this release, cause I'm running against the clock with the VRP/threader replacement, which is what's keeping us back from replacing VRP with an evrp instance right now :).

My current thinking is that replacing the forward VRP threader with a hybrid one is a gentler approach to the longer term goal of replacing the forward threader altogether. However, all the work I've been doing could go either way: we could try the forward/VRP replacement or a hybrid approach. It will all use the path solver underneath.
And that's probably a reasonable intermediate step on the way towards removing the VRP threading.
My main problem with replacing the forward/VRP with a backward client is that the cost models are so different that it was difficult to compare how we fared. I constantly ran into threads the solver could handle just fine, but profitable_path_p was holding it back.
Yea.  Sorry about that tangle of insanity
FWIW, we get virtually everything the forward threader gets, minus a very few things. At least when I plug in the solver to the DOM/forwarder threader, it can solve everything it can (minus noise and floats).
So once you plug those bits in, we don't have to carry around the avail/copies tables for the threader anymore, right? That's a nice cleanup in and of itself.

Correct. For the VRP/hybrid approach I'm working on, there are no copies/avails. The solver can do everything they did. After all, it's an easier target, since VRP threading only happens on ints and without the IL changing constantly.

If you prefer a backward threader instance to replace the VRP/forward threader, I'm game. It's just harder to compare. Either way (backward threader or a hybrid forward+solver) uses the same underlying solver which is solid.
I think we can go hybrid, then look at the next step, which could well be bringing some consistency into the costing models.
c) DOM changes the IL as it goes. Though we could conceivably do the threading after DOM is done.
The only reason threading runs in parallel with DOM is so that it can use the context sensitive equivalences. With the infrastructure you're building, there's a reasonable chance we can move to a model where we run DOM (and in the long term a simpler DOM) and threading as distinct, independent passes.
Andrew mumbled something about replacing all of DOM eventually :-). Well, except that value-numbering business I bet.
Essentially a realization of Bodik's work in GCC. The nugget in there is it's a path sensitive optimizer. That's kind of what I've envisioned DOM turning into.
1. We separate out jump threading from DOM.
2. We replace the bulk of DOM with FRE
3. The remnants of DOM turn into a path sensitive optimizer (and for god's sake we don't want to call it DOM anymore :-)

Well, my tree has improvements to the solver for full path sensitive ranges (using ranger to resolve definitions outside of the path). And it also does path sensitive relations, though Andrew is overhauling them next week. So yeah, given a path, we could tell you all sorts of interesting things about it :).


Aldy
