Re: [PATCH 21/25] GCN Back-end (part 1/2).

Andrew Stubbs Mon, 12 Nov 2018 04:13:59 -0800

On 09/11/2018 19:11, Jeff Law wrote:

There's a ton of work related to reduction setup, updates and teardown.
  I don't guess there's any generic code we can/should be re-using.  Sigh.

I'm not sure what can be shared, or not, here. For OpenMP we don't have any special code, but OpenACC is much closer to the metal, and AMD GCN does things somewhat differently to NVPTX.

WRT your move patterns.  I'm a bit concerned about using distinct
patterns for so many different variants.  But they mostly seem confined
to vector variants.  Be aware you may need to squash them into a single
pattern over time to keep LRA happy.

As you might guess, the move patterns have been really difficult to get right. The added dependency on the EXEC register tends to put LRA into an infinite loop, and the fact that GCN vector moves are always scatter/gather (rather than a contiguous load/store from a base address) makes spills rather painful.
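To illustrate the scatter/gather point, here is a rough scalar model in C. It is not code from the patch, and it is only a sketch of the idea: the 64-lane width, the 64-bit `exec` mask layout, and all of the function names are assumptions made for illustration. It shows that a gather needs a full vector of per-lane addresses, gated by the mask, whereas a contiguous load needs only one base address.

```c
#include <stddef.h>
#include <stdint.h>

#define LANES 64 /* assumed number of lanes, for illustration only */

/* Contiguous load: a single base address, lane i reads base[i]. */
static void contiguous_load(int32_t dst[LANES], const int32_t *base)
{
    for (size_t lane = 0; lane < LANES; lane++)
        dst[lane] = base[lane];
}

/* Building the per-lane address vector is extra work (and needs an
   extra vector's worth of storage) before any gather can happen.  */
static void make_lane_addresses(const int32_t *addr[LANES],
                                const int32_t *base)
{
    for (size_t lane = 0; lane < LANES; lane++)
        addr[lane] = base + lane;
}

/* Gather: every lane supplies its own address, and only lanes whose
   bit is set in the (hypothetical) exec mask are loaded.  Inactive
   lanes keep their previous contents.  */
static void masked_gather(int32_t dst[LANES],
                          const int32_t *const addr[LANES],
                          uint64_t exec)
{
    for (size_t lane = 0; lane < LANES; lane++)
        if (exec & (UINT64_C(1) << lane))
            dst[lane] = *addr[lane];
}
```

In this model, reloading a spilled vector means first materialising the address vector with something like `make_lane_addresses` and making sure the mask has every lane enabled, which is roughly why a spill costs more than a plain base-plus-offset load would.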


Thanks for your review, I'll have a V2 patch-set soonish.

Andrew
