On Fri, Apr 08, 2016 at 01:41:05PM -0700, Richard Henderson wrote:
> On 04/08/2016 11:10 AM, Bill Schmidt wrote:
> > The first is an issue with TOC-relative addresses on PowerPC.  These are
> > symbolic addresses that are to be loaded from a fixed slot in the table
> > of contents, as addressed by the TOC pointer (r2).  In the RTL phases
> > prior to register allocation, these are described in an UNSPEC that
> > looks like this for an example store:
> > 
> > (set (mem/c:DI (unspec:DI [
> >        (symbol_ref:DI ("*.LANCHOR0") [flags 0x182])
> >        (reg:DI 2 2)
> >       ] UNSPEC_TOCREL) [1 svul+0 S8 A128])
> >      (reg:DI 178))
> > 
> > The UNSPEC helps keep track of the r2 reference until this is split into
> > two or more insns depending on the memory model.
> 
> 
> That's why Alpha uses LO_SUM for pre-reload tracking of such things.
> 
> Even though that's a bit of a liberty, since there's no HIGH to go along with
> the LO_SUM.  But at least it allows the middle-end to continue to find the 
> symbol.

I wish I'd been made aware of the problem with alias analysis when I
invented this scheme for -mcmodel=medium code..

Back in gcc-4.3 days, when small-model code was the only option, we
used to generate
        mem (plus ((reg 2) (const (minus ((symbol_ref)
                                          (symbol_ref toc_base))))))
for a toc mem reference, which accurately reflects the addressing.

The problem is that when splitting this to a high/lo_sum you lose the
r2 reference in the lo_sum, and that allows r2 to die prematurely,
breaking an important linker code editing optimisation.

Hmm.  Maybe if we rewrote the mem to
        mem (plus ((symbol_ref toc_base) (const (minus ((symbol_ref)
                                                        (reg 2))))))
It might look odd, but is no lie.  r2 is equal to toc_base.  Or
perhaps we could lie a litte and simply omit the plus and toc_base
reference?

Either way, when we split to
        set (reg tmp) (high (const (minus ((symbol_ref) (reg 2)))))
        .. mem (lo_sum (reg tmp) (const (minus ((symbol_ref) (reg 2)))))
both high and lo_sum reference r2 and the linker could happily replace
rtmp in the lo_sum insn with r2 when the high address is known to be
zero.

Bill, do you have test cases for the alias problem?  Is this something
that needs fixing for gcc-6?

-- 
Alan Modra
Australia Development Lab, IBM

Reply via email to