Re: 回复：[PING] [PATCH RESEND] riscv: improve the cost model for loading a 64bit constant in rv32.

Palmer Dabbelt Mon, 28 Nov 2022 11:18:35 -0800

On Mon, 28 Nov 2022 11:15:01 PST (-0800), gcc-patches@gcc.gnu.org wrote:
>
>
> On 11/24/22 00:43, Sinan wrote:
>>>Â TheÂ motivationÂ ofÂ thisÂ patchÂ isÂ toÂ correctÂ theÂ wrongÂ estimationÂ 
>>>of
>>>>Â theÂ numberÂ ofÂ instructionsÂ neededÂ forÂ loadingÂ aÂ 64bitÂ constantÂ 
>>>>in
>>>>Â rv32Â inÂ theÂ currentÂ costÂ model(riscv_interger_cost).Â AccordingÂ to
>>>>Â theÂ currentÂ implementation,Â ifÂ aÂ constantÂ requiresÂ moreÂ thanÂ 3
>>>>Â instructions(riscv_const_insnÂ andÂ riscv_legitimate_constant_p),
>>>>Â thenÂ theÂ constantÂ willÂ beÂ putÂ intoÂ constantÂ poolÂ whenÂ expanding
>>>>Â gimpleÂ toÂ rtl(legitimate_constant_pÂ hookÂ andÂ emit_move_insn).
>>>>Â SoÂ theÂ inaccurateÂ costÂ modelÂ leadsÂ toÂ theÂ suboptimalÂ codegen
>>>>Â inÂ rv32Â andÂ theÂ wrongÂ estimationÂ partÂ couldÂ beÂ correctedÂ through
>>>>Â thisÂ fix.
>>>>
>>>>Â e.g.Â theÂ currentÂ codegenÂ forÂ loadingÂ 0x839290001Â inÂ rv32
>>>>
>>>>Â Â Â Â luiÂ Â Â Â Â a5,%hi(.LC0)
>>>>Â Â Â Â lwÂ Â Â Â Â Â a0,%lo(.LC0)(a5)
>>>>Â Â Â Â lwÂ Â Â Â Â Â a1,%lo(.LC0+4)(a5)
>>>>Â .LC0:
>>>>Â Â Â Â .wordÂ Â Â 958988289
>>>>Â Â Â Â .wordÂ Â Â 8
>>>>
>>>>Â outputÂ afterÂ thisÂ patch
>>>>
>>>>Â Â Â Â liÂ a0,958988288
>>>>Â Â Â Â addiÂ a0,a0,1
>>>>Â Â Â Â liÂ a1,8
>>>>
>>>>Â gcc/ChangeLog:
>>>>
>>>>Â Â Â Â Â Â Â Â Â Â *Â config/riscv/riscv.ccÂ (riscv_build_integer):Â 
>>>>HandleÂ theÂ caseÂ ofÂ loadingÂ 64bitÂ constantÂ inÂ rv32.
>>>>
>>>>Â gcc/testsuite/ChangeLog:
>>>>
>>>>Â Â Â Â Â Â Â Â Â Â *Â gcc.target/riscv/rv32-load-64bit-constant.c:Â NewÂ 
>>>>test.
>>>>
>>>>Â Signed-off-by:Â LinÂ SinanÂ <sinan....@linux.alibaba.com>
>>>>Â ---
>>>>Â Â Â gcc/config/riscv/riscv.ccÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â |Â 
>>>>23Â +++++++++++
>>>>Â Â Â .../riscv/rv32-load-64bit-constant.cÂ Â Â Â Â Â Â Â Â Â |Â 38Â 
>>>>+++++++++++++++++++
>>>>Â Â Â 2Â filesÂ changed,Â 61Â insertions(+)
>>>>Â Â Â createÂ modeÂ 100644Â 
>>>>gcc/testsuite/gcc.target/riscv/rv32-load-64bit-constant.c
>>>>
>>>>Â diffÂ --gitÂ a/gcc/config/riscv/riscv.ccÂ b/gcc/config/riscv/riscv.cc
>>>>Â indexÂ 32f9ef9ade9..9dffabdc5e3Â 100644
>>>>Â ---Â a/gcc/config/riscv/riscv.cc
>>>>Â +++Â b/gcc/config/riscv/riscv.cc
>>>>Â @@Â -618,6Â +618,29Â @@Â riscv_build_integerÂ (structÂ riscv_integer_opÂ 
>>>>*codes,Â HOST_WIDE_INTÂ value,
>>>>Â Â Â Â }
>>>>Â Â Â Â Â Â Â }
>>>>
>>>>Â +Â Â ifÂ ((valueÂ >Â INT32_MAXÂ ||Â valueÂ <Â INT32_MIN)Â &&Â 
>>>>!TARGET_64BIT)
>>>
>>>Â Nit.Â Â Â It'sÂ commonÂ practiceÂ toÂ haveÂ theÂ TARGETÂ testÂ firstÂ inÂ 
>>>aÂ seriesÂ of
>>>Â tests.Â Â ItÂ mayÂ alsoÂ beÂ advisableÂ toÂ breakÂ thisÂ intoÂ twoÂ lines.
>>>Â SomethingÂ likeÂ this:
>>>
>>>
>>>Â Â Â ifÂ ((!TARGET_64BIT)
>>>Â Â Â Â Â Â Â ||Â valueÂ >Â INT32_MAXÂ ||Â valueÂ <Â INT32_MIN)
>>>
>>>
>>>Â That'sÂ theÂ styleÂ mostÂ GCCÂ folksÂ areÂ moreÂ accustomedÂ toÂ reading.
>>
>> ThanksÂ forÂ theÂ tipsÂ andÂ IÂ willÂ changeÂ itÂ then.
>>
>>>>Â +Â Â Â Â {
>>>>Â +Â Â Â Â Â Â unsignedÂ HOST_WIDE_INTÂ lovalÂ =Â sext_hwiÂ (value,Â 32);
>>>>Â +Â Â Â Â Â Â unsignedÂ HOST_WIDE_INTÂ hivalÂ =Â sext_hwiÂ ((valueÂ -Â 
>>>>loval)Â >>Â 32,Â 32);
>>>>Â +Â Â Â Â Â Â structÂ riscv_integer_opÂ alt_codes[RISCV_MAX_INTEGER_OPS],
>>>>Â +Â Â Â Â Â Â Â hicode[RISCV_MAX_INTEGER_OPS];
>>>>Â +Â Â Â Â Â Â intÂ hi_cost,Â lo_cost;
>>>>Â +
>>>>Â +Â Â Â Â Â Â hi_costÂ =Â riscv_build_integer_1Â (hicode,Â hival,Â mode);
>>>>Â +Â Â Â Â Â Â ifÂ (hi_costÂ <Â cost)
>>>>Â +Â {
>>>>Â +Â Â Â lo_costÂ =Â riscv_build_integer_1Â (alt_codes,Â loval,Â mode);
>>>>Â +Â Â Â ifÂ (lo_costÂ +Â hi_costÂ <Â cost)
>>>
>>>Â JustÂ soÂ I'mÂ sure.Â Â "cost"Â hereÂ refersÂ strictlyÂ toÂ otherÂ 
>>>synthesized
>>>Â forms?Â IfÂ so,Â thenÂ ISTMÂ thatÂ we'dÂ wantÂ toÂ generateÂ theÂ newÂ 
>>>styleÂ when
>>>Â lo_costÂ +Â hi_costÂ <Â costÂ ORÂ whenÂ lo_costÂ +Â hi_costÂ isÂ lessÂ 
>>>thanÂ loading
>>>Â theÂ constantÂ fromÂ memoryÂ --Â whichÂ isÂ almostÂ certainlyÂ moreÂ thanÂ 
>>>"3"
>>>Â sinceÂ theÂ sequenceÂ fromÂ memoryÂ willÂ beÂ atÂ leastÂ 3Â instructions,Â 
>>>twoÂ of
>>>Â whichÂ willÂ hitÂ memory.
>>>
>>>
>>>Â Jeff
>>>
>>
>> Yes,Â almostÂ right.Â TheÂ basicÂ ideaÂ ofÂ thisÂ patchÂ isÂ toÂ improveÂ 
>> theÂ cost
>> calculationÂ forÂ loadingÂ 64bitÂ constantÂ inÂ rv32,Â insteadÂ ofÂ addingÂ 
>> aÂ new
>> wayÂ toÂ loadÂ constant.
>>
>> gccÂ nowÂ loadsÂ 0x739290001LLÂ inÂ rv32gcÂ withÂ threeÂ instructions,
>>  Â Â Â Â Â Â Â Â liÂ Â Â Â Â Â a0,958988288
>>  Â Â Â Â Â Â Â Â addiÂ Â Â Â a0,a0,1
>>  Â Â Â Â Â Â Â Â liÂ Â Â Â Â Â a1,7
>> However,Â whenÂ itÂ loadsÂ 0x839290001LL,Â theÂ outputÂ assemblyÂ becomes
>>  Â Â Â Â Â Â Â Â luiÂ Â Â Â Â a5,%hi(.LC0)
>>  Â Â Â Â Â Â Â Â lwÂ Â Â Â Â Â a0,%lo(.LC0)(a5)
>>  Â Â Â Â Â Â Â Â lwÂ Â Â Â Â Â a1,%lo(.LC0+4)(a5)
>>  Â Â Â Â .LC0:
>>  Â Â Â Â Â Â Â Â .wordÂ Â Â 958988289
>>  Â Â Â Â Â Â Â Â .wordÂ Â Â 8
>> TheÂ costÂ calculationÂ isÂ inaccurateÂ inÂ suchÂ cases,Â sinceÂ loadingÂ 
>> these
>> twoÂ constantsÂ shouldÂ haveÂ noÂ differenceÂ inÂ rv32Â (justÂ changeÂ `liÂ 
>> a1,7`
>> toÂ `liÂ a1,8`Â toÂ loadÂ theÂ hiÂ part).Â ThisÂ patchÂ willÂ takeÂ theseÂ 
>> cases
>> intoÂ consideration.
>>
> I think I see better what's going on.  This really isn't about the
> constant pool costing.  It's about another way to break down the
> constant into components.
>
> riscv_build_integer_1, for the cases we're looking at breaks down the
> constant so that high + low will give the final result.  It costs the
> high and low parts separately, then sums their cost + 1 for the addition
> step.
>
> Your patch adds another method that is specific to rv32 and takes
> advantage of register pairs.   You break the constant down into 32bit
> high and low chunks, where each chunk will go into a different 32 bit
> register.  You just then need to sum the cost of loading each chunk.
>
> For the constants in question, your new method will result in a smaller
> cost than the current method.   That's really the point of
> riscv_build_integer -- find the sequence and cost of creation.  We later
> use that information to determine if we should use that sequence or a
> constant pool.
>
> Palmer raised an issue on the tests with a request to not include the
> arch/abi specification.  But I think you addressed that in a later
> comment.  Specifically for rv64 we end up with another instruction,
> which would cause some constants to be considered cheaper as constant
> pool entries rather than inline sequences.
>
> Palmer is right in this seems like it ought to be generic, particularly
> breaking things down on word boundaries.  But I don't think adding that
> infrastructure should hold this patch up.  Reality is not much is
> happening with 32bit (or smaller) architectures and little is happening
> with 128bit integer types.  So there's not much motivation to fix this
> stuff more generically right now.


Seems reasonable to me, we can always promote it to something generic 
later if some other port wants something similar.

Re: 回复：[PING] [PATCH RESEND] riscv: improve the cost model for loading a 64bit constant in rv32.

Reply via email to