Re: [RISC-V][RVV] wide bitfield insertion & extractions to/from vector regs

Robin Dapp via Gcc Fri, 25 Jul 2025 00:19:17 -0700

There are two levels of dysfunction here:
1. Why spill & fill through the stack? Why not extract scalars from vregs
    directly into scalar regs?
2. Why involve scalar registers at all? Why not vslide or even vrgather, using
    temporary vregs as necessary?

That's how expmed does it. If vec_extract and friends or subregs don't work, we need to go via memory as a last resort.
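To make the two routes concrete, here is a minimal sketch in plain C that models them on an array rather than on real vector registers. It is not GCC's expansion code and does not use RVV intrinsics. The names `vreg_model`, `model_slidedown`, `model_first_element`, `extract128_via_slides`, and `extract128_via_memory`, and the choice of four 64-bit elements, are assumptions made for illustration. The slide model follows the usual vslidedown semantics (elements past the end read as zero), and the element-0 read stands in for vmv.x.s.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical register shape: four 64-bit elements (256 bits). */
#define NELEM 4

typedef struct { uint64_t e[NELEM]; } vreg_model;

/* Model of a slide-down: dst[i] = src[i + offset], zero past the end. */
static vreg_model model_slidedown(vreg_model src, size_t offset)
{
    vreg_model dst;
    for (size_t i = 0; i < NELEM; i++)
        dst.e[i] = (offset < NELEM && i < NELEM - offset) ? src.e[i + offset] : 0;
    return dst;
}

/* Model of moving element 0 of a vector register into a scalar. */
static uint64_t model_first_element(vreg_model v)
{
    return v.e[0];
}

/* Register-only route: slide the field down to element 0, then read
   the two 64-bit halves of the 128-bit field one at a time. */
static void extract128_via_slides(vreg_model v, size_t start, uint64_t out[2])
{
    assert(start + 2 <= NELEM);
    vreg_model s = model_slidedown(v, start);
    out[0] = model_first_element(s);
    out[1] = model_first_element(model_slidedown(s, 1));
}

/* Last-resort route: spill the whole register to a stack slot and
   reload only the bytes that make up the field. */
static void extract128_via_memory(vreg_model v, size_t start, uint64_t out[2])
{
    assert(start + 2 <= NELEM);
    uint64_t stack_slot[NELEM];
    memcpy(stack_slot, v.e, sizeof stack_slot);             /* spill */
    memcpy(out, &stack_slot[start], 2 * sizeof(uint64_t));  /* fill  */
}
```

Both functions produce the same two halves. The difference is only whether the data leaves the (modelled) register file on the way.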

The fatal deficiency seems to be that the backend lacks vec_extractNM patterns
for mode M bigger than ELEN. Here are some ideas:
1. Define scalar modes M larger than DI mode. Aarch64 defines TI, OI, and
    XI modes for 128, 256, and 512-bit integers (all of which are wider than
    the hardware supports).
2. Define vector modes M that are half, quarter, eighth, ... width of vector
    mode N. That can be done with mode iterators. We already have VLS_HALF
    and VLS_QUARTER, but there are no such iterators for the VLA modes.
    Note: there are no fractional LMUL modes defined for SEW=64, i.e., no
    RVVMF[248]DI.

Yeah, generally vec_extract with vector modes is the way to go, I'd say. That's a "VLS" line of thinking, though.

We cannot have RVVMF2DI and smaller when the minimum vector length is 64 bits. Increasing the minimum vector length helps but then we're not fully "VLA" anymore.
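The arithmetic behind that constraint can be sketched in a few lines of C, using the RVV relation VLMAX = LMUL * VLEN / SEW. The helper name `vlmax_fractional` and its parameters are assumptions for illustration, not anything taken from the GCC backend.

```c
#include <assert.h>

/* Number of whole elements in a fractional-LMUL register group:
   VLMAX = (VLEN / SEW) * LMUL, with LMUL = 1 / lmul_denominator.
   A result of 0 means the group cannot hold even one element. */
static unsigned vlmax_fractional(unsigned vlen_bits, unsigned sew_bits,
                                 unsigned lmul_denominator)
{
    assert(sew_bits != 0 && lmul_denominator != 0);
    return vlen_bits / (sew_bits * lmul_denominator);
}
```

With a 64-bit minimum vector length, `vlmax_fractional(64, 64, 2)` is 0, so a half-LMUL mode with 64-bit elements would have no room for a single element. Raising the minimum to 128 bits gives `vlmax_fractional(128, 64, 2) == 1`, which is the trade-off described above.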

How does aarch64 do it? Do the larger scalar modes help for your problem? They have those trn instructions, I guess, but doesn't their approach involve BIT_FIELD_REFs?

What is your approach, i.e. what code do you write? Do you start with C code or is this an autovec expansion? Couldn't you use vrgathers etc. right away?


--
Regards
Robin
