Re: [RFC v5 00/68] support vector extension v1.0
On Mon, Nov 9, 2020 at 6:09 PM Frank Chang wrote: > > ping 2nd~ I mentioned earlier that I will wait until this becomes a patch series to review it. Overall it looks like it's on the right track though. Alistair > > On Tue, Oct 20, 2020 at 3:42 PM Frank Chang wrote: >> >> On Wed, Sep 30, 2020 at 3:04 AM wrote: >>> >>> From: Frank Chang >>> >>> This patchset implements the vector extension v1.0 for RISC-V on QEMU. >>> >>> This patchset is sent as RFC because RVV v1.0 is still in draft state. >>> v2 patchset was sent for RVV v0.9 and bumped to RVV v1.0 since v3 patchset. >>> >>> The port is available here: >>> https://github.com/sifive/qemu/tree/rvv-1.0-upstream-v5 >>> >>> You can change the cpu argument: vext_spec to v1.0 (i.e. vext_spec=v1.0) >>> to run with RVV v1.0 instructions. >>> >>> Note: This patchset depends on two other patchsets listed in Based-on >>> section below so it might not able to be built unless those two >>> patchsets are applied. >>> >>> Changelog: >>> >>> v5 >>> * refactor RVV v1.0 check functions. >>> (Thanks to Richard Henderson's bitwise tricks.) >>> * relax RV_VLEN_MAX to 1024-bits. >>> * implement vstart CSR's behaviors. >>> * trigger illegal instruction exception if frm is not valid for >>> vector floating-point instructions. >>> * rebase on riscv-to-apply.next. >>> >>> v4 >>> * remove explicit float flmul variable in DisasContext. >>> * replace floating-point calculations with shift operations to >>> improve performance. >>> * relax RV_VLEN_MAX to 512-bits. >>> >>> v3 >>> * apply nan-box helpers from Richard Henderson. >>> * remove fp16 api changes as they are sent independently in another >>> pathcset by Chih-Min Chao. >>> * remove all tail elements clear functions as tail elements can >>> retain unchanged for either VTA set to undisturbed or agnostic. >>> * add fp16 nan-box check generator function. >>> * add floating-point rounding mode enum. >>> * replace flmul arithmetic with shifts to avoid floating-point >>> conversions. >>> * add Zvqmac extension. >>> * replace gdbstub vector register xml files with dynamic generator. >>> * bumped to RVV v1.0. >>> * RVV v1.0 related changes: >>> * add vlre.v and vsr.v vector whole register >>> load/store instructions >>> * add vrgatherei16 instruction. >>> * rearranged bits in vtype to make vlmul bits into a contiguous >>> field. >>> >>> v2 >>> * drop v0.7.1 support. >>> * replace invisible return check macros with functions. >>> * move mark_vs_dirty() to translators. >>> * add SSTATUS_VS flag for s-mode. >>> * nan-box scalar fp register for floating-point operations. >>> * add gdbstub files for vector registers to allow system-mode >>> debugging with GDB. >>> >>> Based-on: <20200909001647.532249-1-richard.hender...@linaro.org/> >>> Based-on: <1596102747-20226-1-git-send-email-chihmin.c...@sifive.com/> >>> >>> Frank Chang (62): >>> target/riscv: drop vector 0.7.1 and add 1.0 support >>> target/riscv: Use FIELD_EX32() to extract wd field >>> target/riscv: rvv-1.0: introduce writable misa.v field >>> target/riscv: rvv-1.0: add translation-time vector context status >>> target/riscv: rvv-1.0: remove rvv related codes from fcsr registers >>> target/riscv: rvv-1.0: check MSTATUS_VS when accessing vector csr >>> registers >>> target/riscv: rvv-1.0: remove MLEN calculations >>> target/riscv: rvv-1.0: add fractional LMUL >>> target/riscv: rvv-1.0: add VMA and VTA >>> target/riscv: rvv-1.0: update check functions >>> target/riscv: introduce more imm value modes in translator functions >>> target/riscv: rvv:1.0: add translation-time nan-box helper function >>> target/riscv: rvv-1.0: configure instructions >>> target/riscv: rvv-1.0: stride load and store instructions >>> target/riscv: rvv-1.0: index load and store instructions >>> target/riscv: rvv-1.0: fix address index overflow bug of indexed >>> load/store insns >>> target/riscv: rvv-1.0: fault-only-first unit stride load >>> target/riscv: rvv-1.0: amo operations >>> target/riscv: rvv-1.0: load/store whole register instructions >>> target/riscv: rvv-1.0: update vext_max_elems() for load/store insns >>> target/riscv: rvv-1.0: take fractional LMUL into vector max elements >>> calculation >>> target/riscv: rvv-1.0: floating-point square-root instruction >>> target/riscv: rvv-1.0: floating-point classify instructions >>> target/riscv: rvv-1.0: mask population count instruction >>> target/riscv: rvv-1.0: find-first-set mask bit instruction >>> target/riscv: rvv-1.0: set-X-first mask bit instructions >>> target/riscv: rvv-1.0: iota instruction >>> target/riscv: rvv-1.0: element index instruction >>> target/riscv: rvv-1.0: allow load element with sign-extended >>> target/riscv: rvv-1.0: register gather instructions >>> target/riscv: rvv-1.0: integer scalar move instructions >>> target/riscv: rvv-1.0: floating-
Re: [RFC v5 00/68] support vector extension v1.0
ping 2nd~ On Tue, Oct 20, 2020 at 3:42 PM Frank Chang wrote: > On Wed, Sep 30, 2020 at 3:04 AM wrote: > >> From: Frank Chang >> >> This patchset implements the vector extension v1.0 for RISC-V on QEMU. >> >> This patchset is sent as RFC because RVV v1.0 is still in draft state. >> v2 patchset was sent for RVV v0.9 and bumped to RVV v1.0 since v3 >> patchset. >> >> The port is available here: >> https://github.com/sifive/qemu/tree/rvv-1.0-upstream-v5 >> >> You can change the cpu argument: vext_spec to v1.0 (i.e. vext_spec=v1.0) >> to run with RVV v1.0 instructions. >> >> Note: This patchset depends on two other patchsets listed in Based-on >> section below so it might not able to be built unless those two >> patchsets are applied. >> >> Changelog: >> >> v5 >> * refactor RVV v1.0 check functions. >> (Thanks to Richard Henderson's bitwise tricks.) >> * relax RV_VLEN_MAX to 1024-bits. >> * implement vstart CSR's behaviors. >> * trigger illegal instruction exception if frm is not valid for >> vector floating-point instructions. >> * rebase on riscv-to-apply.next. >> >> v4 >> * remove explicit float flmul variable in DisasContext. >> * replace floating-point calculations with shift operations to >> improve performance. >> * relax RV_VLEN_MAX to 512-bits. >> >> v3 >> * apply nan-box helpers from Richard Henderson. >> * remove fp16 api changes as they are sent independently in another >> pathcset by Chih-Min Chao. >> * remove all tail elements clear functions as tail elements can >> retain unchanged for either VTA set to undisturbed or agnostic. >> * add fp16 nan-box check generator function. >> * add floating-point rounding mode enum. >> * replace flmul arithmetic with shifts to avoid floating-point >> conversions. >> * add Zvqmac extension. >> * replace gdbstub vector register xml files with dynamic generator. >> * bumped to RVV v1.0. >> * RVV v1.0 related changes: >> * add vlre.v and vsr.v vector whole register >> load/store instructions >> * add vrgatherei16 instruction. >> * rearranged bits in vtype to make vlmul bits into a contiguous >> field. >> >> v2 >> * drop v0.7.1 support. >> * replace invisible return check macros with functions. >> * move mark_vs_dirty() to translators. >> * add SSTATUS_VS flag for s-mode. >> * nan-box scalar fp register for floating-point operations. >> * add gdbstub files for vector registers to allow system-mode >> debugging with GDB. >> >> Based-on: <20200909001647.532249-1-richard.hender...@linaro.org/> >> Based-on: <1596102747-20226-1-git-send-email-chihmin.c...@sifive.com/> >> >> Frank Chang (62): >> target/riscv: drop vector 0.7.1 and add 1.0 support >> target/riscv: Use FIELD_EX32() to extract wd field >> target/riscv: rvv-1.0: introduce writable misa.v field >> target/riscv: rvv-1.0: add translation-time vector context status >> target/riscv: rvv-1.0: remove rvv related codes from fcsr registers >> target/riscv: rvv-1.0: check MSTATUS_VS when accessing vector csr >> registers >> target/riscv: rvv-1.0: remove MLEN calculations >> target/riscv: rvv-1.0: add fractional LMUL >> target/riscv: rvv-1.0: add VMA and VTA >> target/riscv: rvv-1.0: update check functions >> target/riscv: introduce more imm value modes in translator functions >> target/riscv: rvv:1.0: add translation-time nan-box helper function >> target/riscv: rvv-1.0: configure instructions >> target/riscv: rvv-1.0: stride load and store instructions >> target/riscv: rvv-1.0: index load and store instructions >> target/riscv: rvv-1.0: fix address index overflow bug of indexed >> load/store insns >> target/riscv: rvv-1.0: fault-only-first unit stride load >> target/riscv: rvv-1.0: amo operations >> target/riscv: rvv-1.0: load/store whole register instructions >> target/riscv: rvv-1.0: update vext_max_elems() for load/store insns >> target/riscv: rvv-1.0: take fractional LMUL into vector max elements >> calculation >> target/riscv: rvv-1.0: floating-point square-root instruction >> target/riscv: rvv-1.0: floating-point classify instructions >> target/riscv: rvv-1.0: mask population count instruction >> target/riscv: rvv-1.0: find-first-set mask bit instruction >> target/riscv: rvv-1.0: set-X-first mask bit instructions >> target/riscv: rvv-1.0: iota instruction >> target/riscv: rvv-1.0: element index instruction >> target/riscv: rvv-1.0: allow load element with sign-extended >> target/riscv: rvv-1.0: register gather instructions >> target/riscv: rvv-1.0: integer scalar move instructions >> target/riscv: rvv-1.0: floating-point move instruction >> target/riscv: rvv-1.0: floating-point scalar move instructions >> target/riscv: rvv-1.0: whole register move instructions >> target/riscv: rvv-1.0: integer extension instructions >> target/riscv: rvv-1.0: single-width averaging add and subtract >> instructions >> target/ri
Re: [RFC v5 00/68] support vector extension v1.0
On Wed, Sep 30, 2020 at 3:04 AM wrote: > From: Frank Chang > > This patchset implements the vector extension v1.0 for RISC-V on QEMU. > > This patchset is sent as RFC because RVV v1.0 is still in draft state. > v2 patchset was sent for RVV v0.9 and bumped to RVV v1.0 since v3 patchset. > > The port is available here: > https://github.com/sifive/qemu/tree/rvv-1.0-upstream-v5 > > You can change the cpu argument: vext_spec to v1.0 (i.e. vext_spec=v1.0) > to run with RVV v1.0 instructions. > > Note: This patchset depends on two other patchsets listed in Based-on > section below so it might not able to be built unless those two > patchsets are applied. > > Changelog: > > v5 > * refactor RVV v1.0 check functions. > (Thanks to Richard Henderson's bitwise tricks.) > * relax RV_VLEN_MAX to 1024-bits. > * implement vstart CSR's behaviors. > * trigger illegal instruction exception if frm is not valid for > vector floating-point instructions. > * rebase on riscv-to-apply.next. > > v4 > * remove explicit float flmul variable in DisasContext. > * replace floating-point calculations with shift operations to > improve performance. > * relax RV_VLEN_MAX to 512-bits. > > v3 > * apply nan-box helpers from Richard Henderson. > * remove fp16 api changes as they are sent independently in another > pathcset by Chih-Min Chao. > * remove all tail elements clear functions as tail elements can > retain unchanged for either VTA set to undisturbed or agnostic. > * add fp16 nan-box check generator function. > * add floating-point rounding mode enum. > * replace flmul arithmetic with shifts to avoid floating-point > conversions. > * add Zvqmac extension. > * replace gdbstub vector register xml files with dynamic generator. > * bumped to RVV v1.0. > * RVV v1.0 related changes: > * add vlre.v and vsr.v vector whole register > load/store instructions > * add vrgatherei16 instruction. > * rearranged bits in vtype to make vlmul bits into a contiguous > field. > > v2 > * drop v0.7.1 support. > * replace invisible return check macros with functions. > * move mark_vs_dirty() to translators. > * add SSTATUS_VS flag for s-mode. > * nan-box scalar fp register for floating-point operations. > * add gdbstub files for vector registers to allow system-mode > debugging with GDB. > > Based-on: <20200909001647.532249-1-richard.hender...@linaro.org/> > Based-on: <1596102747-20226-1-git-send-email-chihmin.c...@sifive.com/> > > Frank Chang (62): > target/riscv: drop vector 0.7.1 and add 1.0 support > target/riscv: Use FIELD_EX32() to extract wd field > target/riscv: rvv-1.0: introduce writable misa.v field > target/riscv: rvv-1.0: add translation-time vector context status > target/riscv: rvv-1.0: remove rvv related codes from fcsr registers > target/riscv: rvv-1.0: check MSTATUS_VS when accessing vector csr > registers > target/riscv: rvv-1.0: remove MLEN calculations > target/riscv: rvv-1.0: add fractional LMUL > target/riscv: rvv-1.0: add VMA and VTA > target/riscv: rvv-1.0: update check functions > target/riscv: introduce more imm value modes in translator functions > target/riscv: rvv:1.0: add translation-time nan-box helper function > target/riscv: rvv-1.0: configure instructions > target/riscv: rvv-1.0: stride load and store instructions > target/riscv: rvv-1.0: index load and store instructions > target/riscv: rvv-1.0: fix address index overflow bug of indexed > load/store insns > target/riscv: rvv-1.0: fault-only-first unit stride load > target/riscv: rvv-1.0: amo operations > target/riscv: rvv-1.0: load/store whole register instructions > target/riscv: rvv-1.0: update vext_max_elems() for load/store insns > target/riscv: rvv-1.0: take fractional LMUL into vector max elements > calculation > target/riscv: rvv-1.0: floating-point square-root instruction > target/riscv: rvv-1.0: floating-point classify instructions > target/riscv: rvv-1.0: mask population count instruction > target/riscv: rvv-1.0: find-first-set mask bit instruction > target/riscv: rvv-1.0: set-X-first mask bit instructions > target/riscv: rvv-1.0: iota instruction > target/riscv: rvv-1.0: element index instruction > target/riscv: rvv-1.0: allow load element with sign-extended > target/riscv: rvv-1.0: register gather instructions > target/riscv: rvv-1.0: integer scalar move instructions > target/riscv: rvv-1.0: floating-point move instruction > target/riscv: rvv-1.0: floating-point scalar move instructions > target/riscv: rvv-1.0: whole register move instructions > target/riscv: rvv-1.0: integer extension instructions > target/riscv: rvv-1.0: single-width averaging add and subtract > instructions > target/riscv: rvv-1.0: single-width bit shift instructions > target/riscv: rvv-1.0: integer add-with-carry/subtract-with-borrow > target/riscv: rvv-1.0: narrowing integer right shift
[RFC v5 00/68] support vector extension v1.0
From: Frank Chang This patchset implements the vector extension v1.0 for RISC-V on QEMU. This patchset is sent as RFC because RVV v1.0 is still in draft state. v2 patchset was sent for RVV v0.9 and bumped to RVV v1.0 since v3 patchset. The port is available here: https://github.com/sifive/qemu/tree/rvv-1.0-upstream-v5 You can change the cpu argument: vext_spec to v1.0 (i.e. vext_spec=v1.0) to run with RVV v1.0 instructions. Note: This patchset depends on two other patchsets listed in Based-on section below so it might not able to be built unless those two patchsets are applied. Changelog: v5 * refactor RVV v1.0 check functions. (Thanks to Richard Henderson's bitwise tricks.) * relax RV_VLEN_MAX to 1024-bits. * implement vstart CSR's behaviors. * trigger illegal instruction exception if frm is not valid for vector floating-point instructions. * rebase on riscv-to-apply.next. v4 * remove explicit float flmul variable in DisasContext. * replace floating-point calculations with shift operations to improve performance. * relax RV_VLEN_MAX to 512-bits. v3 * apply nan-box helpers from Richard Henderson. * remove fp16 api changes as they are sent independently in another pathcset by Chih-Min Chao. * remove all tail elements clear functions as tail elements can retain unchanged for either VTA set to undisturbed or agnostic. * add fp16 nan-box check generator function. * add floating-point rounding mode enum. * replace flmul arithmetic with shifts to avoid floating-point conversions. * add Zvqmac extension. * replace gdbstub vector register xml files with dynamic generator. * bumped to RVV v1.0. * RVV v1.0 related changes: * add vlre.v and vsr.v vector whole register load/store instructions * add vrgatherei16 instruction. * rearranged bits in vtype to make vlmul bits into a contiguous field. v2 * drop v0.7.1 support. * replace invisible return check macros with functions. * move mark_vs_dirty() to translators. * add SSTATUS_VS flag for s-mode. * nan-box scalar fp register for floating-point operations. * add gdbstub files for vector registers to allow system-mode debugging with GDB. Based-on: <20200909001647.532249-1-richard.hender...@linaro.org/> Based-on: <1596102747-20226-1-git-send-email-chihmin.c...@sifive.com/> Frank Chang (62): target/riscv: drop vector 0.7.1 and add 1.0 support target/riscv: Use FIELD_EX32() to extract wd field target/riscv: rvv-1.0: introduce writable misa.v field target/riscv: rvv-1.0: add translation-time vector context status target/riscv: rvv-1.0: remove rvv related codes from fcsr registers target/riscv: rvv-1.0: check MSTATUS_VS when accessing vector csr registers target/riscv: rvv-1.0: remove MLEN calculations target/riscv: rvv-1.0: add fractional LMUL target/riscv: rvv-1.0: add VMA and VTA target/riscv: rvv-1.0: update check functions target/riscv: introduce more imm value modes in translator functions target/riscv: rvv:1.0: add translation-time nan-box helper function target/riscv: rvv-1.0: configure instructions target/riscv: rvv-1.0: stride load and store instructions target/riscv: rvv-1.0: index load and store instructions target/riscv: rvv-1.0: fix address index overflow bug of indexed load/store insns target/riscv: rvv-1.0: fault-only-first unit stride load target/riscv: rvv-1.0: amo operations target/riscv: rvv-1.0: load/store whole register instructions target/riscv: rvv-1.0: update vext_max_elems() for load/store insns target/riscv: rvv-1.0: take fractional LMUL into vector max elements calculation target/riscv: rvv-1.0: floating-point square-root instruction target/riscv: rvv-1.0: floating-point classify instructions target/riscv: rvv-1.0: mask population count instruction target/riscv: rvv-1.0: find-first-set mask bit instruction target/riscv: rvv-1.0: set-X-first mask bit instructions target/riscv: rvv-1.0: iota instruction target/riscv: rvv-1.0: element index instruction target/riscv: rvv-1.0: allow load element with sign-extended target/riscv: rvv-1.0: register gather instructions target/riscv: rvv-1.0: integer scalar move instructions target/riscv: rvv-1.0: floating-point move instruction target/riscv: rvv-1.0: floating-point scalar move instructions target/riscv: rvv-1.0: whole register move instructions target/riscv: rvv-1.0: integer extension instructions target/riscv: rvv-1.0: single-width averaging add and subtract instructions target/riscv: rvv-1.0: single-width bit shift instructions target/riscv: rvv-1.0: integer add-with-carry/subtract-with-borrow target/riscv: rvv-1.0: narrowing integer right shift instructions target/riscv: rvv-1.0: widening integer multiply-add instructions target/riscv: rvv-1.0: single-width saturating add and subtract instructions target/riscv: rvv-1.0: integer comparison instructions target/riscv: rvv-1.0: flo