For v1, I had split this into 4 logically distinct parts. But apparently there are minor interdependencies, because the later sets would not apply standalone, says Alex.
Rather than tease them apart, and then have to undo that work in order to actually apply them later, I'll just lump them. So: Part 1, patches 1-7, tcg_gen_gvec_dup_imm, is reviewed. Part 2, patch 8, vector tail clearing, is reviewed, and I have moved the target/arm patches into a different queue. Part 3, patches 9-25, TYPE_CONST temporaries, is mostly unreviewed. Part 4, patch 26, load_dest for GVecGen2, a support patch for SVE2. Part 5, patches 27-36, add vector rotate patterns, is brand new. I include two demonstrators for target/ppc and target/s390x. It will also be used by SVE2. r~ Richard Henderson (36): tcg: Add tcg_gen_gvec_dup_imm target/s390x: Use tcg_gen_gvec_dup_imm target/ppc: Use tcg_gen_gvec_dup_imm target/arm: Use tcg_gen_gvec_dup_imm tcg: Use tcg_gen_gvec_dup_imm in logical simplifications tcg: Remove tcg_gen_gvec_dup{8,16,32,64}i tcg: Add tcg_gen_gvec_dup_tl tcg: Improve vector tail clearing tcg: Consolidate 3 bits into enum TCGTempKind tcg: Add temp_readonly tcg: Introduce TYPE_CONST temporaries tcg: Use tcg_constant_i32 with icount expander tcg: Use tcg_constant_{i32,i64} with tcg int expanders tcg: Use tcg_constant_{i32,vec} with tcg vec expanders tcg: Use tcg_constant_{i32,i64} with tcg plugins tcg: Rename struct tcg_temp_info to TempOptInfo tcg/optimize: Adjust TempOptInfo allocation tcg/optimize: Use tcg_constant_internal with constant folding tcg/tci: Add special tci_movi_{i32,i64} opcodes tcg: Remove movi and dupi opcodes tcg: Use tcg_out_dupi_vec from temp_load tcg: Increase tcg_out_dupi_vec immediate to int64_t tcg: Add tcg_reg_alloc_dup2 tcg/i386: Use tcg_constant_vec with tcg vec expanders tcg: Remove tcg_gen_dup{8,16,32,64}i_vec tcg: Add load_dest parameter to GVecGen2 tcg: Fix integral argument type to tcg_gen_rot[rl]i_i{32,64} tcg: Implement gvec support for rotate by immediate tcg: Implement gvec support for rotate by vector tcg: Remove expansion to shift by vector from do_shifts tcg: Implement gvec support for rotate by scalar tcg/i386: Implement INDEX_op_rotl[is]_vec tcg/aarch64: Implement INDEX_op_rotli_vec tcg/ppc: Implement INDEX_op_rot[lr]v_vec target/ppc: Use tcg_gen_gvec_rotlv target/s390x: Use tcg_gen_gvec_rotl{i,s,v} accel/tcg/tcg-runtime.h | 15 ++ include/exec/gen-icount.h | 25 +- include/tcg/tcg-op-gvec.h | 25 +- include/tcg/tcg-op.h | 30 +-- include/tcg/tcg-opc.h | 15 +- include/tcg/tcg.h | 53 +++- target/ppc/helper.h | 4 - target/s390x/helper.h | 4 - tcg/aarch64/tcg-target.h | 3 + tcg/aarch64/tcg-target.opc.h | 1 + tcg/i386/tcg-target.h | 3 + tcg/ppc/tcg-target.h | 3 + tcg/ppc/tcg-target.opc.h | 1 - accel/tcg/plugin-gen.c | 49 ++-- accel/tcg/tcg-runtime-gvec.c | 144 +++++++++++ target/arm/translate-a64.c | 10 +- target/arm/translate-sve.c | 12 +- target/arm/translate.c | 9 +- target/ppc/int_helper.c | 17 -- target/ppc/translate/vmx-impl.inc.c | 40 +-- target/ppc/translate/vsx-impl.inc.c | 2 +- target/s390x/translate_vx.inc.c | 107 ++------ target/s390x/vec_int_helper.c | 31 --- tcg/aarch64/tcg-target.inc.c | 32 ++- tcg/arm/tcg-target.inc.c | 1 - tcg/i386/tcg-target.inc.c | 195 ++++++++++----- tcg/mips/tcg-target.inc.c | 2 - tcg/optimize.c | 204 +++++++-------- tcg/ppc/tcg-target.inc.c | 47 ++-- tcg/riscv/tcg-target.inc.c | 2 - tcg/s390/tcg-target.inc.c | 2 - tcg/sparc/tcg-target.inc.c | 2 - tcg/tcg-op-gvec.c | 374 +++++++++++++++++++++++----- tcg/tcg-op-vec.c | 218 +++++++++++----- tcg/tcg-op.c | 232 ++++++++--------- tcg/tcg.c | 347 ++++++++++++++++++++------ tcg/tci.c | 4 +- tcg/tci/tcg-target.inc.c | 6 +- target/s390x/insn-data.def | 4 +- tcg/README | 7 +- 40 files changed, 1490 insertions(+), 792 deletions(-) -- 2.20.1