On Tue, Nov 30, 2021 at 5:21 AM Uros Bizjak <ubiz...@gmail.com> wrote: > > On Mon, Nov 29, 2021 at 10:48 AM Hongtao Liu <crazy...@gmail.com> wrote: > > > > On Mon, Nov 29, 2021 at 3:53 PM Uros Bizjak <ubiz...@gmail.com> wrote: > > > > > > On Mon, Nov 29, 2021 at 2:32 AM liuhongt <hongtao....@intel.com> wrote: > > > > > > > > There're several failures reported in [1]: > > > > 1. unsupported instruction `pextrw` for "pextrw $0, %xmm31, 16(%rax)" > > > > %vpextrw should be used in output templates. > > > > 2. ICE in get_attr_memory for movhi_internal since some alternatives > > > > are marked as TYPE_SSELOG. > > > > Explicitly set memory_attr for those alternatives. > > > > > > > > Also this patch fixs a typo and some latent bugs which are related to > > > > moving HImode from/to sse register w/o TARGET_AVX512FP16. > > Here are some more fixes: > thanks. > i386: Fix and improve movhi_internal and movhf_internal some more. > > An (*v,C) alternative can be added to movhi_internal to directly load > HImode constant 0 to xmm register. Also, V4SFmode moves can be used > for xmm->xmm moves instead of TImode moves when optimizing for size. > Fix invalid %vpinsrw insn template, which needs to duplicate %xmm > register for AVX targets. > > Optimize GPR moves in movhf_internal in the same way as in movhi_internal. > Fix pinsrw and pextrw templates for AVX targets. Use sselog1 > instead of sselog type. Also, handle TARGET_SSE_PARTIAL_REG_DEPENDENCY > and TARGET_SSE_SPLIT_REGS targets. > > 2021-11-29 Uroš Bizjak <ubiz...@gmail.com> > > gcc/ChangeLog: > > PR target/102811 > * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative. > Do not allocate non-GPR registers. Optimize xmm->xmm moves when > optimizing for size. Fix vpinsrw insn template. > (*movhf_internal): Fix pinsrw and pextrw insn templates for > AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves. > Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY > and TARGET_SSE_SPLIT_REGS targets. > > Bootstrapped and regression tested on x86_64-linux-gnu {,-m32} w/ and > w/o -mf16c. > > Pushed to master. > > Uros.
-- BR, Hongtao