Hello,

16/07/2025 12:34, Shreesh Adiga:
> Replace the clearing of lower 32 bits of XMM register with blend of
> zero register.
> Replace the clearing of upper 64 bits of XMM register with _mm_move_epi64.
> Clang is able to optimize away the AND + memory operand with the
> above sequence, however GCC is still emitting the code for AND with
> memory operands which is being explicitly eliminated here.
>
> Additionally replace the 48 byte crc_xmm_shift_tab with the contents of
> shf_table which is 32 bytes, achieving the same functionality.
>
> Signed-off-by: Shreesh Adiga <[email protected]>

Sorry, I'm not following. Could you please start by defining the goal of this patch? Is it a code simplification or a performance optimization?

