https://gcc.gnu.org/bugzilla/show_bug.cgi?id=125876
--- Comment #12 from vekumar at gcc dot gnu.org --- (In reply to Hongtao Liu from comment #11) > (In reply to vekumar from comment #9) > > (In reply to Sarvesh Chandra from comment #8) > > > Sure Richard, would an RFC to gcc-patches be apt for this issue? > > > > Can u check if latency wise vunpcklpd is better than vmovddup for register > > variants, based on that you can post patch against this PR. > > According to Agner Fog's table, vmovddup is same as vunpcklpd for thougput > and latency on Intel platform, but looks like Zen4/zen5 has better throuput > for vunpcklpd. Yes, on Zen5 vunpcklpd has better throughput compared to vmovddup.
