On Mon, Jan 13, 2014 at 07:40:16PM +0100, Uros Bizjak wrote: > An unrelated observation: gcc should figure out that %k1 mask register > can be used in all gather insns and avoid unnecessary copies at the > beginning of the loop.
I thought about that too, even started modifying sse.md, but then I read the spec and the AVX512F gather insns overwrite the mask register (like it does for the vector mask register in AVX2 case). Jakub