http://gcc.gnu.org/bugzilla/show_bug.cgi?id=53726
--- Comment #6 from Richard Guenther <rguenth at gcc dot gnu.org> 2012-06-20 12:31:22 UTC --- Btw, I cannot reproduce the slowdown on 64bit and the 32bit memcpy in glibc simply does a rep movsb; for any size lower than 20 bytes ... but as I have been told rep movsb; setup cost is prohibitively high on most Intel CPUs ... Thus, I suppose you should look at improving the memcpy implementation for small sizes on 32bits.