On Mon, Nov 14, 2011 at 9:03 AM, Jan Hubicka <hubi...@ucw.cz> wrote: > Hi, > this is hopefully final variant of patch. The epilogue code was broken in some > scenarios for memset, but should work safely now. I also fixed the tables for > core/buldozer/amdfam10 chips. > > But before it can be comitted, we need to reoslve copyright assignment issues. > You don't seem to be liested as having copyright assignment, does you company > have one? Otherwise, please try to get one soon. > > Honza > > 2011-11-14 Zolotukhin Michael <michael.v.zolotuk...@gmail.com> > Jan Hubicka <j...@suse.cz> > > * gcc.target/i386/sw-1.c: Force rep;movsb. > > * config/i386/i386.h (processor_costs): Add second dimension to > stringop_algs array. > * config/i386/i386.c (cost models): Initialize second dimension of > stringop_algs arrays. > (core_cost): New costs based on generic64 costs with updated stringop > values. > (promote_duplicated_reg): Add support for vector modes, add > declaration. > (promote_duplicated_reg_to_size): Likewise. > (processor_target): Set core costs for core variants. > (expand_set_or_movmem_via_loop_with_iter): New function. > (expand_set_or_movmem_via_loop): Enable reuse of the same iters in > different loops, produced by this function. > (emit_strset): New function. > (expand_movmem_epilogue): Add epilogue generation for bigger sizes, > use SSE-moves where possible. > (expand_setmem_epilogue): Likewise. > (expand_movmem_prologue): Likewise for prologue. > (expand_setmem_prologue): Likewise. > (expand_constant_movmem_prologue): Likewise. > (expand_constant_setmem_prologue): Likewise. > (decide_alg): Add new argument align_unknown. Fix algorithm of > strategy selection if TARGET_INLINE_ALL_STRINGOPS is set; Skip sse_loop > (decide_alignment): Update desired alignment according to chosen move > mode. > (ix86_expand_movmem): Change unrolled_loop strategy to use SSE-moves. > (ix86_expand_setmem): Likewise. > (ix86_slow_unaligned_access): Implementation of new hook > slow_unaligned_access. > * config/i386/i386.md (strset): Enable half-SSE moves. > * config/i386/sse.md (vec_dupv4si): Add expand for vec_dupv4si. > (vec_dupv2di): Add expand for vec_dupv2di.
This may have caused: http://gcc.gnu.org/bugzilla/show_bug.cgi?id=51134 -- H.J.