[Bug tree-optimization/50969] 17% degradation in 168.wupwise for interleave via permutation

2012-03-02 Thread wschmidt at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50969

--- Comment #6 from William J. Schmidt 2012-03-02 14:52:09 UTC ---
Author: wschmidt
Date: Fri Mar  2 14:51:58 2012
New Revision: 184787

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=184787
Log:
2012-03-02  Bill Schmidt 
Ira Rosen 

PR tree-optimization/50031
PR tree-optimization/50969
* targhooks.c (default_builtin_vectorization_cost): Handle
vec_promote_demote.
* target.h (enum vect_cost_for_stmt): Add vec_promote_demote.
* tree-vect-loop.c (vect_get_single_scalar_iteraion_cost): Handle
all types of reduction and pattern statements.
(vect_estimate_min_profitable_iters): Likewise.
* tree-vect-stmts.c (vect_model_promotion_demotion_cost): New function.
(vect_model_store_cost): Use vec_perm rather than vector_stmt for
statement cost.
(vect_model_load_cost): Likewise.
(vect_get_load_cost): Likewise; add dump logic for explicit realigns.
(vectorizable_type_demotion): Call vect_model_promotion_demotion_cost.
(vectorizable_type_promotion): Likewise.
* config/spu/spu.c (spu_builtin_vectorization_cost): Handle
vec_promote_demote.
* config/i386/i386.c (ix86_builtin_vectorization_cost): Likewise.
* config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Update
vec_perm for VSX and handle vec_promote_demote.


Modified:
branches/gcc-4_6-branch/gcc/ChangeLog
branches/gcc-4_6-branch/gcc/config/i386/i386.c
branches/gcc-4_6-branch/gcc/config/rs6000/rs6000.c
branches/gcc-4_6-branch/gcc/config/spu/spu.c
branches/gcc-4_6-branch/gcc/target.h
branches/gcc-4_6-branch/gcc/targhooks.c
branches/gcc-4_6-branch/gcc/tree-vect-loop.c
branches/gcc-4_6-branch/gcc/tree-vect-stmts.c
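
The log above adds a vec_promote_demote kind to enum vect_cost_for_stmt and teaches the default and target cost hooks to handle it. The following is only a rough sketch of that pattern, not the committed code: every identifier prefixed with sketch_ or SKETCH_ is hypothetical, the enum is a shortened stand-in, and the cost values are placeholders rather than GCC's numbers.

```c
#include <assert.h>

/* Hypothetical, shortened stand-in for a statement-kind enum.
   NOT GCC's actual enum vect_cost_for_stmt.  */
enum sketch_cost_kind {
  SKETCH_SCALAR_STMT,
  SKETCH_VECTOR_STMT,
  SKETCH_VEC_PERM,
  SKETCH_VEC_PROMOTE_DEMOTE,   /* the newly added kind */
  SKETCH_UNALIGNED_LOAD
};

/* Default hook: one generic cost per statement kind.
   The values are placeholders chosen for illustration.  */
int sketch_default_cost(enum sketch_cost_kind kind)
{
  switch (kind) {
  case SKETCH_SCALAR_STMT:
  case SKETCH_VECTOR_STMT:
  case SKETCH_VEC_PERM:
  case SKETCH_VEC_PROMOTE_DEMOTE:
    return 1;
  case SKETCH_UNALIGNED_LOAD:
    return 2;
  }
  assert(!"unhandled cost kind");
  return -1;
}

/* Hypothetical target override: this imaginary target finds
   widening/narrowing conversions more expensive and defers to the
   default for everything else.  */
int sketch_target_cost(enum sketch_cost_kind kind)
{
  if (kind == SKETCH_VEC_PROMOTE_DEMOTE)
    return 3;   /* placeholder value */
  return sketch_default_cost(kind);
}

/* Cost of a promotion/demotion that expands to n_stmts vector
   statements, each priced through the given hook.  */
int sketch_promotion_demotion_cost(int (*hook)(enum sketch_cost_kind),
                                   int n_stmts)
{
  assert(n_stmts >= 0);
  return n_stmts * hook(SKETCH_VEC_PROMOTE_DEMOTE);
}
```

With these placeholder values, sketch_promotion_demotion_cost(sketch_default_cost, 4) gives 4 and the same call with sketch_target_cost gives 12. A separate kind lets a target price conversions differently without touching the cost of ordinary vector statements.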


[Bug tree-optimization/50969] 17% degradation in 168.wupwise for interleave via permutation

2012-02-14 Thread wschmidt at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50969

--- Comment #5 from William J. Schmidt 2012-02-14 19:40:22 UTC ---
Author: wschmidt
Date: Tue Feb 14 19:40:13 2012
New Revision: 184225

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=184225
Log:
2012-02-14  Bill Schmidt 
Ira Rosen 

PR tree-optimization/50031
PR tree-optimization/50969
* targhooks.c (default_builtin_vectorization_cost): Handle
vec_promote_demote.
* target.h (enum vect_cost_for_stmt): Add vec_promote_demote.
* tree-vect-loop.c (vect_get_single_scalar_iteraion_cost): Handle
all types of reduction and pattern statements.
(vect_estimate_min_profitable_iters): Likewise.
* tree-vect-stmts.c (vect_model_promotion_demotion_cost): New function.
(vect_model_store_cost): Use vec_perm rather than vector_stmt for
statement cost.
(vect_model_load_cost): Likewise.
(vect_get_load_cost): Likewise; add dump logic for explicit realigns.
(vectorizable_type_demotion): Call vect_model_promotion_demotion_cost.
(vectorizable_type_promotion): Likewise.
* config/spu/spu.c (spu_builtin_vectorization_cost): Handle
vec_promote_demote.
* config/i386/i386.c (ix86_builtin_vectorization_cost): Likewise.
* config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Update
vec_perm for VSX and handle vec_promote_demote.


Modified:
branches/ibm/gcc-4_6-branch/gcc/ChangeLog.ibm
branches/ibm/gcc-4_6-branch/gcc/config/i386/i386.c
branches/ibm/gcc-4_6-branch/gcc/config/rs6000/rs6000.c
branches/ibm/gcc-4_6-branch/gcc/config/spu/spu.c
branches/ibm/gcc-4_6-branch/gcc/target.h
branches/ibm/gcc-4_6-branch/gcc/targhooks.c
branches/ibm/gcc-4_6-branch/gcc/tree-vect-loop.c
branches/ibm/gcc-4_6-branch/gcc/tree-vect-stmts.c


[Bug tree-optimization/50969] 17% degradation in 168.wupwise for interleave via permutation

2012-02-06 Thread wschmidt at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50969

William J. Schmidt  changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |RESOLVED
         Resolution|                            |FIXED

--- Comment #4 from William J. Schmidt 2012-02-06 21:41:47 UTC ---
Fixed with a simple permute cost change for now.  A better analysis of permutes
will be considered in 4.8.
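
As an illustration of why the price of a permute matters here, the following toy model (a sketch under stated assumptions, not GCC's cost model) compares one vectorized loop body against the scalar iterations it replaces. All names prefixed with toy_ or TOY_ and all numbers are hypothetical.

```c
#include <assert.h>

/* Hypothetical statement kinds, loosely named after the ones in the
   ChangeLog.  NOT GCC's actual enum.  */
enum toy_stmt_kind { TOY_VECTOR_STMT, TOY_VEC_PERM, TOY_KIND_COUNT };

/* Per-kind costs for an imaginary target.  */
struct toy_costs { int cost[TOY_KIND_COUNT]; };

/* Cost of one vectorized loop body containing n_arith ordinary vector
   statements and n_perm permutes (e.g. for interleaving).  */
int toy_vector_body_cost(const struct toy_costs *c, int n_arith, int n_perm)
{
  assert(n_arith >= 0 && n_perm >= 0);
  return n_arith * c->cost[TOY_VECTOR_STMT]
         + n_perm * c->cost[TOY_VEC_PERM];
}

/* Vectorization looks profitable in this toy model when one vector
   body is cheaper than the vf scalar iterations it replaces.  */
int toy_is_profitable(const struct toy_costs *c, int scalar_iter_cost,
                      int vf, int n_arith, int n_perm)
{
  assert(vf > 0);
  return toy_vector_body_cost(c, n_arith, n_perm) < scalar_iter_cost * vf;
}
```

For a body with 4 arithmetic statements and 6 permutes, a scalar iteration cost of 6 and a vectorization factor of 2, pricing permutes at 1 gives a vector cost of 10 against 12 (profitable), while pricing them at 3 gives 22 against 12 (not profitable). Charging interleave permutes under their own kind is what lets a target express that difference.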


[Bug tree-optimization/50969] 17% degradation in 168.wupwise for interleave via permutation

2012-02-06 Thread wschmidt at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50969

--- Comment #3 from William J. Schmidt 2012-02-06 21:39:38 UTC ---
Author: wschmidt
Date: Mon Feb  6 21:39:34 2012
New Revision: 183944

URL: http://gcc.gnu.org/viewcvs?root=gcc&view=rev&rev=183944
Log:
2012-02-06  Bill Schmidt  

PR tree-optimization/50969
* tree-vect-stmts.c (vect_model_store_cost): Correct statement cost to
use vec_perm rather than vector_stmt.
(vect_model_load_cost): Likewise.
* config/i386/i386.c (ix86_builtin_vectorization_cost): Change cost of
vec_perm to be the same as other vector statements.
* config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Revise
cost of vec_perm for TARGET_VSX.


Modified:
trunk/gcc/ChangeLog
trunk/gcc/config/i386/i386.c
trunk/gcc/config/rs6000/rs6000.c
trunk/gcc/tree-vect-stmts.c


[Bug tree-optimization/50969] 17% degradation in 168.wupwise for interleave via permutation

2011-11-03 Thread rguenth at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50969

--- Comment #2 from Richard Guenther 2011-11-03 08:19:01 UTC ---
Yes, sounds like a cost model issue.


[Bug tree-optimization/50969] 17% degradation in 168.wupwise for interleave via permutation

2011-11-02 Thread pthaugen at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=50969

--- Comment #1 from Pat Haugen 2011-11-02 21:38:28 UTC ---
I swapped the numbers; they should be:

-m64 -O3 -mcpu=power7
zaxpy : -79%
zscal : -24%

-m64 -O3 -mcpu=power7 -funroll-loops
zaxpy : -61%
zscal : -65%