On Wed, 2019-01-09 at 09:56 +0000, Jonathan Wakely wrote: > On Wed, 9 Jan 2019 at 09:50, Andrew Haley wrote: > > I don't agree. Sometimes vectorization is critical. It would be > > nice > > to have a warning which would fire if vectorization failed. That > > would > > surely help the OP. > > Dave Malcolm has been working on something like that: > https://gcc.gnu.org/ml/gcc-patches/2018-09/msg01749.html
Yes: this code is in trunk for gcc 9, but it doesn't help much for the case given elsewhere in this thread: #include <cmath> extern float data [ 32768 ] ; extern void vf1() { #pragma vectorize enable for ( int i = 0 ; i < 32768 ; i++ ) data [ i ] = std::sqrt ( data [ i ] ) ; } Compiling on this x86_64 box with -fopt-info-vec-missed shows the rather cryptic: g++ -c /tmp/sqrt-test.cc -O3 -mavx2 -fopt-info-vec-missed /tmp/sqrt-test.cc:8:24: missed: couldn't vectorize loop /tmp/sqrt-test.cc:8:24: missed: not vectorized: control flow in loop. /home/david/coding/gcc-python/gcc-svn-trunk/install-dogfood/include/c++/9.0.0/cmath:464:27: missed: statement clobbers memory: __builtin_sqrtf (_1); and with -fopt-info-vec-all-internals shows: g++ -c /tmp/sqrt-test.cc -O3 -mavx2 -fopt-info-vec-all-internals Analyzing loop at /tmp/sqrt-test.cc:8 /tmp/sqrt-test.cc:8:24: note: === analyze_loop_nest === /tmp/sqrt-test.cc:8:24: note: === vect_analyze_loop_form === /tmp/sqrt-test.cc:8:24: missed: not vectorized: control flow in loop. /tmp/sqrt-test.cc:8:24: missed: bad loop form. /tmp/sqrt-test.cc:8:24: missed: couldn't vectorize loop /tmp/sqrt-test.cc:8:24: missed: not vectorized: control flow in loop. /tmp/sqrt-test.cc:5:13: note: vectorized 0 loops in function. /home/david/coding/gcc-python/gcc-svn-trunk/install-dogfood/include/c++/9.0.0/cmath:464:27: note: === vect_slp_analyze_bb === /home/david/coding/gcc-python/gcc-svn-trunk/install-dogfood/include/c++/9.0.0/cmath:464:27: note: === vect_analyze_data_refs === /home/david/coding/gcc-python/gcc-svn-trunk/install-dogfood/include/c++/9.0.0/cmath:464:27: note: got vectype for stmt: _1 = data[i_12]; vector(8) float /home/david/coding/gcc-python/gcc-svn-trunk/install-dogfood/include/c++/9.0.0/cmath:464:27: missed: not vectorized: not enough data-refs in basic block. /home/david/coding/gcc-python/gcc-svn-trunk/install-dogfood/include/c++/9.0.0/cmath:464:27: missed: statement clobbers memory: __builtin_sqrtf (_1); /tmp/sqrt-test.cc:8:24: note: === vect_slp_analyze_bb === /tmp/sqrt-test.cc:8:24: note: === vect_analyze_data_refs === /tmp/sqrt-test.cc:8:24: note: got vectype for stmt: data[i_12] = _7; vector(8) float /tmp/sqrt-test.cc:8:24: missed: not vectorized: not enough data-refs in basic block. /tmp/sqrt-test.cc:10:1: note: === vect_slp_analyze_bb === /tmp/sqrt-test.cc:10:1: note: === vect_analyze_data_refs === /tmp/sqrt-test.cc:10:1: missed: not vectorized: not enough data-refs in basic block. I had to turn on -fdump-tree-all to try to figure out what that "control flow in loop" was; it seems to be a guard against the input to value being negative: <bb 3> [local count: 1063004407]: # i_12 = PHI <0(2), i_6(7)> # ivtmp_10 = PHI <32768(2), ivtmp_2(7)> # DEBUG i => i_12 # DEBUG BEGIN_STMT _1 = data[i_12]; # DEBUG __x => _1 # DEBUG BEGIN_STMT _7 = .SQRT (_1); if (_1 u>= 0.0) goto <bb 8>; [99.95%] else goto <bb 4>; [0.05%] <bb 8> [local count: 1062472912]: goto <bb 5>; [100.00%] <bb 4> [local count: 531495]: __builtin_sqrtf (_1); I'm not sure where that control flow came from: it isn't in sqrt-test.cc.104t.stdarg but is in sqrt-test.cc.105t.cdce so I think it's coming from the argument-range code in cdce. Arguably the location on the statement is wrong: it's on the loop header, when it presumably should be on the std::sqrt call. Shall I file a bugzilla about this? Dave