[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2017-01-28 Thread pinskia at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 --- Comment #14 from Andrew Pinski --- (In reply to Maxim Kuvyrkov from comment #12) > You are making an orthogonal point to this bug report: whether or not to > vectorize such a loop. But if loop is vectorized, then on any > microarchitecture

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2016-12-13 Thread rguenth at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 --- Comment #13 from Richard Biener --- (In reply to Maxim Kuvyrkov from comment #9) > I've looked into another case where inability to handle stores with gaps > generates sub-optimal code. I'm interested in spending some time on fixing > this,

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2016-12-12 Thread mkuvyrkov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 --- Comment #12 from Maxim Kuvyrkov --- (In reply to Andrew Pinski from comment #11) > (In reply to Maxim Kuvyrkov from comment #9) > > which then becomes for aarch64: > > .L4: > > ld2 {v0.2d - v1.2d}, [x1] > > add w2, w2, 1 > >

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2016-12-12 Thread pinskia at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 --- Comment #11 from Andrew Pinski --- (In reply to Maxim Kuvyrkov from comment #9) > I've looked into another case where inability to handle stores with gaps > generates sub-optimal code. I'm interested in spending some time on fixing > this, p

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2016-12-12 Thread mkuvyrkov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 --- Comment #10 from Maxim Kuvyrkov --- (In reply to Maxim Kuvyrkov from comment #9) > which then becomes for aarch64: > .L4: > ld2 {v0.2d - v1.2d}, [x1] > add w2, w2, 1 > cmp w2, w7 > eor v0.16b, v2.16b, v

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2016-12-12 Thread mkuvyrkov at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 Maxim Kuvyrkov changed: What|Removed |Added CC||mkuvyrkov at gcc dot gnu.org --- Commen

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2013-03-27 Thread rguenth at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 --- Comment #8 from Richard Biener 2013-03-27 11:27:31 UTC --- The issue is that we cannot use a vector v4sf store to &opoints[i][0] as opoints[i][4] is not stored to. Such "masked" store (or "interleaved store with gaps") is not supporte
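
The "store with gaps" problem described in comment #8 can be illustrated with a minimal sketch. This is not the test case attached to the bug: the names `in_pts`, `out_pts`, `transform_with_gap`, `gap_preserved`, `demo` and the size `NPOINTS` are hypothetical, chosen only for illustration. Each iteration writes three of the four floats in an output row, so a plain four-float vector store to the start of the row would also overwrite the fourth element, which the scalar loop never touches.

```c
#include <assert.h>
#include <stddef.h>

#define NPOINTS 8 /* hypothetical size, chosen only for illustration */

static float in_pts[NPOINTS][4];
static float out_pts[NPOINTS][4];

/* Writes columns 0..2 of every output row and leaves column 3 alone.
   Column 3 is the "gap": a full 4-float vector store to &out_pts[i][0]
   would clobber it, so the stores cannot simply be widened. */
static void transform_with_gap(const float m[3][4])
{
    for (size_t i = 0; i < NPOINTS; i++) {
        const float *v = in_pts[i];
        float *o = out_pts[i];
        for (size_t r = 0; r < 3; r++)
            o[r] = m[r][0] * v[0] + m[r][1] * v[1]
                 + m[r][2] * v[2] + m[r][3] * v[3];
    }
}

/* Returns 1 if every out_pts[i][3] still holds `sentinel`, else 0. */
static int gap_preserved(float sentinel)
{
    for (size_t i = 0; i < NPOINTS; i++)
        if (out_pts[i][3] != sentinel)
            return 0;
    return 1;
}

/* Fills the arrays, runs the loop, and checks the gap was not written. */
static void demo(void)
{
    static const float identity[3][4] = {
        {1, 0, 0, 0}, {0, 1, 0, 0}, {0, 0, 1, 0}
    };
    for (size_t i = 0; i < NPOINTS; i++) {
        in_pts[i][0] = (float)i;
        in_pts[i][1] = (float)i + 1.0f;
        in_pts[i][2] = (float)i + 2.0f;
        in_pts[i][3] = 1.0f;
        out_pts[i][3] = -1.0f; /* sentinel in the gap column */
    }
    transform_with_gap(identity);
    assert(gap_preserved(-1.0f));
}
```

Calling `demo()` from a test program exercises the loop. Any vectorized version of `transform_with_gap` has to keep the sentinel in column 3 intact, for example by masking the store or by loading the old row and merging it before storing.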

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2012-07-13 Thread rguenth at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 Richard Guenther changed: What|Removed |Added Blocks||53947 --- Comment #7 from Richard Guen

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2011-05-22 Thread steven at gcc dot gnu.org
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438 Steven Bosscher changed: What|Removed |Added Last reconfirmed|2006-09-19 07:10:15 |2011-05-22 17:40:15 --- Comment #6 from

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2007-01-06 Thread irar at il dot ibm dot com
--- Comment #5 from irar at il dot ibm dot com 2007-01-07 07:40 --- On the todo list. BTW, vectorization of strided accesses was committed to the mainline 4.3. Ira -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2007-01-04 Thread giovannibajo at libero dot it
--- Comment #4 from giovannibajo at libero dot it 2007-01-05 00:37 --- Thanks Ira. What about store with gaps? -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2006-09-19 Thread irar at il dot ibm dot com
--- Comment #3 from irar at il dot ibm dot com 2006-09-19 07:10 --- > t.c:20: note: not vectorized: mixed data-types > t.c:20: note: can't determine vectorization factor. > > Removing flags[i] = true; Multiple data-types vectorization is already supported in the autovect branch, and the

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2005-09-20 Thread pinskia at gcc dot gnu dot org
--- Additional Comments From pinskia at gcc dot gnu dot org 2005-09-20 17:47 --- t.c:20: note: not vectorized: mixed data-types t.c:20: note: can't determine vectorization factor. Removing flags[i] = true; we get: t.c:20: note: not consecutive access t.c:20: note: not vectorized: compli
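
As a rough illustration of the "mixed data-types" note quoted above, the following sketch shows a loop that stores to a `float` array and a `bool` array in the same iteration. Only the statement `flags[i] = true;` comes from the thread. The arrays `vals` and `flags`, the size `N`, and both function names are hypothetical, and the element sizes in the comments assume a typical target where `float` is 4 bytes and `bool` is 1 byte.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define N 16 /* hypothetical size, chosen only for illustration */

static float vals[N];
static bool flags[N];

/* Two element widths in one loop body: on a typical target a 16-byte
   vector holds 4 floats but 16 bools, so the two stores do not agree
   on how many iterations one vector operation should cover. */
static void scale_and_flag(float k)
{
    for (size_t i = 0; i < N; i++) {
        vals[i] = vals[i] * k;
        flags[i] = true;
    }
}

/* The same loop with the bool store removed: only one data type left. */
static void scale_only(float k)
{
    for (size_t i = 0; i < N; i++)
        vals[i] = vals[i] * k;
}
```

Both functions compute the same `vals`. The second corresponds to the experiment in the comment of dropping `flags[i] = true;` so that only a single element type remains in the loop.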

[Bug tree-optimization/18438] vectorizer failed for vector matrix multiplication

2004-11-11 Thread pinskia at gcc dot gnu dot org
--- Additional Comments From pinskia at gcc dot gnu dot org 2004-11-12 02:43 --- Confirmed, ICC can do this but does not because it is not very efficient to do so. -- What|Removed |Added -