https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #14 from Andrew Pinski ---
(In reply to Maxim Kuvyrkov from comment #12)
> You are making an orthogonal point to this bug report: whether or not to
> vectorize such a loop. But if loop is vectorized, then on any
> microarchitecture
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #13 from Richard Biener ---
(In reply to Maxim Kuvyrkov from comment #9)
> I've looked into another case where inability to handle stores with gaps
> generates sub-optimal code. I'm interested in spending some time on fixing
> this,
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #12 from Maxim Kuvyrkov ---
(In reply to Andrew Pinski from comment #11)
> (In reply to Maxim Kuvyrkov from comment #9)
> > which then becomes for aarch64:
> > .L4:
> > ld2 {v0.2d - v1.2d}, [x1]
> > add w2, w2, 1
> >
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #11 from Andrew Pinski ---
(In reply to Maxim Kuvyrkov from comment #9)
> I've looked into another case where inability to handle stores with gaps
> generates sub-optimal code. I'm interested in spending some time on fixing
> this, p
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #10 from Maxim Kuvyrkov ---
(In reply to Maxim Kuvyrkov from comment #9)
> which then becomes for aarch64:
> .L4:
> ld2 {v0.2d - v1.2d}, [x1]
> add w2, w2, 1
> cmp w2, w7
> eor v0.16b, v2.16b, v
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
Maxim Kuvyrkov changed:
What|Removed |Added
CC||mkuvyrkov at gcc dot gnu.org
--- Commen
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #8 from Richard Biener 2013-03-27
11:27:31 UTC ---
The issue is that we cannot use a vector v4sf store to &opoints[i][0]
as opoints[i][4] is not stored to. Such "masked" store (or "interleaved
store with gaps") is not supporte
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
Richard Guenther changed:
What|Removed |Added
Blocks||53947
--- Comment #7 from Richard Guen
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
Steven Bosscher changed:
What|Removed |Added
Last reconfirmed|2006-09-19 07:10:15 |2011-05-22 17:40:15
--- Comment #6 from
--- Comment #5 from irar at il dot ibm dot com 2007-01-07 07:40 ---
On the todo list.
BTW, vectorization of strided accesses was committed to the mainline 4.3.
Ira
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #4 from giovannibajo at libero dot it 2007-01-05 00:37 ---
Thanks Ira. What about store with gaps?
--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=18438
--- Comment #3 from irar at il dot ibm dot com 2006-09-19 07:10 ---
> t.c:20: note: not vectorized: mixed data-types
> t.c:20: note: can't determine vectorization factor.
>
> Removing flags[i] = true;
Multiple data-types vectorization is already supported in the autovect branch,
and the
--- Additional Comments From pinskia at gcc dot gnu dot org 2005-09-20
17:47 ---
t.c:20: note: not vectorized: mixed data-types
t.c:20: note: can't determine vectorization factor.
Removing flags[i] = true;
we get:
t.c:20: note: not consecutive access
t.c:20: note: not vectorized: compli
--- Additional Comments From pinskia at gcc dot gnu dot org 2004-11-12
02:43 ---
Confirmed, ICC can do this but does not because it is not very inefficient to
do it.
--
What|Removed |Added
-
14 matches
Mail list logo