https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
Sam James changed:
What|Removed |Added
Target Milestone|--- |15.0
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
Hongtao Liu changed:
What|Removed |Added
Status|NEW |RESOLVED
Resolution|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #17 from GCC Commits ---
The master branch has been updated by hongtao Liu :
https://gcc.gnu.org/g:ef27b91b62c3aa8841c02665dffa8914c742fd37
commit r15-919-gef27b91b62c3aa8841c02665dffa8914c742fd37
Author: liuhongt
Date: Tue Feb
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #16 from Hongtao Liu ---
> I'm all for removing the 1/3 for innermost loop handling (in cunroll
> the unrolled loop is then innermost). I'm more concerned about
> unrolling more than one level which is exactly what's required for
>
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #15 from rguenther at suse dot de ---
On Wed, 28 Feb 2024, liuhongt at gcc dot gnu.org wrote:
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
>
> --- Comment #14 from Hongtao Liu ---
> (In reply to rguent...@suse.de from com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #14 from Hongtao Liu ---
(In reply to rguent...@suse.de from comment #13)
> On Tue, 27 Feb 2024, liuhongt at gcc dot gnu.org wrote:
>
> > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
> >
> > --- Comment #11 from Hongtao Liu
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #13 from rguenther at suse dot de ---
On Tue, 27 Feb 2024, liuhongt at gcc dot gnu.org wrote:
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
>
> --- Comment #11 from Hongtao Liu ---
>
> >Loop body is likely going to si
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #12 from rguenther at suse dot de ---
On Tue, 27 Feb 2024, liuhongt at gcc dot gnu.org wrote:
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
>
> --- Comment #10 from Hongtao Liu ---
> (In reply to Hongtao Liu from comment #
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #11 from Hongtao Liu ---
>Loop body is likely going to simplify further, this is difficult
>to guess, we just decrease the result by 1/3. */
>
This is introduced by r0-68074-g91a01f21abfe19
/* Estimate number of insns of
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #10 from Hongtao Liu ---
(In reply to Hongtao Liu from comment #9)
> The original case is a little different from the one in PR.
But the issue is similar, after cunrolli, GCC failed to vectorize the outer
loop.
The interesting thing
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #9 from Hongtao Liu ---
The original case is a little different from the one in PR.
It comes from ggml
#include
#include
typedef uint16_t ggml_fp16_t;
static float table_f32_f16[1 << 16];
inline static float ggml_lookup_fp16_to_
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #8 from CVS Commits ---
The master branch has been updated by hongtao Liu :
https://gcc.gnu.org/g:e5e305e6048c042139037378fe6abfad5735b54f
commit r14-5632-ge5e305e6048c042139037378fe6abfad5735b54f
Author: liuhongt
Date: Fri Nov
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #7 from CVS Commits ---
The master branch has been updated by hongtao Liu :
https://gcc.gnu.org/g:2b59e2b4dff42118fe3a505f07b9a6aa4cf53bdf
commit r14-5603-g2b59e2b4dff42118fe3a505f07b9a6aa4cf53bdf
Author: liuhongt
Date: Thu Nov
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
Andrew Pinski changed:
What|Removed |Added
Depends on||106343
--- Comment #6 from Andrew Pinsk
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
Bug 112325 depends on bug 112579, which changed state.
Bug 112579 Summary: bb vectorizer failed to reduction sum += inv >>
{0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15}
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112579
What|Removed
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #5 from Richard Biener ---
Yes, as I said in comment#2. Note I specifically ended up not open-coding the
reduction because of concerns of efficiency. So a target should only provide
reduc_*_scal patterns when they are more efficien
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
--- Comment #4 from liuhongt at gcc dot gnu.org ---
(In reply to liuhongt from comment #3)
> BB vectorizer relies on the backend support of .REDUC_PLUS for reduction,
> but loop vectorizer can manually do reduction. That's why it's not
> vectoriz
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
liuhongt at gcc dot gnu.org changed:
What|Removed |Added
CC||liuhongt at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112325
Richard Biener changed:
What|Removed |Added
CC||rguenth at gcc dot gnu.org
19 matches
Mail list logo