https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #37 from Jakub Jelinek jakub at gcc dot gnu.org ---
Author: jakub
Date: Sat Mar 14 09:56:39 2015
New Revision: 221435
URL: https://gcc.gnu.org/viewcvs?rev=221435&root=gcc&view=rev
Log:
PR tree-optimization/65369
*
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Jakub Jelinek jakub at gcc dot gnu.org changed:
What|Removed |Added
Status|ASSIGNED|RESOLVED
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Markus Trippelsdorf trippels at gcc dot gnu.org changed:
What|Removed |Added
CC|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #28 from Alan Modra amodra at gmail dot com ---
Created attachment 35024
-- https://gcc.gnu.org/bugzilla/attachment.cgi?id=35024&action=edit
modified testcase without bswap optimization
This modified testcase avoids triggering the
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #30 from Richard Biener rguenth at gcc dot gnu.org ---
(In reply to Alan Modra from comment #28)
Created attachment 35024 [details]
modified testcase without bswap optimization
This modified testcase avoids triggering the bswap
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #33 from Markus Trippelsdorf trippels at gcc dot gnu.org ---
FWIW Alan's new testcase started to fail with r212452.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #34 from Bill Schmidt wschmidt at gcc dot gnu.org ---
(In reply to Markus Trippelsdorf from comment #29)
CCing Bill because this might be a bug in rs6000_analyze_swaps().
Adding -mno-optimize-swaps to Alan's example does not fix the
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #32 from Alan Modra amodra at gmail dot com ---
Richi, ptr+12 is nonsense. Suppose ptr is 16k+1, then first vector loads from
16k and second from the same address since (16k+1+12) & ~15 == 16k. But we
want to end up with 15 bytes
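The masking arithmetic in this comment can be checked with a small C sketch. The helper names `align_down` and `same_block` are hypothetical, and 16k is taken here as 16 * 1024 (any multiple of 16 behaves the same); this only models the address rounding of an aligned 16-byte vector load, not the vectorizer itself.

```c
#include <assert.h>
#include <stdint.h>

/* Round addr down to a multiple of align (a power of two).
   Models the address masking done by an aligned vector load. */
static uintptr_t align_down(uintptr_t addr, uintptr_t align)
{
    assert(align != 0 && (align & (align - 1)) == 0);
    return addr & ~(align - 1);
}

/* Returns 1 if aligned 16-byte loads at ptr and ptr + offset
   would read the same 16-byte block, 0 otherwise. */
static int same_block(uintptr_t ptr, uintptr_t offset)
{
    return align_down(ptr, 16) == align_down(ptr + offset, 16);
}
```

With ptr = 16k+1, `same_block(ptr, 12)` is true, which is the degenerate case described: both loads come from 16k. An offset of 15 (used here only for comparison, not taken from the thread) would reach the next block.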
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Alan Modra amodra at gmail dot com changed:
What|Removed |Added
Attachment #35016|0 |1
is
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #35 from Jakub Jelinek jakub at gcc dot gnu.org ---
Note to self. This is the case dr_explicit_realign: case of vectorizable_load.
(ptr + 12) & -16L - i.e. vs_minus_1 * TYPE_ALIGN_UNIT (element) is reasonable
for the case where the
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Jakub Jelinek jakub at gcc dot gnu.org changed:
What|Removed |Added
Status|NEW |ASSIGNED
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Jakub Jelinek jakub at gcc dot gnu.org changed:
What|Removed |Added
Priority|P3 |P1
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Thomas Preud'homme thopre01 at gcc dot gnu.org changed:
What|Removed |Added
Status|NEW |ASSIGNED
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #11 from Markus Trippelsdorf trippels at gcc dot gnu.org ---
0x1a00 <+1024>: bl 0x15b0 <_init+112>
0x1a04 <+1028>: ld r2,24(r1)
0x1a08 <+1032>: lwz r9,224(r1)
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Thomas Preud'homme thopre01 at gcc dot gnu.org changed:
What|Removed |Added
Last reconfirmed|2015-03-10 00:00:00
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Markus Trippelsdorf trippels at gcc dot gnu.org changed:
What|Removed |Added
Status|WAITING |NEW
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #12 from Thomas Preud'homme thopre01 at gcc dot gnu.org ---
(In reply to Markus Trippelsdorf from comment #11)
0x1a00 <+1024>: bl 0x15b0 <_init+112>
0x1a04 <+1028>: ld r2,24(r1)
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #15 from Markus Trippelsdorf trippels at gcc dot gnu.org ---
(In reply to Alan Modra from comment #13)
Markus, are you sure about comment #9? I completely disabled the bswap pass
and still see a failure of the testcase at -O3.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #14 from Alan Modra amodra at gmail dot com ---
That should be -O3 -fno-inline.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Alan Modra amodra at gmail dot com changed:
What|Removed |Added
Last reconfirmed|2015-03-12 00:00:00 |2015-03-10 0:00
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Thomas Preud'homme thopre01 at gcc dot gnu.org changed:
What|Removed |Added
Status|ASSIGNED|NEW
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #22 from Markus Trippelsdorf trippels at gcc dot gnu.org ---
Could you please attach the unreduced testcase?
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #24 from Martin Sebor msebor at gcc dot gnu.org ---
Created attachment 35020
-- https://gcc.gnu.org/bugzilla/attachment.cgi?id=35020&action=edit
Unreduced translation unit of the test program.
Compile and run like so:
$ gcc -O3
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #20 from Martin Sebor msebor at gcc dot gnu.org ---
The original test doesn't show any valgrind errors so these could be the result
of using delta to reduce the test case.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #19 from Jakub Jelinek jakub at gcc dot gnu.org ---
Yeah. In md4_digest, the for (i = 0; i < 16 - 2; i++) loop writes data[0] through
data[13], data[14] is stored after the loop, but data[15] is uninitialized,
but md5_transform reads it
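The coverage problem described here can be sketched in C by recording which slots get a store. This is a hypothetical model (`DATA_WORDS`, `written_mask` and `is_written` are illustrative names), not the testcase's code; it only mirrors the loop bounds quoted in the comment.

```c
#include <assert.h>
#include <stdint.h>

#define DATA_WORDS 16  /* assumption: the block holds 16 words */

/* Returns a bitmask with bit i set for every data[i] that is written. */
static uint32_t written_mask(void)
{
    uint32_t mask = 0;
    unsigned i;

    for (i = 0; i < DATA_WORDS - 2; i++)
        mask |= UINT32_C(1) << i;            /* loop covers data[0]..data[13] */
    mask |= UINT32_C(1) << (DATA_WORDS - 2); /* data[14] stored after the loop */
    return mask;
}

/* Returns 1 if slot i was written according to mask. */
static int is_written(uint32_t mask, unsigned i)
{
    assert(i < DATA_WORDS);
    return (int)((mask >> i) & 1);
}
```

Slot 15 is the only one left clear, so any read of data[15] sees an indeterminate value.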
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #23 from Jakub Jelinek jakub at gcc dot gnu.org ---
Created attachment 35019
-- https://gcc.gnu.org/bugzilla/attachment.cgi?id=35019&action=edit
pr65369.c
Adjusted testcase that now works reliably at -O0, and even with -O3
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #18 from Markus Trippelsdorf trippels at gcc dot gnu.org ---
trippels@gcc2-power8 ~ % valgrind --track-origins=yes ./a.out
==152914== Memcheck, a memory error detector
==152914== Copyright (C) 2002-2013, and GNU GPL'd, by Julian
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Jakub Jelinek jakub at gcc dot gnu.org changed:
What|Removed |Added
CC||jakub at gcc dot
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #21 from Jakub Jelinek jakub at gcc dot gnu.org ---
Looking at https://git.lysator.liu.se/nettle/nettle/blob/master/md4.c I see
there:
bit_count = (ctx->count << 9) | (ctx->index << 3);
data[MD4_DATA_LENGTH-2] = bit_count;
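Reading the quoted expression as shifts by 9 and 3, it turns a count of full 64-byte blocks plus a byte index into a total bit count. A minimal sketch, assuming a 64-bit count; `md4_bit_count` and `split_words` are hypothetical helpers, and splitting the result into two 32-bit words is an assumption about how the last two data slots would be filled, not something the truncated snippet confirms.

```c
#include <assert.h>
#include <stdint.h>

/* count: number of complete 64-byte blocks processed (512 bits each).
   index: bytes buffered in the current block (8 bits each). */
static uint64_t md4_bit_count(uint64_t count, unsigned index)
{
    assert(index < 64);  /* a partial block holds fewer than 64 bytes */
    return (count << 9) | ((uint64_t)index << 3);
}

/* Split a 64-bit bit count into low and high 32-bit words. */
static void split_words(uint64_t bits, uint32_t *lo, uint32_t *hi)
{
    *lo = (uint32_t)bits;
    *hi = (uint32_t)(bits >> 32);
}
```

For example, 2 full blocks and 5 buffered bytes give 2 * 512 + 5 * 8 = 1064 bits.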
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #26 from Jakub Jelinek jakub at gcc dot gnu.org ---
So, on my version of the testcase with r210843 -O3 -mcpu=power8 there are like
49
32 bit load in host endianness found at: _105 = MEM[(const unsigned char
*)load_src_25];
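For context, the "32 bit load in host endianness" message refers to source that assembles a word from individual bytes. A sketch of that kind of pattern follows; `load_le32` is a hypothetical name and this is not the testcase's actual code.

```c
#include <assert.h>
#include <stdint.h>

/* Assemble a 32-bit value from four bytes, least significant byte first.
   On a little-endian host this is equivalent to a plain 32-bit load,
   which is the kind of pattern a bswap-style pass can recognize. */
static uint32_t load_le32(const unsigned char *p)
{
    assert(p != 0);
    return (uint32_t)p[0]
         | ((uint32_t)p[1] << 8)
         | ((uint32_t)p[2] << 16)
         | ((uint32_t)p[3] << 24);
}
```

The function returns the same value on any host; only the machine code the compiler can legally substitute depends on endianness.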
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #17 from Jakub Jelinek jakub at gcc dot gnu.org ---
I've tried to add
__builtin_memset(ctx, 0, sizeof (*ctx));
to the beginning of md4_init and it changes the result of the program at -O0,
so I expect the testcase relies on some
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #25 from Markus Trippelsdorf trippels at gcc dot gnu.org ---
Jakub's reduced- and the unreduced testcase also started to fail
with r210843.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #7 from Martin Sebor msebor at gcc dot gnu.org ---
The cause of the failing tests observed on RHEL 7.1 is in the second definition
of nettle's HAVE_NATIVE_64_BIT configuration macro:
$ grep HAVE_NATIVE_64_BIT config.*
config.h:#
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #8 from Martin Sebor msebor at gcc dot gnu.org ---
Created attachment 35016
-- https://gcc.gnu.org/bugzilla/attachment.cgi?id=35016&action=edit
Test case for nettle md4 test failure.
The attached test case reduced from Nettle 3.0
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #5 from Martin Sebor msebor at gcc dot gnu.org ---
While I haven't isolated it yet I suspect a bug in nettle and not one in gcc,
for at least three reasons:
First, the failures are insensitive to optimization levels. Second, the
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #6 from Matthias Klose doko at gcc dot gnu.org ---
I see this with -O3, not -O2. Working to get a reduced test case.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Jeffrey A. Law law at redhat dot com changed:
What|Removed |Added
CC||law at redhat dot
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #4 from Martin Sebor msebor at gcc dot gnu.org ---
I've downloaded nettle-3.0 from http://ftp.gnu.org/gnu/nettle, built it with
the default options (-O2) with last week's trunk (5.0.0 20150303) and the
system GCC 4.8.3 on a ppc64le
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
Richard Biener rguenth at gcc dot gnu.org changed:
What|Removed |Added
Status|UNCONFIRMED |WAITING
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65369
--- Comment #1 from Andrew Pinski pinskia at gcc dot gnu.org ---
it's enough to build the md4.c file with -O2.
I know there is an endian test in md4.c file to speed up md4. Maybe the wrong
one is being selected.