George,

Can you point me to an other project that uses 128 bits atomics ?

In my tests, i noticed that the volatile keyword is (one of) the trigger of the 
compiler bug.
At this stage, i could not see anything wrong in ompi, plus this is working 
fine with recent gcc and icc, so i concluded this is an icc bug, that is now 
fixed, so all ompi can do is hide the symptom.

Cheers,

Gilles


George Bosilca <bosi...@icl.utk.edu> wrote:
>My feeling is that the current patch hide the symptoms without addressing the 
>real issue.
>
>
>As a side note: The compiler incriminated in this thread, works perfectly for 
>128 bits atomic operations in other projects where I use atomic LIFO & FIFO 
>(but not the one from OMPI as I already raised my concerns about this).
>
>
>  George.
>
>
>PS: Why are there totally non-related comments about FIFO in the opal_lifo.h 
>(starting line 61)?
>
>
>On Wed, Feb 4, 2015 at 11:30 PM, Gilles Gouaillardet 
><gilles.gouaillar...@iferc.org> wrote:
>
>Paul and all,
>
>i just pushed 
>https://github.com/open-mpi/ompi/commit/b42e3441294e9fe787fe8e9ad7403d5b8e465163
>
>when a buggy compiler is detected, configure now forces OPAL_HAVE_CMPXCHG16B=0
>this is enough to make opal_lifo test and make check happy again.
>
>Cheers,
>
>Gilles
>
>
>
>On 2015/02/04 17:26, Gilles Gouaillardet wrote:
>
>Paul, my previous email was misleading. what i really meant is the opal_fifo 
>test works fine with icc 2013u5 (the release before 2013sp1) and icc 2013sp1u2 
>and later so even if the reproducer fails with icc older that 2013sp1u2, that 
>might not impact ompi since for other reasons, the bug is not hit for example, 
>with icc 2013u5, OPAL_HAVE_CMPXCHG16B=0 so ompi stays away from the compiler 
>bug. Cheers, Gilles On 2015/02/04 17:15, Paul Hargrove wrote: 
>
>Giles, Who says only 2 version are effected? I have access to 9 revisions of 
>icc. Using your reduced case I find 7 that fail and only 2 (the latest two) 
>that pass. Discounting icc-12 (which can't compile the test) that makes 6 
>versions effected by the bug (not 2). -Paul $ for x in 12.1.5.339 13.0.0.079 
>13.0.1.117 13.1.2.183 13.1.3.192 14.0.0.080 14.0.1.106 14.0.2.144 15.0.1.133; 
>do module swap intel intel/$x ; echo @ Testing Intel compiler version $x; icc 
>conftest.c && ./a.out && echo PASS ; done @ Testing Intel compiler version 
>12.1.5.339 conftest.c(10): error: identifier "__int128_t" is undefined 
>__int128_t value; ^ compilation aborted for conftest.c (code 2) @ Testing 
>Intel compiler version 13.0.0.079 a.out: conftest.c:36: main: Assertion 
>`a.value == b.value' failed. Aborted @ Testing Intel compiler version 
>13.0.1.117 a.out: conftest.c:36: main: Assertion `a.value == b.value' failed. 
>Aborted @ Testing Intel compiler version 13.1.2.183 a.out: conftest.c:36: 
>main: Assertion `a.value == b.value' failed. Aborted @ Testing Intel compiler 
>version 13.1.3.192 a.out: conftest.c:36: main: Assertion `a.value == b.value' 
>failed. Aborted @ Testing Intel compiler version 14.0.0.080 a.out: 
>conftest.c:36: main: Assertion `a.value == b.value' failed. Aborted @ Testing 
>Intel compiler version 14.0.1.106 a.out: conftest.c:36: main: Assertion 
>`a.value == b.value' failed. Aborted @ Testing Intel compiler version 
>14.0.2.144 PASS @ Testing Intel compiler version 15.0.1.133 PASS On Tue, Feb 
>3, 2015 at 11:45 PM, Gilles Gouaillardet < gilles.gouaillar...@iferc.org> 
>wrote: 
>
>Nathan, imho, this is a compiler bug and only two versions are affected : - 
>intel icc 14.0.0.080 (aka 2013sp1) - intel icc 14.0.1.106 (aka 2013sp1u1) /* 
>note the bug only occurs with -O1 and higher optimization levels */ here is 
>attached a simple reproducer a simple workaround is to configure with 
>ac_cv_type___int128=0 Cheers, Gilles On 2015/02/04 4:17, Nathan Hjelm wrote: 
>Thats the second report involving icc 14. I will dig into this later this 
>week. -Nathan On Mon, Feb 02, 2015 at 11:03:41PM -0800, Paul Hargrove wrote: I 
>have seen opal_fifo hang on 2 distinct systems + Linux/ppc32 with xlc-11.1 + 
>Linux/x86-64 with icc-14.0.1.106 I have no explanation to offer for either 
>hang. No "weird" configure options were passed to either. -Paul -- Paul H. 
>Hargrove phhargr...@lbl.gov Computer Languages & Systems Software (CLaSS) 
>Group Computer Science Department Tel: +1-510-495-2352 Lawrence Berkeley 
>National Laboratory Fax: +1-510-486-6900 
>_______________________________________________ devel mailing 
>listde...@open-mpi.org Subscription: 
>http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: 
>http://www.open-mpi.org/community/lists/devel/2015/02/16911.php 
>_______________________________________________ devel mailing 
>listde...@open-mpi.org Subscription: 
>http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: 
>http://www.open-mpi.org/community/lists/devel/2015/02/16920.php 
>_______________________________________________ devel mailing list 
>de...@open-mpi.org Subscription: 
>http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: 
>http://www.open-mpi.org/community/lists/devel/2015/02/16921.php 
>
>_______________________________________________ devel mailing list 
>de...@open-mpi.org Subscription: 
>http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: 
>http://www.open-mpi.org/community/lists/devel/2015/02/16922.php 
>
>
>
>_______________________________________________ devel mailing list 
>de...@open-mpi.org Subscription: 
>http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
>Link to this post: 
>http://www.open-mpi.org/community/lists/devel/2015/02/16923.php 
>
>
>
>_______________________________________________
>devel mailing list
>de...@open-mpi.org
>Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>Link to this post: 
>http://www.open-mpi.org/community/lists/devel/2015/02/16926.php
>
>

Reply via email to