[OMPI devel] openMPI 1.4.2: mpi_write fails on NFSv3 crossmounts

2010-08-24 Thread Oliver Deppert
General informations: 3 node Opteron cluster, 24CPUs, Melanox Infiniband 10Gb interconnect Debian Lenny 5.0 self build kernel from kernel.org: 2.6.32.12, all NFS functions available from kernel side self build NFS-utils 1.2.2 from debian source of sid: nfs-ker

Re: [OMPI devel] 1.5rc5 has been posted

2010-08-24 Thread Paul H. Hargrove
I have access to PGI compilers 10.{3,4,5,8} on an Opteron/InifiniBand cluster running Scientific Linux 5.4 (an RHEL derivative like the Centos distro). I can try to reproduce Larry Baker's problems today with the most recent 10.8 compiler. If there are other things I could/should be trying to

[OMPI devel] Refactoring opal if.c

2010-08-24 Thread Ralph Castain
Per the discussion on today's telecon, I've started working with Jeff on refactoring the opal/util/if.c code into something more understandable without changing the discovery logic. We are creating a framework that solely performs interface discovery, leaving all of the interface wrapper functio

Re: [OMPI devel] Refactoring opal if.c

2010-08-24 Thread Jeff Squyres
More specifically: if.h has not been changed (except for its finalize function). So all this change does it un-spaghettify the if.c code. From an interface perspective, the rest of the code base isn't even aware that this change occurred. Also, I think Ralph meant the following URL: https

[OMPI devel] A different 1.5rc5 build failure w/ PGI compilers

2010-08-24 Thread Paul H. Hargrove
I went to try to reproduce the problem Larry Baker reported w/ the PGI 10.3 compilers and undefined references in libopen-pal.so when linking opal_wrapper. Instead I encountered a different error that does not (as best I can tell) correspond to any of the issues Larry has reported recently. O

[OMPI devel] 1.4.3rc1 out

2010-08-24 Thread Ralph Castain
Okay, release candidate 1 for 1.4.3 is now available on the web site. Please give it a whirl. http://www.open-mpi.org/software/ompi/v1.4/ Ralph

Re: [OMPI devel] 1.4.3rc1 out

2010-08-24 Thread Paul H. Hargrove
With PGI-10.5 I see the same 4 warnings in atomic.h that Jeff reported in June with PGI-7.0.7 http://www.open-mpi.org/community/lists/devel/2010/06/8060.php Other than that, I can report a clean build w/ PGI-10.5. I will try 10.8 next, but will only report to the list only if there are prob

[OMPI devel] 1.5rc5 build failure with icc-11.1

2010-08-24 Thread Paul H. Hargrove
Trying to build 1.5rc5 with icc-11.1.046 is failing in otfprofile.cpp - the same file I saw problems with when using the PGI compilers. $ cat /etc/SuSE-release SUSE Linux Enterprise Server 10 (x86_64) VERSION = 10 PATCHLEVEL = 2 $ uname -a Linux jaguar-ext3 2.6.16.60-0.42.7-smp #1 SMP Tue Nov

Re: [OMPI devel] 1.4.3rc1 out

2010-08-24 Thread Paul H. Hargrove
Building w/ icc-11.1.046 on x86-64 works with some warnings. In addition to the numerous "188" warnings and the single instance of "279" that I reported seeing for 1.5rc5 (http://www.open-mpi.org/community/lists/devel/2010/08/8320.php), there is one additional warning: libtool: compile: icc -

Re: [OMPI devel] 1.4.3rc1 out

2010-08-24 Thread Paul H. Hargrove
Testing on Sun C 5.10 on OpenSolaris yield many warnings (below), but does build to completion. I have run a successful "make check", making me doubt that any of the atomic.h warnings indicate real problems. $ uname -a SunOS osol-x64 5.11 snv_111b i86pc i386 i86pc $ cc -V cc: Sun C 5.10 SunOS_

[OMPI devel] 1.5rc5 - warnings from Sun C 5.10

2010-08-24 Thread Paul H. Hargrove
In addition to the atomic.h and Anachronism warnings seen w/ 1.4.3rc1 (http://www.open-mpi.org/community/lists/devel/2010/08/8322.php), I find some "new" warnings in 1.5rc5. $ uname -a SunOS osol-x64 5.11 snv_111b i86pc i386 i86pc $ cc -V cc: Sun C 5.10 SunOS_i386 2009/06/03 usage: cc [ option

Re: [OMPI devel] A different 1.5rc5 build failure w/ PGI compilers

2010-08-24 Thread Jeff Squyres
Thanks Paul -- reported in #2544. On Aug 24, 2010, at 6:07 PM, Paul H. Hargrove wrote: > I went to try to reproduce the problem Larry Baker reported w/ the PGI 10.3 > compilers and undefined references in libopen-pal.so when linking > opal_wrapper. Instead I encountered a different error that

Re: [OMPI devel] 1.5rc5 build failure with icc-11.1

2010-08-24 Thread Jeff Squyres
Also reported in #2544. Thanks! On Aug 24, 2010, at 7:14 PM, Paul H. Hargrove wrote: > Trying to build 1.5rc5 with icc-11.1.046 is failing in otfprofile.cpp - the > same file I saw problems with when using the PGI compilers. > > $ cat /etc/SuSE-release > SUSE Linux Enterprise Server 10 (x86_64

[OMPI devel] 1.5rc5 warnings from gcc on OpenSolaris

2010-08-24 Thread Paul H. Hargrove
A build of 1.5rc5 with gcc on OpenSolaris/x86 agrees with my Sun C build (http://www.open-mpi.org/community/lists/devel/2010/08/8323.php) about type sloppiness in common_sm_mmap.c. $ uname -a SunOS osol-x64 5.11 snv_111b i86pc i386 i86pc $ gcc --version gcc (GCC) 3.4.3 (csl-sol210-3_4-20050802

Re: [OMPI devel] 1.4.3rc1 out

2010-08-24 Thread George Bosilca
This indicates that two contiguous C long double types are not identical to a Fortran COMPLEX32 type. Paul, if you still have the config.log and opal_config.h I would like to take a look at them. Thanks, george. On Aug 24, 2010, at 19:38 , Paul H. Hargrove wrote: > Building w/ icc-11.1.046

Re: [OMPI devel] 1.5rc5 - warnings from Sun C 5.10

2010-08-24 Thread George Bosilca
On Aug 24, 2010, at 20:40 , Paul H. Hargrove wrote: > "../../../openmpi-1.5rc5/opal/include/opal/sys/ia32/atomic.h", line 170: > warning: impossible constraint for "%1" asm operand __asm__ __volatile__( SMPLOCK "addl %1,%0" :"=m" (*v)

[OMPI devel] 1.5rc5 build failure w/ icc-10.1.008 for ia64

2010-08-24 Thread Paul H. Hargrove
An attempt to compile 1.5rc5 on Linux/IA64 w/ icc-10.0.008 failed: $ cat /etc/SuSE-release SUSE Linux Enterprise Server 10 (ia64) VERSION = 10 PATCHLEVEL = 1 $ cat /etc/sgi-release SGI ProPack 5SP4 for Linux, Build 504r4-0801072302 $ uname -a Linux tg-login3.pople.psc.teragrid.org 2.6.16.60-0.

Re: [OMPI devel] 1.5rc5 warnings from gcc on OpenSolaris

2010-08-24 Thread George Bosilca
> ../../../../../openmpi-1.5rc5/ompi/mca/common/sm/common_sm_mmap.c:201: > warning: assignment from incompatible pointer type > ../../../../../openmpi-1.5rc5/ompi/mca/common/sm/common_sm_mmap.c:207: > warning: assignment from incompatible pointer type This belongs to an assignment to the iov_ba

Re: [OMPI devel] 1.4.3rc1 out

2010-08-24 Thread Paul H. Hargrove
My attempt to build 1.4.3rc1 w/ icc-10.1.008 is failing. Everything (except the tmpfile name in the error message, of course) is identical to what I recently reported for 1.5rc5: http://www.open-mpi.org/community/lists/devel/2010/08/8329.php -Paul Ralph Castain wrote: Okay, release candida

Re: [OMPI devel] 1.4.3rc1 out

2010-08-24 Thread George Bosilca
Paul, Thanks for the files. It appears that our configure script detected that a Fortran REAL*16 bit representation differs from a C long double. This is a pretty weird scenario, where a Fortran type cannot be represented in C. As we only support C MPI_Op there is no "simple" way we can support

Re: [OMPI devel] 1.4.3rc1 out

2010-08-24 Thread Paul H. Hargrove
I suspect that the FORTRAN REAL*16 is a 16-byte floating point type while C's "long double" may just be the x86 10-byte type. I'd not be surprised if icc has some option to get 16-byte long double. I don't have time to hunt for such an option, but if somebody know the required compiler option

[OMPI devel] README nit pick (both 1.5rc5 and 1.4.3rc1)

2010-08-24 Thread Paul H. Hargrove
I see the following in the Compiler Notes of the Open MPI README (both 1.5rc5 and 1.4.3rc1) - Open MPI does not support the Sparc v8 CPU target, which is the default on Sun Solaris. The v8plus (32 bit) or v9 (64 bit) targets must be used to build Open MPI on Solaris. This can be done by i

[OMPI devel] Success on FreeBSD-8.0 (both 1.5rc5 and 1.4.3rc1)

2010-08-24 Thread Paul H. Hargrove
By this point you all must either love or hate me, but I still have more platforms to test... This time it is FreeBSD-8.0 on an i386, and I can report that both RCs are successful. However, the build of 1.4.3rc1 produces MANY instances of the following: In file included from [SOMEFILE:SOMELI

Re: [OMPI devel] 1.5rc5 - warnings from Sun C 5.10

2010-08-24 Thread Paul H. Hargrove
Following George's advice, I made the following change (NOT yet conditional on Sun compiler), and can confirm that it eliminates the warnings in atomic.h and "make check" still passes. -Paul --- openmpi-1.5rc5/opal/include/opal/sys/ia32/atomic.h.orig Tue Aug 24 18:40:55 2010 +++ openmpi-1

[OMPI devel] Gcc warnings in datatype test code in 1.5rc5

2010-08-24 Thread Paul H. Hargrove
The following was seen in a "make check" with 1.5rc5 on FreeBSD-8.0/i386 w/ gcc-4.2.1, but I don't think that these warnings are platform specific. These warnings do not occur with 1.4.3rc1 I was vaguely recalling that gcc-4.2.1 might be one of the versions that generates incorrect "discards

[OMPI devel] mixed OpenBSD 4.6/i386 results: 1.5rc5 GOOD but 1.4.3rc1 BAD

2010-08-24 Thread Paul H. Hargrove
I have not sifted through logs for warnings, but can report that 1.5rc5 builds and passes "gmake check" w/ gcc-3.3.5 on OpenBSD 4.6 on an i386. HOWEVER, 1.4.3rc1 fails to build due to errors in ROMIO: $ uname -a OpenBSD obsd46-i386.my.domain 4.6 GENERIC#58 i386 $ gcc --version gcc (GCC) 3.3.5

[OMPI devel] OpenBSD-4.6/i386 warns malloc.h is obsolete for 1.4.3rc1 and 1.5rc5

2010-08-24 Thread Paul H. Hargrove
Building both of the new RCs on OpenBSD 4.6 for i386 is warning that malloc.h is obsolete. The files/lines producing the warnings are nearly the same, but I provide both lists below for completeness. $ uname -a OpenBSD obsd46-i386.my.domain 4.6 GENERIC#58 i386 $ gcc --version gcc (GCC) 3.3.5

[OMPI devel] Success (with warnings) on NetBSD 5.0.2/i386

2010-08-24 Thread Paul H. Hargrove
Both 1.5rc5 and 1.4.3rc1 build and pass "gmake check" on my NetBSD/i386 platform. However, for 1.4.3rc1 (but NOT for 1.5rc5) I see numerous instances of In file included from [SOMEFILE:SOMELINE]: ../../opal/include/opal/sys/cache.h:33:1: warning: "CACHE_LINE_SIZE" redefined In file included fr