Re: [OMPI devel] 1.8.4rc Status

2014-12-18 Thread Paul Hargrove
On Wed, Dec 17, 2014 at 7:17 PM, Paul Hargrove wrote: > > I am going to run the nightly on other configs on both my > Solaris-11/x86-64 and Solaris-10/SPARC systems. > I just want to be sure some other compile/abi/arch combination didn't get > broken by accident. > I will post my results to the li

Re: [OMPI devel] 1.8.4rc Status

2014-12-17 Thread Ralph Castain
Thanks Paul! Sorry I was out all day - stuck in meetings, I fear. On Wed, Dec 17, 2014 at 7:17 PM, Paul Hargrove wrote: > > Short version: > > v1.8 nightly (v1.8.3-313-g54c80c2) PASSED my testing. > > In full: > > I gave openmpi-v1.8.3-313-g54c80c2 a try. > In this test I did not add -D_REENTRAN

Re: [OMPI devel] 1.8.4rc Status

2014-12-17 Thread Paul Hargrove
Short version: v1.8 nightly (v1.8.3-313-g54c80c2) PASSED my testing. In full: I gave openmpi-v1.8.3-313-g54c80c2 a try. In this test I did not add -D_REENTRANT or -mt to any flags at configure time. In addition to --prefix, I passed the following: --enable-debug --with-verbs \ CC=cc CXX=CC FC=f

Re: [OMPI devel] 1.8.4rc Status

2014-12-17 Thread Jeff Squyres (jsquyres)
Paul -- The __sun macro check is now in the OMPI 1.8 tree, and is in the latest nightly tarball. If I'm following this thread right -- and I might not be! -- I think Gilles is saying that now that the __sun check is in, it should fix this -mt/-D_REENTRANT/whatever problem. Can you confirm?

Re: [OMPI devel] 1.8.4rc Status

2014-12-17 Thread Tom Wurgler
ared-intel" \ --enable-mpi-ext Can you see anything that should or shouldn't be there? Thanks From: devel on behalf of Ralph Castain Sent: Monday, December 15, 2014 10:07 PM To: Open MPI Developers Subject: Re: [OMPI devel] 1.8.4rc Status My

Re: [OMPI devel] 1.8.4rc Status

2014-12-16 Thread Paul Hargrove
Gilles, I am running mpirun on a host that ALSO will run one of the application processes. Requested ifconfig and netstat outputs appear below. -Paul [phargrov@pcp-j-20 ~]$ ifconfig -a lo0: flags=2001000849 mtu 8232 index 1 inet 127.0.0.1 netmask ff00 bge0: flags=1004843 mtu 1500 ind

Re: [OMPI devel] 1.8.4rc Status

2014-12-16 Thread Gilles Gouaillardet
Paul, could you please send the output of "ifconfig -a" and "netstat -nr" on the three hosts you are using (i assume you are still invoking mpirun from one node, and tasks are running on two other nodes) Cheers, Gilles On 2014/12/16 16:00, Paul Hargrove wrote: > Gilles, > > I looked again carefully an

Re: [OMPI devel] 1.8.4rc Status

2014-12-16 Thread Gilles Gouaillardet
Paul, the root cause of -D_REENTRANT not being set automatically is that we test the __sun__ macro, and the 12.4 compiler defines only __sun and sun. I will make a fix for that ... Cheers, Gilles On 2014/12/16 16:00, Paul Hargrove wrote: > Gilles, > > I looked again carefully and I am *NOT* finding -D_REENTRAN
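
A minimal C sketch of the macro issue Gilles describes: a test for __sun__ alone misses a compiler that predefines only __sun and sun, so the check below accepts any of the three spellings. This is an illustration only, not the actual Open MPI fix (which is not shown in this thread); EXAMPLE_ON_SOLARIS and example_on_solaris() are hypothetical names.

```c
#include <stdbool.h>

/* Accept any of the Solaris predefined macro spellings.
 * Per the message above, the Studio 12.4 compiler defines only
 * __sun and sun, so testing __sun__ by itself does not match. */
#if defined(__sun__) || defined(__sun) || defined(sun)
#define EXAMPLE_ON_SOLARIS 1
#else
#define EXAMPLE_ON_SOLARIS 0
#endif

/* Hypothetical helper, not part of Open MPI: reports whether the
 * compile-time check above matched. */
static bool example_on_solaris(void)
{
    return EXAMPLE_ON_SOLARIS != 0;
}
```

On a non-Solaris build the helper returns false; on Solaris it should return true with any compiler that defines at least one of the three macros.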

Re: [OMPI devel] 1.8.4rc Status

2014-12-16 Thread Paul Hargrove
Gilles, I looked again carefully and I am *NOT* finding -D_REENTRANT passed to most compilations. It appears to be used for building libevent and vt, but nothing else. The output from configure contains checking if more special flags are required for pthreads... -D_REENTRANT only in the libevent

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
Gilles, I am NOT seeing the problem with gcc. It is only occurring with the Studio compilers. As I've already reported, I have tried adding either "-mt" or "-mt=yes" to both LDFLAGS and --with-wrapper-ldflags. The "cc" manpage (on the Solaris-10 system I can get to right now) says: -mt Co

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Gilles Gouaillardet
Paul, did you manually set -mt? if i remember correctly, solaris 11 (at least with gcc compilers) does not need any flags (except the -D_REENTRANT that is added automatically) Cheers, Gilles On 2014/12/16 12:10, Paul Hargrove wrote: > Gilles, > > I will try the patch when I can. > However, our

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
Gilles, I will try the patch when I can. However, our network is undergoing maintenance right now, leaving me unable to reach the necessary hosts. As for -D_REENTRANT, I had already reported having verified in the "make" output that it had been added automatically. Additionally, the docs

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Ralph Castain
>> More testing on-going >> >> >> -- >> *From:* devel on behalf of Tom Wurgler < >> twu...@goodyear.com> >> *Sent:* Monday, December 15, 2014 1:23 PM >> >> *To:* Open MPI Developers >> *Subject:* Re: [

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Gilles Gouaillardet
Paul, could you please make sure configure added "-D_REENTRANT" to the CFLAGS ? /* otherwise, errno is a global variable instead of a per thread variable, which can explain some weird behaviour. note this should already have been fixed */ assuming -D_REENTRANT is set, could you please give the
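
A small C sketch related to the errno remark above, assuming a POSIX system. An error report is only meaningful if errno is read from the right place (per-thread in a threaded program, which per Gilles requires -D_REENTRANT on this platform) and read before any later call can overwrite it. The helper name failing_call_errno() is hypothetical and is not Open MPI code.

```c
#include <errno.h>
#include <unistd.h>

/* Hypothetical helper: make a call that is guaranteed to fail and
 * return the errno value it produced, or 0 if it unexpectedly
 * succeeded. close(-1) fails with EBADF on POSIX systems. */
static int failing_call_errno(void)
{
    errno = 0;                 /* start from a known state */
    if (close(-1) == -1) {
        int saved = errno;     /* capture immediately, before other calls */
        return saved;
    }
    return 0;
}
```

A report such as "accept() failed: Error 0 (0)" is what a caller would print if the value it read was still 0 even though the call failed, which is consistent with the errno visibility problem suspected in this thread.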

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
I have tried with an oob_tcp_if_include setting so that there is now only 1 interface. Even with just one interface and -mt=yes in both LDFLAGS and wrapper-ldflags I am *still* getting messages like [pcp-j-20:11470] mca_oob_tcp_accept: accept() failed: Error 0 (0).

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Ralph Castain
> > > -- > *From:* devel on behalf of Tom Wurgler < > twu...@goodyear.com> > *Sent:* Monday, December 15, 2014 1:23 PM > > *To:* Open MPI Developers > *Subject:* Re: [OMPI devel] 1.8.4rc Status > > > It seems to be working in rc2

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
A little more reading finds that... The docs say that one needs "-mt" without the "=yes". That will work for both old and new compilers, whereas "-mt=yes" chokes older ones. Also, the man pages say "-mt" must come before "-lpthread" in the link command. -Paul On Mon, Dec 15, 2014 at 12:52 PM, Paul Hargr

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Tom Wurgler
this is still rc2 More testing on-going From: devel on behalf of Tom Wurgler Sent: Monday, December 15, 2014 1:23 PM To: Open MPI Developers Subject: Re: [OMPI devel] 1.8.4rc Status It seems to be working in rc2 after all. I was still trying to

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
On Mon, Dec 15, 2014 at 5:35 AM, Ralph Castain wrote: > > 7. Linkage issue on Solaris-11 reported by Paul Hargrove. Missing the > multi-threaded C libraries, apparently need "-mt=yes" in both compile and > link. Need someone to investigate. The lack of multi-thread libraries is my SPECULATION.

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Tom Wurgler
It seems to be working in rc2 after all. I was still trying to use a rankfile, but it appears that is no longer needed. Thanks! From: devel on behalf of Ralph Castain Sent: Monday, December 15, 2014 8:45 AM To: Open MPI Developers Subject: Re: [OMPI devel

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Ralph Castain
Should be there in rc4, and I thought it made it to rc2 for that matter. I'll take a gander. FWIW: I'm working off-list with IBM to tighten the LSF integration so we correctly read and follow their binding directives. This will also be in 1.8.4 as we are in final test with it now. Ralph On Mon,

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Tom Wurgler
Forgive me if I've missed it, but I believe using physical OR logical core numbering was going to be reimplemented in the 1.8.4 series. I've checked out rc2 and as far as I can tell, it isn't there as yet. Is this correct? thanks! From: devel on behalf o