Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
I decided just to disable the carver/pgi mtt runs. 2015-06-29 15:10 GMT-06:00 Ralph Castain : > Very strange then - again, can you run it with the verbose flag and send > me the output? I can't replicate what you are seeing. > > > On Mon, Jun 29, 2015 at 4:05 PM, Howard Pritchard > wrote: > >>

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Ralph Castain
Very strange then - again, can you run it with the verbose flag and send me the output? I can't replicate what you are seeing. On Mon, Jun 29, 2015 at 4:05 PM, Howard Pritchard wrote: > ibm dataplex and laki ~= cray. nothing to do with cray. > Cray runs fine since I use aprun there. > > > 2015

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
ibm dataplex and laki ~= cray. nothing to do with cray. Cray runs fine since I use aprun there. 2015-06-29 13:54 GMT-06:00 Ralph Castain : > Hmmm...is this some Cray weirdness? I checked the code and it looks right, > and it runs correctly for me on both Mac and Linux. All it is doing is > call

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Ralph Castain
Hmmm...is this some Cray weirdness? I checked the code and it looks right, and it runs correctly for me on both Mac and Linux. All it is doing is calling "setenv", so I'm wondering if there is something environ-specific going on here? I added some debug in case that might help - can you run it on

Re: [OMPI devel] the bug in btl_openib_connect_sl.c

2015-06-29 Thread Steve Wise
I'm not familiar at all with this IB code, but the supplied patch seems to post the recv wr only once when the UD QP is created. And get_pathrecord_info() seems to have logic to retry querying path records, and if it does, there won't be a recv posted after the first recv completes. So it seems

Re: [OMPI devel] the bug in btl_openib_connect_sl.c

2015-06-29 Thread Jeff Squyres (jsquyres)
Nathan / Steve -- Can you comment? > On Jun 26, 2015, at 5:13 AM, Алексей Рыжих wrote: > > Hi everybody, > I tried the functionality for 3D-torus cluster topology support and > encountered the bug with error message like below: > > srvmpisnb02][[9011,1],3][ompi/mca/btl/openib/connect/bt

Re: [OMPI devel] Pruning from the 2.x branch

2015-06-29 Thread Jeff Squyres (jsquyres)
Thanks Edgar. I removed it from v2.x. Let us know if you want to bring it back. > On Jun 27, 2015, at 2:03 AM, Edgar Gabriel wrote: > > The ompi sharedfp addproc component should probably also be excluded > > Thanks > Edgar > > On 6/25/2015 5:53 PM, Jeff Squyres (jsquyres) wrote: >> We have

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Jeff Squyres (jsquyres)
Ahh... it's OMP_PROC_BIND, not OMPI_PROC_BIND. Yes, Ralph just added this. I chatted with him about this on the phone moments ago; he's pretty sure he knows where to go look to find the problem. > On Jun 29, 2015, at 12:00 PM, Howard Pritchard wrote: > > laki is also showing the errors: > >
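For reference, OMP_PROC_BIND is the OpenMP environment variable, and the OpenMP spec accepts either true/false or a comma-separated list of master, close, and spread (primary was added in OpenMP 5.1). Values are case-insensitive and may carry surrounding whitespace. Below is a minimal sketch of a check for that grammar. The helper name is hypothetical, and this is not the code Open MPI or any OpenMP runtime uses. It only shows what an "invalid value" would look like.

```python
import os

# Hypothetical helper for illustration only; not taken from Open MPI
# or from any OpenMP runtime.
_LIST_VALUES = {"master", "primary", "close", "spread"}


def is_valid_omp_proc_bind(value: str) -> bool:
    """Return True if value matches the OMP_PROC_BIND grammar in the OpenMP spec."""
    text = value.strip().lower()
    # The scalar forms stand alone; they cannot appear inside a list.
    if text in ("true", "false"):
        return True
    # Otherwise every comma-separated entry must be a known binding policy.
    parts = [p.strip() for p in text.split(",")]
    return all(p in _LIST_VALUES for p in parts)


if __name__ == "__main__":
    current = os.environ.get("OMP_PROC_BIND")
    if current is None:
        print("OMP_PROC_BIND is not set")
    else:
        print(f"OMP_PROC_BIND={current!r} valid: {is_valid_omp_proc_bind(current)}")
```

Running this in the same environment as the failing job might show whether the value reaching the application is one the OpenMP runtime would reject.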

Re: [OMPI devel] [OMPI users] simple mpi hello world segfaults when coll ml not disabled

2015-06-29 Thread Devendar Bureddy
Fixed this issue in HCOLL by renaming the conflicting symbols. The repro case works fine after this. I also explored the -Bsymbolic linker option, but it does not seem safe to use. -Devendar From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Thursday, June 25, 2015 9:31 PM To:

Re: [OMPI devel] MPI_Buffer_detach fortran binding

2015-06-29 Thread Jeff Squyres (jsquyres)
Good catch. The type was wrong, too -- it should be C_PTR. I pushed a fix to master and PRs to v1.10 and v2.x -- can you review? > On Jun 29, 2015, at 10:44 AM, Gilles Gouaillardet > wrote: > > Jeff, > > the first argument of MPI_Buffer_detach is >OMPI_FORTRAN_IGNORE_TKR_TYPE, INTENT(I

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
laki is also showing the errors: Here's the shortened url: http://goo.gl/Ra264U looks like the badness started with the latest nightly. I think there was some activity in the orte binding area recently. Howard 2015-06-29 9:52 GMT-06:00 Jeff Squyres (jsquyres) : > Can you provide an MTT sh

Re: [OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Jeff Squyres (jsquyres)
Can you provide an MTT short URL to show the results? Or, if the MTT results are not on the community reporter, can you show a bit more context in the output? > On Jun 29, 2015, at 11:47 AM, Howard Pritchard wrote: > > Hi Folks, > > I'm seeing an error I've not seen before in the MTT runs on

[OMPI devel] OMPI_PROC_BIND value is invalid errors

2015-06-29 Thread Howard Pritchard
Hi Folks, I'm seeing an error I've not seen before in the MTT runs on the ibm dataplex at NERSC. The mpirun-launched jobs are failing with OMPI_PROC_BIND value is invalid errors. This is for the trivial ring tests. Is anyone else seeing these types of errors? Howard

[OMPI devel] MPI_Buffer_detach fortran binding

2015-06-29 Thread Gilles Gouaillardet
Jeff, the first argument of MPI_Buffer_detach is OMPI_FORTRAN_IGNORE_TKR_TYPE, INTENT(IN) :: buffer_addr from use-mpi-f08. However, the standard states this is TYPE(C_PTR), INTENT(OUT) (and yes, this is very counterintuitive ... at first glance only). Can you please confirm this is an Open MPI bu