Re: [OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Christopher Samuel
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On 21/09/13 05:49, Sylvestre Ledru wrote: > Does it ring a bell to anyone ? Possibly, if you run the binary without mpirun does it do the same? If so, could you try and run it with strace -f and see if you see repeating SEGV's? cheers! Chris - --

[OMPI devel] nightly builds failed

2013-09-20 Thread Jeff Squyres (jsquyres)
IU moved the nightly build cron jobs to a new machine today, and they failed. I'm manually running the build cron jobs on the old build machine (eddie) right now. I've alerted IU to what I think the error was in the move; hopefully they'll be able to fix it over the weekend. -- Jeff Squyres

Re: [OMPI devel] Problem with multiple identical entries in ~/.openmpi/mca-params.conf

2013-09-20 Thread Nathan Hjelm
I will look into this. -Nathan On Fri, Sep 20, 2013 at 05:49:41PM -0400, Vallee, Geoffroy R. wrote: > Hi, > > I found a very unexpected behavior with r29217: > > % cat ~/.openmpi/mca-params.conf > #pml_base_verbose=0 > pml_base_verbose=0 > > % mpicc -o helloworld helloworld.c > > Then if i up

Re: [OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Ralph Castain
Occurs to me - I bet you didn't configure this with --enable-debug, did you? If not, please reconfigure it and rerun so we can see the debug output On Sep 20, 2013, at 2:54 PM, Sylvestre Ledru wrote: > On 20/09/2013 23:46, Ralph Castain wrote: >> That's it?? Wow, that was useless. > Isn't it ?

Re: [OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Sylvestre Ledru
On 20/09/2013 23:46, Ralph Castain wrote: > That's it?? Wow, that was useless. Isn't it ? ;) It is why I asked for your help... > Can you attach to mpirun with gdb and tell me where it is sitting? > It is about as useful at the previous command: http://paste.debian.net/43882/ Built with: $ mpicc

[OMPI devel] Problem with multiple identical entries in ~/.openmpi/mca-params.conf

2013-09-20 Thread Vallee, Geoffroy R.
Hi, I found a very unexpected behavior with r29217: % cat ~/.openmpi/mca-params.conf #pml_base_verbose=0 pml_base_verbose=0 % mpicc -o helloworld helloworld.c Then if i update the mca-params.conf to have two identical entries, i have segfaults: % cat ~/.openmpi/mca-params.conf pml_base_ver

Re: [OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Ralph Castain
That's it?? Wow, that was useless. Can you attach to mpirun with gdb and tell me where it is sitting? On Sep 20, 2013, at 2:41 PM, Sylvestre Ledru wrote: > On 20/09/2013 23:36, Ralph Castain wrote: >> Nothing leaps to mind - not seeing that anywhere else. If that's all you >> saw, then it does

Re: [OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Sylvestre Ledru
On 20/09/2013 23:36, Ralph Castain wrote: > Nothing leaps to mind - not seeing that anywhere else. If that's all you saw, > then it doesn't look like it even got to the point of trying to launch > something. You might set the following: > > mpirun -mca plm_base_verbose 5 -mca ras_base_verbose 5 -

Re: [OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Ralph Castain
Nothing leaps to mind - not seeing that anywhere else. If that's all you saw, then it doesn't look like it even got to the point of trying to launch something. You might set the following: mpirun -mca plm_base_verbose 5 -mca ras_base_verbose 5 -mca rmaps_base_verbose 5 -mca ess_base_verbose 5 f

Re: [OMPI devel] RFC: Neighborhood collective support

2013-09-20 Thread Nathan Hjelm
On Fri, Sep 20, 2013 at 08:18:28PM +, Jeff Squyres (jsquyres) wrote: > Nathan: can you add these tests (or morphed versions of these tests) into the > ompi-tests/ibm tree? That way, we'll all be running them against MTT. Looks like that has already been done with at least one of those tests.

Re: [OMPI devel] RFC: Neighborhood collective support

2013-09-20 Thread Jeff Squyres (jsquyres)
Nathan: can you add these tests (or morphed versions of these tests) into the ompi-tests/ibm tree? That way, we'll all be running them against MTT. On Sep 20, 2013, at 12:40 PM, "David Goodell (dgoodell)" wrote: > On Sep 19, 2013, at 3:07 PM, "Hjelm, Nathan T" wrote: > >> I have implemente

Re: [OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Sylvestre Ledru
On 20/09/2013 21:49, Sylvestre Ledru wrote: > Hello, > > In Debian, we are in the process of switching the default MPI > implementation from version 1.4 to 1.6. > > Every architectures are fine beside ia64. Any program based on OpenMPI > 1.6.5 is freezing. > > With a basic test case: > MPI_Init(N

[OMPI devel] Openmpi 1.6.5 is freezing under GNU/Linux ia64

2013-09-20 Thread Sylvestre Ledru
Hello, In Debian, we are in the process of switching the default MPI implementation from version 1.4 to 1.6. Every architectures are fine beside ia64. Any program based on OpenMPI 1.6.5 is freezing. With a basic test case: MPI_Init(NULL, NULL); MPI_Finalize(); mpirun -c 4 foo => freeze The

Re: [OMPI devel] RFC: Neighborhood collective support

2013-09-20 Thread David Goodell (dgoodell)
On Sep 19, 2013, at 3:07 PM, "Hjelm, Nathan T" wrote: > I have implemented simple tests for cartesian, graph, and dist graph > topologies for the Open MPI/IBM test suite and all tests pass. I will push > those tests to MTT tomorrow. Consider also grabbing the dist graph and neighborhood collec

[OMPI devel] more oshmem issues

2013-09-20 Thread Jeff Squyres (jsquyres)
Mellanox -- Please remove all uses of bzero. memset is the portable way to go here. On Sep 20, 2013, at 12:30 PM, "David Goodell (dgoodell)" wrote: > "bzero" should be avoided for maximum portability. Just use "memset" > instead. Even older versions of GCC know how to spot the 0 constant

Re: [OMPI devel] [OMPI users] Error in openmpi-1.9a1r29179

2013-09-20 Thread David Goodell (dgoodell)
"bzero" should be avoided for maximum portability. Just use "memset" instead. Even older versions of GCC know how to spot the 0 constant and substitute the right compiler intrinsic(s), assuming they are available for the target platform. http://pubs.opengroup.org/onlinepubs/009695399/function

[OMPI devel] OMPI Git mirror history: incorrect history & rewinding

2013-09-20 Thread David Goodell (dgoodell)
Short Version: The OMPI GitHub mirror of the OMPI SVN history currently contains some bad history, specifically related to the version tags. Prior to roughly September 16th, this repository also contained several other bits of bad information/history. The git history was *rewound* some time a

Re: [OMPI devel] Intercomm Merge

2013-09-20 Thread Jeff Squyres (jsquyres)
Just to close my end of this loop: as of trunk r29213, it all works for me. Thanks! On Sep 18, 2013, at 12:52 PM, Ralph Castain wrote: > Thanks George - much appreciated > > On Sep 18, 2013, at 9:49 AM, George Bosilca wrote: > >> The test case was broken. I just pushed a fix. >> >> George

Re: [OMPI devel] MPI fails when launched with srun using openib btl.

2013-09-20 Thread Ralph Castain
That looks out-of-date to me - there have been commits all week long. Best guess is that the git mirror is behind On Sep 20, 2013, at 6:48 AM, Victor Kocheganov wrote: > I have HEAD on git revision: > commit 4c282fe5bc8a4143a8c6ac5c0f8d4af591277f6f > Author: Ralph Castain > Date: Sun Sep 15

Re: [OMPI devel] MPI fails when launched with srun using openib btl.

2013-09-20 Thread Victor Kocheganov
I have HEAD on git revision: commit 4c282fe5bc8a4143a8c6ac5c0f8d4af591277f6f Author: Ralph Castain List-Post: devel@lists.open-mpi.org Date: Sun Sep 15 15:33:51 2013 + May be there is a difference in PMI? I have PMI-1 on machine. On Fri, Sep 20, 2013 at 5:37 PM, Ralph Castain wrote: >

Re: [OMPI devel] MPI fails when launched with srun using openib btl.

2013-09-20 Thread Ralph Castain
What revision level are you at? I just checked and it worked fine for me On Sep 20, 2013, at 2:33 AM, Victor Kocheganov wrote: > Hi folks! > > I am trying to launch MPI master branch with srun (simple send/recv program, > see attach) and using openib, but unfortunately I get a segfault. > >

[OMPI devel] MPI fails when launched with srun using openib btl.

2013-09-20 Thread Victor Kocheganov
Hi folks! I am trying to launch *MPI master branch* with srun (simple send/recv program, see attach) and using *openib*, but unfortunately I get a *segfault *. Below is my workflow. 1) I configured ompi/master with following line: ./autogen.sh && ./configure --prefix=$PWD/install --with-openib