Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-09-18 Thread Jeff Squyres (jsquyres)
rect NUMA libraries aren't >>>> installed? >>>> >>>> Here are some of the other NUMA packages available for CentOS 6.x: >>>> >>>> yum search numa | less >>>> >>>> Loaded plugins: fastestmirror &

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-09-02 Thread Ralph Castain
ph Castain > [r...@open-mpi.org] > Sent: Tuesday, September 02, 2014 11:03 AM > To: Open MPI Users > Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots > (updated findings) > > On Sep 2, 2014, at 10:48 AM, Lane, William wrote: > >

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-09-02 Thread Lane, William
@open-mpi.org] on behalf of Ralph Castain [r...@open-mpi.org] Sent: Tuesday, September 02, 2014 11:03 AM To: Open MPI Users Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings) On Sep 2, 2014, at 10:48 AM, Lane, William wrote: > Ralph, > > T

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-09-02 Thread Ralph Castain
__ > From: users [users-boun...@open-mpi.org] on behalf of Ralph Castain > [r...@open-mpi.org] > Sent: Saturday, August 30, 2014 7:15 AM > To: Open MPI Users > Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots > (up

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-09-02 Thread Lane, William
y for tuning for Non Uniform Memory > Access machines > > -Bill Lane > ________________ > From: users [users-boun...@open-mpi.org] on behalf of Reuti > [re...@staff.uni-marburg.de] > Sent: Thursday, August 28, 2014 3:27 AM > To: Open MPI Users >

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-08-30 Thread Ralph Castain
rm Memory > Access machines > > -Bill Lane > ________________ > From: users [users-boun...@open-mpi.org] on behalf of Reuti > [re...@staff.uni-marburg.de] > Sent: Thursday, August 28, 2014 3:27 AM > To: Open MPI Users > Subject: Re: [OMPI users

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-08-30 Thread Lane, William
tuning for Non Uniform Memory Access machines -Bill Lane From: users [users-boun...@open-mpi.org] on behalf of Reuti [re...@staff.uni-marburg.de] Sent: Thursday, August 28, 2014 3:27 AM To: Open MPI Users Subject: Re: [OMPI users] Mpirun 1.5.4 problems when req

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-08-28 Thread Reuti
than less. > > Thank you in advance, > > -Bill Lane > > > From: users [users-boun...@open-mpi.org] on behalf of Jeff Squyres (jsquyres) > [jsquy...@cisco.com] > Sent: Friday, August 08, 2014 5:25 AM > To: Open MPI User'

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots (updated findings)

2014-08-28 Thread Lane, William
users-boun...@open-mpi.org] on behalf of Jeff Squyres (jsquyres) [jsquy...@cisco.com] Sent: Friday, August 08, 2014 5:25 AM To: Open MPI User's List Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots On Aug 8, 2014, at 1:24 AM, Lane, William wrote: > Using the "

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-08-08 Thread Jeff Squyres (jsquyres)
On Aug 8, 2014, at 1:24 AM, Lane, William wrote: > Using the "--mca btl tcp,self" switch to mpirun solved all the issues (in > addition to > the requirement to include the --mca btl_tcp_if_include eth0 switch). I > believe > the "--mca btl tcp,self" switch limits inter-process communication wit

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-08-08 Thread Lane, William
__ From: users [users-boun...@open-mpi.org] on behalf of Jeff Squyres (jsquyres) [jsquy...@cisco.com] Sent: Tuesday, July 22, 2014 2:29 PM To: Open MPI User's List Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots Hyperthreading is pretty great for non-HPC applicatio

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-22 Thread Jeff Squyres (jsquyres)
MPI apps? > > Thanks for your time, > > -Bill Lane > > > From: users [users-boun...@open-mpi.org] on behalf of Ralph Castain > [r...@open-mpi.org] > Sent: Tuesday, July 22, 2014 7:57 AM > To: Open MPI Users > Subject: Re: [OMPI users] Mpirun 1.5.4 problems when req

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-22 Thread Lane, William
t: Sunday, July 20, 2014 9:30 AM To: Open MPI Users Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots I'm unaware of any CentOS-OMPI bug, and I've been using CentOS throughout the 6.x series running OMPI 1.6.x and above. I can't speak to the older versions o

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-22 Thread Ralph Castain
ew.php?id=5812 > > From: users [users-boun...@open-mpi.org] on behalf of Ralph Castain > [r...@open-mpi.org] > Sent: Sunday, July 20, 2014 9:30 AM > To: Open MPI Users > Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots > > I'm unaware of any Ce

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-22 Thread Jeff Squyres (jsquyres)
.5.4 release. On Jul 21, 2014, at 2:34 AM, Lane, William wrote: > Please see: > > http://bugs.centos.org/view.php?id=5812 > > From: users [users-boun...@open-mpi.org] on behalf of Ralph Castain > [r...@open-mpi.org] > Sent: Sunday, July 20, 2014 9:30 AM > To: Open MPI Users

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-21 Thread Lane, William
n-mpi.org>] Sent: Saturday, July 19, 2014 3:21 PM To: Open MPI Users Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots Not for this test case size. You should be just fine with the default values. If I understand you correctly, you've run this app at scale before

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-20 Thread Ralph Castain
it jobs through SGE (via > qrsh > or qsub) or outside of SGE, which leads me to believe it is an Open MPI and/or > CentOS > issue. > > -Bill Lane > > From: users [users-boun...@open-mpi.org] on behalf of Ralph Castain > [r...@open-mpi.org] > Sent: Saturday, July 1

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-19 Thread Lane, William
...@open-mpi.org>] Sent: Saturday, July 19, 2014 8:07 AM To: Open MPI Users Subject: Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots That's a pretty old OMPI version, and we don't really support it any longer. However, I can provide some advice: * have you tried runni

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-19 Thread Ralph Castain
er > values > be necessary? > > Thank you for your help. > > -Bill Lane > > From: users [users-boun...@open-mpi.org] on behalf of Ralph Castain > [r...@open-mpi.org] > Sent: Saturday, July 19, 2014 8:07 AM > To: Open MPI Users > Subject: Re: [OMPI users] Mpir

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-19 Thread Lane, William
s to 4096? Or should even larger values be necessary? Thank you for your help. -Bill Lane From: users [users-boun...@open-mpi.org] on behalf of Ralph Castain [r...@open-mpi.org] Sent: Saturday, July 19, 2014 8:07 AM To: Open MPI Users Subject: Re: [OMPI users] Mpirun

Re: [OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-19 Thread Ralph Castain
That's a pretty old OMPI version, and we don't really support it any longer. However, I can provide some advice: * have you tried running the simple "hello_c" example we provide? This would at least tell you if the problem is in your app, which is what I'd expect given your description * try u

[OMPI users] Mpirun 1.5.4 problems when request > 28 slots

2014-07-19 Thread Lane, William
I'm getting consistent errors of the form: "mpirun noticed that process rank 3 with PID 802 on node csclprd3-0-8 exited on signal 11 (Segmentation fault)." whenever I request more than 28 slots. These errors even occur when I run mpirun locally on a compute node that has 32 slots (8 cores, 16 wi