Re: [OMPI devel] coll ml error with some nonblocking collectives

2014-09-15 Thread Gilles Gouaillardet
Howard and Rolf, I initially reported the issue (http://www.open-mpi.org/community/lists/devel/2014/09/15767.php). r32659 is neither a fix nor a regression; it simply aborts instead of OBJ_RELEASE(mpi_comm_world). /* my point here is that we should focus on the root cause and not the consequence */

Re: [OMPI devel] coll ml error with some nonblocking collectives

2014-09-15 Thread Pritchard Jr., Howard
Hi Rolf, Okay. I'll work with ORNL folks to see how to really fix this. Howard From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart Sent: Monday, September 15, 2014 3:10 PM To: Open MPI Developers Subject: Re: [OMPI devel] coll ml error with some nonblocking

Re: [OMPI devel] removing cnos support from ompi

2014-09-15 Thread Ralph Castain
No objection from me - that pretty much belongs to you folks. On Sep 15, 2014, at 2:51 PM, Pritchard Jr., Howard wrote: > Hi Folks, > > I’d like to rip out the cnos ess/alps code from ompi. It’s dead – no > one is using CNOS (old Cray XT systems) – and it’s very confusing > to

Re: [OMPI devel] coll ml error with some nonblocking collectives

2014-09-15 Thread Rolf vandeVaart
Confirmed that trunk version r32658 does pass the test. From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Pritchard Jr., Howard Sent: Monday, September 15, 2014 4:16 PM To: Open MPI Developers Subject: Re: [OMPI devel] coll ml error with some nonblocking collectives Hi Rolf, This

Re: [OMPI devel] coll ml error with some nonblocking collectives

2014-09-15 Thread Pritchard Jr., Howard
Hi Rolf, This may be related to change set 32659. If you back this change out, do the tests pass? Howard From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart Sent: Monday, September 15, 2014 8:55 AM To: de...@open-mpi.org Subject: [OMPI devel] coll ml error with

[OMPI devel] External loopback

2014-09-15 Thread Håkon Bugge
From time to time, I have a need to run Open MPI apps using the openib btl on a single node, where port 1 on the HCA is connected to port 2 on the same HCA. Using a vintage 1.5.4, my command line would read: mpiexec --mca btl self,openib --mca btl_openib_cpc_include oob \ -np 1

[OMPI devel] coll ml error with some nonblocking collectives

2014-09-15 Thread Rolf vandeVaart
I wonder if anyone else is seeing this failure. Not sure when this started but it is only on the trunk. Here is a link to my failures as well as an example below that. There are a variety of nonblocking collectives failing like this. http://mtt.open-mpi.org/index.php?do_redir=2208

Re: [OMPI devel] hwloc error

2014-09-15 Thread Alina Sklarevich
Thanks Ralph, adding --hetero-nodes to the command line solved this issue. Alina. On Mon, Sep 15, 2014 at 6:51 AM, Ralph Castain wrote: > Try adding --hetero-nodes to your mpirun cmd line > > On Sep 14, 2014, at 8:25 AM, Alina Sklarevich > wrote:

Re: [OMPI devel] 1.8.3rc1 - start your engines

2014-09-15 Thread Ralph Castain
Your contributions are always appreciated, Paul - thanks! On Sep 13, 2014, at 7:51 PM, Paul Hargrove wrote: > Ralph, > > I am not sure if I will have time to run my full suite of configurations, > including all the PGI, Sun, Intel and IBM compilers on Linux. > > However,

Re: [OMPI devel] hwloc error

2014-09-15 Thread Ralph Castain
Try adding --hetero-nodes to your mpirun cmd line. On Sep 14, 2014, at 8:25 AM, Alina Sklarevich wrote: > Hello, > > > I am using ompi-v1.8 and have come across the following error: > > > --