Howard, and Rolf,
i initially reported the issue at
http://www.open-mpi.org/community/lists/devel/2014/09/15767.php
r32659 is not a fix nor a regression, it simply aborts instead of
OBJ_RELEASE(mpi_comm_world).
/* my point here is we should focus on the root cause and not the
consequence */
HI Rolf,
Okay. I'll work with ORNL folks to see how to really fix this.
Howard
From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart
Sent: Monday, September 15, 2014 3:10 PM
To: Open MPI Developers
Subject: Re: [OMPI devel] coll ml error with some nonblocking
No objection from me - that pretty much belongs to you folks
On Sep 15, 2014, at 2:51 PM, Pritchard Jr., Howard wrote:
> Hi Folks,
>
> I’d like to rip out the cnos ess/alps code from ompi. Its dead – no
> one is using CNOS (old cray xt systems) – and its very confusing
> to
Confirmed that trunk version r32658 does pass the test.
From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Pritchard Jr.,
Howard
Sent: Monday, September 15, 2014 4:16 PM
To: Open MPI Developers
Subject: Re: [OMPI devel] coll ml error with some nonblocking collectives
Hi Rolf,
This
Hi Rolf,
This may be related to change set 32659.
If you back this change out, do the tests pass?
Howard
From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart
Sent: Monday, September 15, 2014 8:55 AM
To: de...@open-mpi.org
Subject: [OMPI devel] coll ml error with
From time-to-time, and have a need for running Open MPI apps using the openib
btl on a single node, where port 1 on the HCA is connected to port 2 on the
same HCA.
Using a vintage 1.5.4, my command line would read:
mpiexec --mca btl self,openib --mca btl_openib_cpc_include oob \
-np 1
I wonder if anyone else is seeing this failure. Not sure when this started but
it is only on the trunk. Here is a link to my failures as well as an example
below that. There are a variety of nonblocking collectives failing like this.
http://mtt.open-mpi.org/index.php?do_redir=2208
Thanks Ralph,
adding --hetero-nodes to the command line solved this issue.
Alina.
On Mon, Sep 15, 2014 at 6:51 AM, Ralph Castain wrote:
> Try adding --hetero-nodes to your mpirun cmd line
>
> On Sep 14, 2014, at 8:25 AM, Alina Sklarevich
> wrote:
Your contributions are always appreciated, Paul - thanks!
On Sep 13, 2014, at 7:51 PM, Paul Hargrove wrote:
> Ralph,
>
> I am not sure if I will have time to run my full suite of configurations,
> including all the PGI, Sun, Intel and IBM compilers on Linux.
>
> However,
Try adding --hetero-nodes to your mpirun cmd line
On Sep 14, 2014, at 8:25 AM, Alina Sklarevich wrote:
> Hello,
>
>
> I am using ompi-v1.8 and have come across the following error:
>
>
> --
10 matches
Mail list logo