On Apr 10, 2012, at 7:51 AM, TERRY DONTJE wrote:
> Fair enough sorry about the false report.
No problem - it's a good reminder to all that we changed this policy.
Previously, we allowed oversubscribe by default even on managed systems. This
generated some significant concerns from sys admins
Fair enough sorry about the false report.
I sent you email about the other failures (final and MPI_Errhandler).
--td
On 4/10/2012 9:40 AM, Ralph Castain wrote:
I looked closer at the MTT output, Terry, and loop_spawn is actually
behaving correctly. The problem is that (a) the test creates more
I looked closer at the MTT output, Terry, and loop_spawn is actually behaving
correctly. The problem is that (a) the test creates more children than
allocated slots, and (b) the tests are being executed in a managed environment,
and so we enforce the slot limit. The solution is to set the --over
loop_spawn is working for me on both my mac and on odin, but I'll look and see
if I can replicate somehow.
On Apr 10, 2012, at 4:44 AM, TERRY DONTJE wrote:
> Thanks Ralph the comm_join issue seems to be fix but the other issues
> mentioned still seem to persist. I'll look at this later today u
Thanks Ralph the comm_join issue seems to be fix but the other issues
mentioned still seem to persist. I'll look at this later today unless
someone else decides to fix them :-).
--td
On 4/9/2012 6:45 PM, Ralph Castain wrote:
Should all be fixed now.
On Apr 9, 2012, at 7:17 AM, TERRY DONTJE
Should all be fixed now.
On Apr 9, 2012, at 7:17 AM, TERRY DONTJE wrote:
> After looking at Oracles MTT results there seem to be a (some??) regressions
> between r26240 and 26249 detected by the ibm and intel tests suites. An
> example of this is the failures in the comm_join, final and loop_s
Nobody stepped up, so I fixed this in r26257
On Apr 9, 2012, at 9:21 AM, Ralph Castain wrote:
> FWIW: this isn't a bug in orte_dpm, but in the MPI binding for comm_join. The
> problem is that both sides in the comm_join are setting "send_first" to true
> - i.e., both sides are trying to be the
FWIW: this isn't a bug in orte_dpm, but in the MPI binding for comm_join. The
problem is that both sides in the comm_join are setting "send_first" to true -
i.e., both sides are trying to be the first to send on the handshake. We got
away with this before because of a bug in orte_dpm that made t
This is totally not related to the bug report, but a neat trick in Trac.
My question was "what were the commits between r26240 and 26249"?
In the search box type:
log:@26240:26249
Or use the direct url:
https://svn.open-mpi.org/trac/ompi/log/?revs=26240-26249
nifty...
-- Josh
On Mon, Apr 9,