Re: [OMPI devel] trunk regressions

2012-04-10 Thread Ralph Castain
On Apr 10, 2012, at 7:51 AM, TERRY DONTJE wrote: > Fair enough sorry about the false report. No problem - it's a good reminder to all that we changed this policy. Previously, we allowed oversubscribe by default even on managed systems. This generated some significant concerns from sys admins

Re: [OMPI devel] trunk regressions

2012-04-10 Thread TERRY DONTJE
Fair enough sorry about the false report. I sent you email about the other failures (final and MPI_Errhandler). --td On 4/10/2012 9:40 AM, Ralph Castain wrote: I looked closer at the MTT output, Terry, and loop_spawn is actually behaving correctly. The problem is that (a) the test creates more

Re: [OMPI devel] trunk regressions

2012-04-10 Thread Ralph Castain
I looked closer at the MTT output, Terry, and loop_spawn is actually behaving correctly. The problem is that (a) the test creates more children than allocated slots, and (b) the tests are being executed in a managed environment, and so we enforce the slot limit. The solution is to set the --over

Re: [OMPI devel] trunk regressions

2012-04-10 Thread Ralph Castain
loop_spawn is working for me on both my mac and on odin, but I'll look and see if I can replicate somehow. On Apr 10, 2012, at 4:44 AM, TERRY DONTJE wrote: > Thanks Ralph the comm_join issue seems to be fix but the other issues > mentioned still seem to persist. I'll look at this later today u

Re: [OMPI devel] trunk regressions

2012-04-10 Thread TERRY DONTJE
Thanks Ralph the comm_join issue seems to be fix but the other issues mentioned still seem to persist. I'll look at this later today unless someone else decides to fix them :-). --td On 4/9/2012 6:45 PM, Ralph Castain wrote: Should all be fixed now. On Apr 9, 2012, at 7:17 AM, TERRY DONTJE

Re: [OMPI devel] trunk regressions

2012-04-09 Thread Ralph Castain
Should all be fixed now. On Apr 9, 2012, at 7:17 AM, TERRY DONTJE wrote: > After looking at Oracles MTT results there seem to be a (some??) regressions > between r26240 and 26249 detected by the ibm and intel tests suites. An > example of this is the failures in the comm_join, final and loop_s

Re: [OMPI devel] trunk regressions

2012-04-09 Thread Ralph Castain
Nobody stepped up, so I fixed this in r26257 On Apr 9, 2012, at 9:21 AM, Ralph Castain wrote: > FWIW: this isn't a bug in orte_dpm, but in the MPI binding for comm_join. The > problem is that both sides in the comm_join are setting "send_first" to true > - i.e., both sides are trying to be the

Re: [OMPI devel] trunk regressions

2012-04-09 Thread Ralph Castain
FWIW: this isn't a bug in orte_dpm, but in the MPI binding for comm_join. The problem is that both sides in the comm_join are setting "send_first" to true - i.e., both sides are trying to be the first to send on the handshake. We got away with this before because of a bug in orte_dpm that made t

Re: [OMPI devel] trunk regressions

2012-04-09 Thread Josh Hursey
This is totally not related to the bug report, but a neat trick in Trac. My question was "what were the commits between r26240 and 26249"? In the search box type: log:@26240:26249 Or use the direct url: https://svn.open-mpi.org/trac/ompi/log/?revs=26240-26249 nifty... -- Josh On Mon, Apr 9,