Re: [OMPI devel] Revise paffinity method?

Ralph Castain Thu, 7 May 2009 15:58:05 -0400

FWIW: Jeff and I chatted about this on the phone and came up with two issues that need resolving:

1. we use mpi_paffinity_alone to indicate that we should bind processes, yet the orteds have no way of seeing that MCA param as it is registered and evaluated in the MPI layer. We propose to resolve this by (a) declaring an opal_paffinity_alone MCA param in the paffinity framework, and then (b) declaring an alias of mpi_paffinity_alone for it, also in the paffinity framework. This obviously is an abstraction break, but we feel it is an acceptable one under the circumstances.

Our apologies to Lenny, whose ears were boxed over doing just this last year...sigh.

This will allow the orteds to check to see if processes should be bound before launching them.

2. we would not be able to bind processes launched without daemons under systems that do not provide their own process binding capability. For example, on Torque, we have an ability to natively launch processes from within mpirun - those processes currently can bind themselves in MPI_Init, but would not be able to do so any longer under this proposed change.

To alleviate that problem, we propose to leave the process binding code that is currently in MPI_Init, but surround it with a test to see if an MCA param has been set indicating that the proc is to use that code to bind itself. Thus, when launching without daemons (but via mpirun), we can set the flag and instruct the procs to bind themselves. However, procs that are launched without daemons via something which has its own binding capability (e.g., SLURM), and procs that were launched via daemon (and hence would have already been bound), would not attempt to do so.
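The decision described above can be sketched in C as follows. This is only an illustration under stated assumptions, not Open MPI code: the environment variable name, the `launch_mode_t` enum, and all three function names are hypothetical, since the message does not name the new MCA param or say how it would be passed to the procs. The sketch assumes the flag reaches each proc through its environment.

```c
#define _POSIX_C_SOURCE 200809L  /* for setenv/unsetenv */
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical name: the message does not say what the new MCA param is called. */
#define SELF_BIND_ENV "OMPI_MCA_example_bind_in_mpi_init"

/* The three launch cases distinguished in the message (names are illustrative). */
typedef enum {
    LAUNCH_VIA_DAEMON,         /* daemon starts the proc and has already bound it */
    LAUNCH_DIRECT_BY_MPIRUN,   /* no daemon, e.g. native launch under Torque      */
    LAUNCH_DIRECT_NATIVE_BIND  /* no daemon, launcher binds itself, e.g. SLURM    */
} launch_mode_t;

/* Launcher side: only procs started directly by mpirun should bind themselves. */
bool launcher_wants_self_bind(launch_mode_t mode, bool binding_requested)
{
    return binding_requested && mode == LAUNCH_DIRECT_BY_MPIRUN;
}

/* Launcher side: set or clear the flag in the environment the procs inherit.
 * Returns 0 on success, -1 on failure (as setenv/unsetenv do). */
int launcher_export_flag(launch_mode_t mode, bool binding_requested)
{
    if (launcher_wants_self_bind(mode, binding_requested)) {
        return setenv(SELF_BIND_ENV, "1", 1);
    }
    return unsetenv(SELF_BIND_ENV);
}

/* Process side: the test that would surround the binding code in MPI_Init. */
bool proc_should_self_bind(void)
{
    const char *value = getenv(SELF_BIND_ENV);
    return value != NULL && strcmp(value, "1") == 0;
}
```

With this shape, a proc started by a daemon or under a launcher with its own binding never sees the flag and skips the MPI_Init binding code, while a proc started directly by mpirun sees it and binds itself.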



Any further thoughts are welcome...
Ralph


On May 7, 2009, at 12:59 PM, Ralph Castain wrote:

I can do the coding - just want to ensure interested others get their $0.002 in on how it should work.
I came up with a way to do it that doesn't require changes to the paffinity framework. I can complete the prototype next week on an hg branch and let you look at it. Mostly consists of moving what is now in MPI_Init into the odls modules between the fork and exec, as Brian suggested.
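For readers unfamiliar with the "between the fork and exec" idea, here is a minimal sketch in C. It is not the prototype mentioned above: it assumes Linux and calls sched_setaffinity directly, whereas the real code would go through the paffinity framework, and the function names (`first_allowed_cpu`, `spawn_bound`, `wait_for_exit_status`) are hypothetical. The point it shows is that the child binds itself after fork, so no separate "proxy" script is needed, and the binding is kept across exec.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE  /* for sched_setaffinity and the CPU_* macros */
#endif
#include <assert.h>
#include <sched.h>
#include <stddef.h>
#include <stdio.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Return the lowest-numbered CPU this process may run on, or -1 on error. */
int first_allowed_cpu(void)
{
    cpu_set_t allowed;
    if (sched_getaffinity(0, sizeof(allowed), &allowed) != 0) {
        return -1;
    }
    for (int cpu = 0; cpu < CPU_SETSIZE; cpu++) {
        if (CPU_ISSET(cpu, &allowed)) {
            return cpu;
        }
    }
    return -1;
}

/* Fork, bind the child to `cpu`, then exec argv[0] (searched in PATH).
 * Returns the child's pid in the parent, or -1 if fork fails. */
pid_t spawn_bound(int cpu, char *const argv[])
{
    assert(argv != NULL && argv[0] != NULL);

    pid_t pid = fork();
    if (pid < 0) {
        return -1;
    }
    if (pid == 0) {
        /* Child: a process can only bind itself, so do it here, before exec. */
        cpu_set_t mask;
        CPU_ZERO(&mask);
        CPU_SET(cpu, &mask);
        if (sched_setaffinity(0, sizeof(mask), &mask) != 0) {
            perror("sched_setaffinity");
            _exit(126);
        }
        execvp(argv[0], argv);  /* returns only on failure */
        perror("execvp");
        _exit(127);
    }
    return pid;  /* Parent: the launching daemon would record this pid. */
}

/* Wait for the child; return its exit status, or -1 if it did not exit normally. */
int wait_for_exit_status(pid_t pid)
{
    int status = 0;
    if (waitpid(pid, &status, 0) < 0) {
        return -1;
    }
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

A launcher would call something like `spawn_bound(first_allowed_cpu(), argv)` once per process, choosing a different CPU for each local rank.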
On May 7, 2009, at 12:43 PM, Terry Dontje wrote:
Brian W. Barrett wrote:
On Wed, 6 May 2009, Ralph Castain wrote:
Any thoughts on this? Should we change it?
Yes, we should change this (IMHO) :).
Me too.
If so, who wants to be involved in the re-design? I'm pretty sure it would require some modification of the paffinity framework, plus some minor mods to the odls framework and (since you cannot bind a process other than yourself) addition of a new small "proxy" script that would bind-then-exec each process started by the orted (Eugene posted a candidate on the user list, though we will have to deal with some system-specific issues in it).
I can't contribute a whole lot of time, but I'd be happy to lurk, offer advice, and write some small bits of code. But I definitely can't lead.
First offering of opinion from me. I think we can avoid the "proxy" script by doing the binding after the fork but before the exec. This will definitely require minor changes to the odls and probably a bunch of changes to the paffinity framework. This will make things slightly less fragile than a script would, and yet get us what we want.
I'll have to talk with Len to see if Sun has any time to allocate to this.
--td
_______________________________________________
devel mailing list
[email protected]
http://www.open-mpi.org/mailman/listinfo.cgi/devel
