Re: [OMPI devel] OMPI/ORTE and tools

Josh Hursey Wed, 16 Jan 2008 10:58:55 -0500

Ralph,

This looks interesting. Can you point me to the header files and anyORTE tools that you may have converted to use this library already(e.g., orte-ps)? I can port the checkpoint/restart tools to thislibrary and start sending you some feedback on the API.


Cheers,
Josh

On Jan 16, 2008, at 7:47 AM, Ralph Castain wrote:

Hello all

Summary: this note provides a brief overview of how various tools can
interface to OMPI applications once the next version of ORTE isintegratedinto the trunk. It includes a request for input regarding any needs(e.g.,additional commands to be supported in the interface) that have notbeen
adequately addressed.
As many of you know, I have been working on a tmp branch tocomplete therevamp of ORTE that has been in progress for quite some time. Amongotherthings, this revamp is intended to simplify the system, provideenhanced
scalability, and improved reliability.
As part of that effort, I have extensively revised the support forexternaltools. In the past, tools such as the Eclipse PTP could onlyinteract withOpen MPI-based applications via ORTE API's, thus exposing the toolto anychanges in those APIs. Most tools, however, do not require thelevel ofcontrol provided by the APIs and can benefit from a simplifiedinterface.
Accordingly, the revamped ORTE now offers alternative methods of
interaction. The primary change has been the creation of acommunicationslibrary with a simple serial protocol for interacting with OMPIjobs. Thus,
tools now have three choices for interacting with OMPI jobs:
1. I have created a new communications library that tools can linkagainst.It does not include all of the ORTE or OMPI libraries, so it is avery smallmemory footprint. Besides the usual calls to initialize andfinalize, thelibrary contains utilities for finding all of the OMPI jobs runningon thatHNP (i.e., all OMPI jobs whose mpirun was executed from that host),queryingthe status of a job (provides the job map plus all proc states);queryingthe status of nodes (provides node names, status, and list of procson eachnode including their state); querying the status of a specificprocess;spawning a new job; and terminating a job. In addition, you canattach tooutput streams of any process, specifying stdout, stderr, or both -this"tees" the specified streams, so it won't interfere with the job'snormal
output flow.
I could also create a utility to allow attachment to the inputstream of a
process. However, I'm a little concerned about possible conflicts with
whatever is already flowing across that stream. I would appreciate any
suggestions as to whether or not to provide that capability.
Note: we removed the concept of the ORTE "universe", so a tool cannow talk
to any mpirun without complications. Thus, tools can simultaneously
"connect" to and monitor multiple mpiruns, if desired.
2. link against all of OMPI or ORTE, and execute a standaloneprogram. In
this mode, your tool would act as a surrogate for mpirun by directly
spawning the user's application. This provides some flexibility,but it doesmean that both the tool and the job -must- end together, and thatthe tool
may need to be revised whenever OMPI/ORTE APIs are updated.


3. link against all of OMPI or ORTE, executing as a distributed set of
processes. In this mode, you would execute your tool via "mpirun -pernode./my_tool" (or whatever command is appropriate - this example wouldlaunchone tool process on every node in the allocation). If the toolprocessesneed to communicate with each other, they can call MPI_Init ororte_init,depending upon the level of desired communication. Note that thetool job
will be completely standalone from the application job and must be
terminated separately.
In all of these cases, it is possible for tool processes to connect(via MPIand/or ORTE-RML) to a job's processes provided that the applicationsupports
it.
I can provide more details, of course, to anyone wishing them. WhatI would
appreciate, though, is any feedback about desired commands, mode of
operation, etc. that I might have missed or people would prefer bechanged.This code is all in a private repository for my tmp branch, but Iexpect
that to merge with the trunk fairly soon. I have provided a couple of
example tools to illustrate the above modes of operation in that code.

Thanks
Ralph





_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Re: [OMPI devel] OMPI/ORTE and tools

Reply via email to