> -----Original Message-----
> From: devel-boun...@open-mpi.org
> [mailto:devel-boun...@open-mpi.org] On Behalf Of Matthijs Richard Koot
> Sent: Wednesday, June 14, 2006 1:04 AM
> To: de...@open-mpi.org
> Subject: [OMPI devel] Q: Job scheduling of MPI applications?
> (in general)
>
> I'm new to this list, and have a question regarding how
> MPI jobs are scheduled by JMSs. If I understand correctly, to
> have decent management/scheduling of MPI jobs, there are
> requirements for both the MPI implementation and JMS
> implementation, for them to be 'integrated':
>
> - the JMS needs to be 'parallel-aware', i.e. implement the PSCHED API;
It is probably more precise to say that the JMS ("Job Management
System"?) needs to provide a mechanism to start jobs on allocated nodes.
If it provides a parallel mechanism (e.g., a caller can invoke one
command to launch many processes), so much the better -- but if the
mechanism is serial, that's ok too. All common resource managers
provide *some* way of launching jobs on allocated nodes -- indeed,
starting and stopping jobs is one of their main purposes.
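To make the serial vs. parallel distinction concrete, here is a minimal sketch. The names launch_serial, launch_parallel, start_one and start_many are hypothetical and only for illustration; they do not correspond to any real resource manager API or to Open MPI's code.

```python
def launch_serial(nodes, command, start_one):
    """Serial mechanism: one launcher call per allocated node."""
    return [start_one(node, command) for node in nodes]


def launch_parallel(nodes, command, start_many):
    """Parallel mechanism: a single launcher call covers all nodes."""
    return start_many(list(nodes), command)


if __name__ == "__main__":
    calls = []

    # Stand-ins for whatever the resource manager actually provides.
    def fake_start_one(node, command):
        calls.append([node])
        return f"{command}@{node}"

    def fake_start_many(nodes, command):
        calls.append(nodes)
        return [f"{command}@{n}" for n in nodes]

    nodes = ["node0", "node1", "node2"]
    launch_serial(nodes, "a.out", fake_start_one)
    serial_calls = len(calls)
    launch_parallel(nodes, "a.out", fake_start_many)
    print("serial launcher calls:", serial_calls)
    print("parallel launcher calls:", len(calls) - serial_calls)
```

Either way the same processes end up running on the same nodes; the only difference is how many times the caller has to invoke the launcher.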
The PSCHED API is one of several such interfaces. A subset of it, the
TM (task management) interface, is in common use only in the PBS line
of resource managers (Torque, PBS Pro, etc.). I doubt that TM is the
native interface that the PBS flavors use internally to launch
processes, but I have not dug into the implementation enough to know.
Other resource managers have different interfaces.
> - the MPI needs to be 'JMS-aware', i.e. call the PSCHED
> functions at the JMS.
That's correct in spirit, but it would be a little more precise to say that
the MPI needs to be aware of and properly utilize the mechanism that the
resource manager provides to start jobs.
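As a rough illustration of what "being aware of the resource manager" can mean, the sketch below checks for environment variables that these systems commonly export inside a running job. The marker table and the function name detect_resource_manager are assumptions for illustration only; this is not how Open MPI itself is implemented.

```python
import os

# Environment variables commonly set inside a job by each resource manager.
# Illustrative assumption only; not Open MPI's actual detection logic.
RM_MARKERS = [
    ("PBS/Torque", "PBS_JOBID"),
    ("SLURM", "SLURM_JOB_ID"),
    ("SGE", "PE_HOSTFILE"),
    ("LSF", "LSB_JOBID"),
]


def detect_resource_manager(env=None):
    """Return the name of the first resource manager whose marker is set."""
    env = os.environ if env is None else env
    for name, marker in RM_MARKERS:
        if env.get(marker):
            return name
    return None  # nothing detected; a launcher might fall back to rsh/ssh


if __name__ == "__main__":
    rm = detect_resource_manager()
    print(rm if rm else "no resource manager detected")
```

Once the MPI knows which resource manager it is running under, it can pick the matching launch mechanism instead of a generic one.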
> My questions:
> 1. Is this correct?
> 2. Which question is valid: "does OpenMPI support
> SGE?", or: "does SGE support OpenMPI"?
It's probably more correct to ask if Open MPI supports a given resource
manager.
> 3. How do I know which JMSs (Torque/OpenPBS, SGE, LSF, ...)
> are compatible with which MPI implementations (OpenMPI,
> MPICH, MPICH-G2, ...), and vice versa?
I can't speak for the other MPI implementations, but for Open MPI, you
can look here:
http://www.open-mpi.org/faq/?category=supported-systems#rte
> 4. Is it true that the PSCHED API is the 'de facto' for such
> integration?
No. It was an attempt to standardize such things, but it never really
caught on outside of the PBS family.
--
Jeff Squyres
Server Virtualization Business Unit
Cisco Systems