Ralph,

I think David's concern is that because Slurm has changed their default
behavior, Open MPI's default behavior has changed as well.
The request (on which I have no opinion) appears to be that ORTE make an
explicit request for the behavior that was the previous default in Slurm.
That would ensure that the behavior of Open MPI remains independent of the
Slurm version.

David,

The problem here appears to be that the new (--export=NONE) behavior means
that $PATH and/or $LD_LIBRARY_PATH are not propagated, and thus orted could
not be found.
I believe you can configure Open MPI with --enable-mpirun-prefix-by-default
to resolve the reported "orted: No such file or directory"

-Paul

On Fri, Apr 17, 2015 at 1:13 PM, Ralph Castain <r...@open-mpi.org> wrote:

> Hmmm...but what if a user -doesn't- want their environment forwarded? Seems
> presumptuous of us to arbitrarily decide to do so on their behalf.
>
>
> > On Apr 16, 2015, at 7:42 PM, David Singleton <
> david.b.single...@gmail.com> wrote:
> >
> >
> > Our site effectively runs all slurm jobs with sbatch --export=NONE ...
> and creates the necessary environment inside the batch script.  After
> upgading to 14.11,  OpenMPI mpirun jobs hit
> >
> > 2015-04-15T08:53:54+08:00 nod0138 slurmstepd[3122]: error: execve():
> orted: No such file or directory
> >
> > The issue appears to be that, as of 14.11, srun now recognizes
> --export=NONE and, more importantly, the SLURM_EXPORT_ENV=NONE set in the
> jobs environment if you submit with sbatch --export=NONE .   The simple
> workaround is to unset SLURM_EXPORT_ENV before mpirun.  Possibly mpirun
> should add --export=ALL to its srun commands.
> >
> > Cheers
> > David
> >
> > _______________________________________________
> > devel mailing list
> > de...@open-mpi.org
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/04/17236.php
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/04/17252.php




-- 
Paul H. Hargrove                          phhargr...@lbl.gov
Computer Languages & Systems Software (CLaSS) Group
Computer Science Department               Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900

Reply via email to