Hi Jordi,

This indeed looks like a reasonable starting point.

Two more things that came to mind:

* You'll need to distinguish clearly between --resource-manager* and --job*, 
although that is mostly an issue for the documentation, I guess

* The launcher/bootstrap values also depend on which MPI library is being used, 
do they not?
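
To make the second point a bit more concrete, here is a rough sketch of how 
the launcher/bootstrap defaults could be keyed on both the resource manager 
and the MPI library, with explicitly configured values taking precedence. 
None of this exists in EasyBuild: the function name, the table and the 
default values in it are assumptions made up for illustration only.

```python
# Hypothetical defaults, keyed by (resource manager, MPI library).
# The values are illustrative placeholders, not verified recommendations.
DEFAULTS = {
    ("slurm", "openmpi"): {"launcher": "srun", "bootstrap": "slurm"},
    ("slurm", "impi"): {"launcher": "mpiexec.hydra", "bootstrap": "slurm"},
    ("torque", "openmpi"): {"launcher": "mpirun", "bootstrap": "pbsdsh"},
    ("torque", "impi"): {"launcher": "mpiexec.hydra", "bootstrap": "pbsdsh"},
}

# Used when the (resource manager, MPI library) pair is not in the table.
FALLBACK = {"launcher": "mpirun", "bootstrap": "ssh"}


def resolve_settings(resource_manager, mpi_library, overrides=None):
    """Return launcher/bootstrap settings for the given combination.

    Values in `overrides` that are not None replace the defaults, which
    mimics a user correcting a wrong guess via configuration options.
    """
    key = (resource_manager.lower(), mpi_library.lower())
    # Copy so that callers can never modify the shared tables.
    settings = dict(DEFAULTS.get(key, FALLBACK))
    for name, value in (overrides or {}).items():
        if value is not None:
            settings[name] = value
    return settings
```

With this, resolve_settings("slurm", "impi") would pick the table entry, 
while resolve_settings("slurm", "impi", {"launcher": "srun"}) would keep the 
bootstrap default and only replace the launcher.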


K.

> On 12 Mar 2017, at 07:36, Jordi Blasco <[email protected]> wrote:
> 
> Hi Kenneth,
> 
> I don't have the complete picture yet. Most of our clients are using Slurm, 
> torque and Grid Engine. So, we can contribute with those implementations.
> 
> At this point we have seen a need for the following parameters:
> resource-manager=<torque|slurm|sge|lsf>
> resource-manager-prefix=/usr/local/torque/6.1.0
> resource-manager-etc=/etc/slurm
> resource-manager-launcher=<srun|mpirun|mpiexec.hydra>
> resource-manager-bootstrap=<ssh|rsh|pdsh|pbsdsh|fork|persist|slurm|ll|lsf|sge>
> This approach could also be extended by including the fabric technology (and 
> the OFED distribution if required).
> 
> I'll develop a proof of concept.
> 
> Thanks!!!!
> 
> Jordi 
> 
> 
>> On 9 March 2017 at 22:15, Åke Sandgren <[email protected]> wrote:
>> +1
>> 
>> One would of course also need the install path of the resource manager
>> and possibly version to use.
>> Preferred mpi starter command and arguments to use, like -n vs -np for
>> srun/mpirun. And possibly a bunch of other things.
>> 
>> On 03/09/2017 10:09 AM, Kenneth Hoste wrote:
>> > Can you elaborate a bit on what you have in mind now exactly?
>> >
>> > What kind of information would need to be provided to the EasyBuild
>> > framework, what kind of information do you need in easyblocks?
>> >
>> > Maybe we should implement some kind of auto-detect mechanism in
>> > framework, where the necessary information is collected automatically
>> > based on which workload manager is used (so, you would only need to
>> > configure EasyBuild with --use-resource-manager=Torque for example), but
>> > also provide a way to override those values in case the auto-detection
>> > gets them wrong?
>> 
>> --
>> Ake Sandgren, HPC2N, Umea University, S-90187 Umea, Sweden
>> Internet: [email protected]   Phone: +46 90 7866134 Fax: +46 90-580 14
>> Mobile: +46 70 7716134 WWW: http://www.hpc2n.umu.se
> 
