Hi Chris, I'm interested in SLURM / OpenMPI startup numbers, but I haven't done this testing myself. We're stuck with an older version of SLURM for various internal reasons, and I'm wondering whether it's worth the effort to back port the PMI2 support. Can you share some of the differences in times at different scales? Thanks, -Adam ________________________________________ From: devel [devel-boun...@open-mpi.org] on behalf of Christopher Samuel [sam...@unimelb.edu.au] Sent: Tuesday, May 06, 2014 8:32 PM To: de...@open-mpi.org Subject: Re: [OMPI devel] RFC: Force Slurm to use PMI-1 unless PMI-2 is specifically requested
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 07/05/14 12:53, Ralph Castain wrote: > We have been seeing a lot of problems with the Slurm PMI-2 support > (not in OMPI - it's the code in Slurm that is having problems). At > this time, I'm unaware of any advantage in using PMI-2 over PMI-1 > in Slurm - the scaling is equally poor, and PMI-2 does not supports > any additional functionality. > > I know that Cray PMI-2 has a definite advantage, so I'm proposing > that we turn PMI-2 "off" when under Slurm unless the user > specifically requests we use it. Our local testing has shown that PMI-2 in 1.7.x gives a massive improvement in scaling when starting jobs with srun over using srun with OMPI 1.6.x and now that OMPI 1.8.x is out we're planning on moving to using PMI2 with OMPI and srun. Using mpirun gives good performance with OMPI 1.6.x but Slurm then gets all its memory stats wrong and if you run with CR_Core_Memory in Slurm you have a very high risk your job will get killed incorrectly. All the best, Chris - -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: sam...@unimelb.edu.au Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.14 (GNU/Linux) Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iEYEARECAAYFAlNpqUwACgkQO2KABBYQAh/igwCfQSB/v3tI37Rq4z5z/0xT/BYU 6ToAn3Qt6tOt46LQD25eHhlx+3z/sjnQ =LEHf -----END PGP SIGNATURE----- _______________________________________________ devel mailing list de...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2014/05/14691.php