Slurm version 14.11.0 is now available. This is a major Slurm release
with many new features. See the RELEASE_NOTES and NEWS files in the
distribution for detailed descriptions of the changes, a few of which
are noted below.
Upgrading from Slurm versions 2.6 or 14.03 should proceed without loss
of jobs or other state. Just be sure to upgrade the slurmdbd first.
(Upgrades from pre-releases of version 14.11 may result job loss.)
Slurm downloads are available from http://www.schedmd.com/#repos.
Thanks to all those who helped make this release!
Highlights of changes in Slurm version 14.11.0 include:
-- Added job array data structure and removed 64k array size restriction.
-- Added support for reserving CPUs and/or memory on a compute node
for system
use.
-- Added support for allocation of generic resources by model type for
heterogeneous systems (e.g. request a Kepler GPU, a Tesla GPU, or a
GPU of
any type).
-- Added support for non-consumable generic resources that are
limited, but
can be shared between jobs.
-- Added support for automatic job requeue policy based on exit value.
-- Refactor job_submit/lua interface. LUA FUNCTIONS NEED TO CHANGE! The
lua script no longer needs to explicitly load meta-tables, but
information
is available directly using names slurm.reservations, slurm.jobs,
slurm.log_info, etc. Also, the job_submit.lua script is reloaded when
updated without restarting the slurmctld daemon.
-- Eliminate native Cray specific port management. Native Cray systems
must
now use the MpiParams configuration parameter to specify ports to
be used
for communications. When upgrading Native Cray systems from version
14.03,
all running jobs should be killed and the switch_cray_state file (in
SaveStateLocation of the nodes where the slurmctld daemon runs) must be
explicitly deleted.