This is just a guess, but is your topology information below in a file
topology.conf (NOT part of slurm.conf)?
Quoting Fred Smith <[email protected]>:
I am using slurm latest version 14.03.8 on CentOS 6.2. It's compiled
following the instructions in the installation guide for rpmbuild
and rpm install. The plugin rpm is installed, nevertheless slurm
does not start when using the topology sample from the Topology
Guide (the node names are also configured in the slurm.conf NodeName):
SwitchName=s0 Nodes=node-0-[4-5]
SwitchName=s1 Nodes=node-0-[6-7]
SwitchName=s2 Switches=s[0-1]
slurm is configured with TopologyPlugin=topology/tree .
However the start fails with parsing errors:
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 1:
"SwitchName=s0 Nodes=hpc-0-[4-5] "
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 2:
"SwitchName=s1 Nodes=hpc-0-[6-7] "
scontrol: error: Parsing error at unrecognized key: SwitchName
scontrol: error: Parse error in file /etc/slurm/topology.conf line 3:
"SwitchName=s2 Switches=s[0-1]"
scontrol: error: "Include" failed in file /etc/slurm/slurm.conf line 85
scontrol: fatal: Unable to process configuration file
the plugin is installed: topology_tree.so in the default location
and does indeed include
strings : SwitchName LinkSpeed Nodes Switches. The slurm start
script does include the default LIBDIR setting.
Any hint what is missing to get the topology to work?
--
Morris "Moe" Jette
CTO, SchedMD LLC