Hi Monica,

On 01.10.2014 at 21:55, Monica Marathe wrote:
> Hey,
>
> It's my first time using SLURM and I'm getting the following error when
> I run slurmctld:
>
> [root@localhost ~]# slurmctld -D -vvvvvv
> slurmctld: debug2: No last_config_lite file (/tmp/last_config_lite) to
> recover
> slurmctld: debug4: unable to create link for /tmp/last_config_lite ->
> /tmp/last_config_lite.old: No such file or directory

I don't know exactly what that means, but this message is probably the
least of your concerns. I think it will disappear once SLURM has been
started successfully for the first time.

> slurmctld: error: Configured MailProg is invalid

Either make sure that /bin/mail exists, or set the MailProg parameter to
the mail program of your choice.

> slurmctld: slurmctld version 14.03.7 started on cluster cluster
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/crypto_munge.so
> slurmctld: Munge cryptographic signature plugin loaded
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/select_linear.so
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/preempt_none.so
> slurmctld: preempt/none loaded
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/checkpoint_none.so
> slurmctld: debug3: Success.
> slurmctld: Checkpoint plugin loaded: checkpoint/none
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/acct_gather_energy_none.so
> slurmctld: AcctGatherEnergy NONE plugin loaded
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/acct_gather_profile_none.so
> slurmctld: AcctGatherProfile NONE plugin loaded
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/acct_gather_infiniband_none.so
> slurmctld: AcctGatherInfiniband NONE plugin loaded
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/acct_gather_filesystem_none.so
> slurmctld: AcctGatherFilesystem NONE plugin loaded
> slurmctld: debug3: Success.
> slurmctld: debug2: No acct_gather.conf file
> (/usr/local/etc/acct_gather.conf)

If you want to use one of the acct_gather plugins, you have to provide a
configuration file for it. See
http://slurm.schedmd.com/acct_gather.conf.html

> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/jobacct_gather_none.so
> slurmctld: Job accounting gather NOT_INVOKED plugin loaded
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin
> /usr/local/lib/slurm/ext_sensors_none.so
> slurmctld: ExtSensors NONE plugin loaded
> slurmctld: debug3: Success.
> slurmctld: debug3: Trying to load plugin /usr/local/lib/slurm/switch_none.so
> slurmctld: switch NONE plugin loaded
> slurmctld: debug3: Success.
> slurmctld: error: this host (localhost) not valid controller
> (localhost.localdomain or (null))
>

Do not use "localhost" for the parameters BackupController, BackupAddr,
ControlAddr and ControlMachine; use the machine's real name instead. Also
make sure that this name can be resolved (either via DNS or via an entry
in /etc/hosts).

Also, have a look at http://slurm.schedmd.com/slurm.conf.html for the
parameters used in slurm.conf.

If you would like to start over with a new configuration file, you can
use the simple or the full version of the config file generator:

http://slurm.schedmd.com/configurator.easy.html
http://slurm.schedmd.com/configurator.html

Regards,
Uwe

> How can I fix these issues?
>
> Regards,
> Monica
>
> --
> - Monica Marathe
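
P.S. The host name advice above can be checked before editing slurm.conf. The following is only a minimal sketch, not part of SLURM: the helper names `is_usable_controller_name` and `resolve` and the set of loopback names are my own assumptions for illustration. It reports whether the machine's host name is something other than "localhost" and whether it can be looked up.

```python
import socket

# Loopback aliases that the advice above says to avoid for ControlMachine
# and friends (illustrative assumption, not a list taken from SLURM).
LOOPBACK_NAMES = {"localhost", "localhost.localdomain"}


def is_usable_controller_name(name):
    """Return True if `name` is non-empty and not a loopback alias."""
    return bool(name) and name.strip().lower() not in LOOPBACK_NAMES


def resolve(name):
    """Return the IPv4 address for `name`, or None if the lookup fails."""
    try:
        # Uses the system resolver, so both DNS and /etc/hosts are consulted.
        return socket.gethostbyname(name)
    except OSError:
        return None


if __name__ == "__main__":
    host = socket.gethostname()
    print("hostname:", host)
    print("usable as ControlMachine:", is_usable_controller_name(host))
    print("resolves to:", resolve(host))
```

If the second line prints False, set a real host name on the machine first. If the last line prints None, add the name to DNS or /etc/hosts before starting slurmctld again.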
