Hi: please mention the below output.

cat /etc/redhat-release

OR

cat /etc/lsb_release

Also, please let us know the detailed log reports that is probably available at /var/log/slurm/slurmctld.log

status of:
ps -ef | grep slurmctld

Thanks & Regards,
Sudeep Narayan Banerjee
System Analyst | Scientist B
Information System Technology Facility
Academic Block 5 | Room 110
Indian Institute of Technology Gandhinagar
Palaj, Gujarat 382355 INDIA

On 11/06/20 5:54 pm, navin srivastava wrote:
Hi Team,

when i am trying to start the slurmd process i am getting the below error.

2020-06-11T13:11:58.652711+02:00 oled3 systemd[1]: Starting Slurm node daemon... 2020-06-11T13:13:28.683840+02:00 oled3 systemd[1]: slurmd.service: Start operation timed out. Terminating. 2020-06-11T13:13:28.684479+02:00 oled3 systemd[1]: Failed to start Slurm node daemon. 2020-06-11T13:13:28.684759+02:00 oled3 systemd[1]: slurmd.service: Unit entered failed state. 2020-06-11T13:13:28.684917+02:00 oled3 systemd[1]: slurmd.service: Failed with result 'timeout'. 2020-06-11T13:15:01.437172+02:00 oled3 cron[8094]: pam_unix(crond:session): session opened for user root by (uid=0)

Slurm version is 17.11.8

The server and slurm is runningĀ from long time and we have not made any changesĀ but today when i am starting it is giving this error message.
Any idea what could be wrong here.

Regards
Navin.




Reply via email to