I am trying to change the JobName of a running task, like this
scontrol update StepId=24236.0 JobName='my new name'
and getting errors in the log file like this:
Update of this parameter is not supported: StepId=24236.0
I get the same error when I try:
scontrol update
Best,
Feng
On Wed, Sep 20, 2023 at 7:29 AM Wagner, Marcus
wrote:
> Even after rebooting, sometimes nodes are stuck because of "completing
> jobs".
>
> What helps then is to set the node down and resume it afterwards:
>
> scontrol update nodename= state=drain reason=stuck; scontrol
> update
Even after rebooting, sometimes nodes are stuck because of "completing
jobs".
What helps then is to set the node down and resume it afterwards:
scontrol update nodename= state=drain reason=stuck; scontrol
update nodename= state=resume
Best
Marcus
Am 20.09.2023 um 09:11 schrieb Ole Holm
On 9/20/23 01:39, Feng Zhang wrote:
Restarting the slurmd dameon of the compute node should work, if the
node is still online and normal.
Probably not. If the filesystem used by the job is hung, the node must
probably be rebooted, and the filesystem must be checked.
/Ole
On Tue, Sep 19,