What's the nsswitch like on the node? from the node, can you do: # getent passwd | grep <ldap user>
On 01/05/2016 08:31 PM, Koji Tanaka wrote:
Hello Slurm Community, I get following errors when I run a job as a LDAP user. However, as a local user, everything works fine. $ srun -N1 hostname srun: error: Task launch for 23.3 failed on node e1c001: User not found on host srun: error: Application launch failed: User not found on host srun: Job step aborted: Waiting up to 32 seconds for job step to finish. e1c001 And, there's following logs on slurmctld.log. [2016-01-05T12:45:41.182] job_step_signal step 23.2 not found [2016-01-05T12:45:41.198] step_partial_comp: StepID=23.2 invalid What could cause this error? I would appreciate any comments and suggestions. Thank you, Koji