[slurm-dev] SlurmUser in slurm.conf
Hello, I have a dedicated user slurm created for the installation purpose. I am using munge for the authtype. Bothe slurm user and munge users work well together. My cluster is operational with these users. I can monitor control and run jobs on the entire cluster. At the same time any other user is not able to schedule any job, I see the following error: srun: error: Task launch for 26.0 failed on node bolsvc01: Invalid job credential srun: error: Application launch failed: Invalid job credential srun: Job step aborted: Waiting up to 2 seconds for job step to finish. srun: error: Timed out waiting for job step to complete Any help and direction would help me move past this issue. Cheers Sanjay
[slurm-dev] LEVEL_BASED prioritization method
Levi Morrison and I have developed a new Slurm prioritization method that we call LEVEL_BASED. It prioritizes users such that users in an under-served account will always have a higher fair share factor than users in an over-served account. It works very well for us, though I understand that many sites have different needs. If you're interested, check out the documentation at https://fsl.byu.edu/documentation/slurm/level_based.php or try it out at https://github.com/BYUHPC/slurm in the level_based branch. If you want some of our problems with existing algorithms (as they apply to our use case), see http://tech.ryancox.net/2014/06/problems-with-slurm-prioritization.html. -- Ryan Cox Operations Director Fulton Supercomputing Lab Brigham Young University