[slurm-dev] SlurmUser in slurm.conf

2014-06-20 Thread Sanjay Tiwari (stiwari)
Hello,

I have a dedicated user slurm created for the installation purpose.

I am using munge for the authtype. Bothe slurm user and munge users work well 
together.

My cluster is operational with these users. I can monitor control and run jobs 
on the entire cluster.

At the same time any other user is not able to schedule any job, I see the 
following error:

srun: error: Task launch for 26.0 failed on node bolsvc01: Invalid job 
credential
srun: error: Application launch failed: Invalid job credential
srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
srun: error: Timed out waiting for job step to complete

Any help and direction would help me move past this issue.

Cheers
Sanjay



[slurm-dev] LEVEL_BASED prioritization method

2014-06-20 Thread Ryan Cox


Levi Morrison and I have developed a new Slurm prioritization method 
that we call LEVEL_BASED.  It prioritizes users such that users in an 
under-served account will always have a higher fair share factor than 
users in an over-served account.


It works very well for us, though I understand that many sites have 
different needs.  If you're interested, check out the documentation at 
https://fsl.byu.edu/documentation/slurm/level_based.php or try it out at 
https://github.com/BYUHPC/slurm in the level_based branch.


If you want some of our problems with existing algorithms (as they apply 
to our use case), see 
http://tech.ryancox.net/2014/06/problems-with-slurm-prioritization.html.


--
Ryan Cox
Operations Director
Fulton Supercomputing Lab
Brigham Young University