[slurm-dev] Re: Tool for automatic generation of Node lists and specs?

2017-05-25 Thread Gennaro Oliva
Hi Mehmet, On Thu, May 25, 2017 at 03:19:08PM -0700, Belgin, Mehmet wrote: > # slurmd -C > NodeName=devel-comp1 CPUs=2 Boards=1 SocketsPerBoard=2 CoresPerSocket=1 > ThreadsPerCore=1 RealMemory=5845 TmpDisk=16260 > > But slurm doesn’t like it when copied directly into the conf: > > # service sl

[slurm-dev] Tool for automatic generation of Node lists and specs?

2017-05-25 Thread Belgin, Mehmet
Greetings! I’ve spent only a few days of experience playing with slurm, so apologies if I’m missing something obvious, but I couldn’t find a good way to query nodes automatically to generate NodeName lines in detail. I saw the "slurm -C” option, which comes *really* close, but some of the field

[slurm-dev] Requested node configuration is not available

2017-05-25 Thread Yong Qin
Hi, We encounter an annoying issue here. Basically within the same partition we have two types of nodes, one with 24 cores and the other one with 28 cores, so we use node feature to distinguish them, savio2_c24 and savio2_c28. Slurm is reporting the proper configuration and feature from all angles

[slurm-dev] Re: How to cleanup mysql db old records?

2017-05-25 Thread Douglas Jacobsen
Regarding the "more allocated time than is possible" messages, I'd suggest checking for runaway jobs: sacctmgr show runawayjobs You might want to look at the records a bit before agreeing to let it fix them automatically. If that doesn't find anything, there might be some nodes incorrectly down

[slurm-dev] How to cleanup mysql db old records?

2017-05-25 Thread Balaji Deivam
Hi, I am trying to cleanup the old records in the mysql DB which is present from Oct 2015. I used below settings in the slurmdbd.conf file, but didn't get cleaned up any records last night. May be due to huge no:of records its not able to purge? How can we handle this situation? We want to keep

[slurm-dev] Re: Compute nodes going to drained/draining state

2017-05-25 Thread Chris Samuel
On Thursday, 25 May 2017 6:51:26 PM AEST Baker D. J. wrote: > Thank you for your response to my email. I've taken a look at one of the > compute nodes that has been drained by the SLURM system -- please see > below. If appears to suggest the node was drained due to a job failing > (running out o

[slurm-dev] Re: Compute nodes going to drained/draining state

2017-05-25 Thread Baker D . J .
Hello, Thank you for your response to my email. I've taken a look at one of the compute nodes that has been drained by the SLURM system -- please see below. If appears to suggest the node was drained due to a job failing (running out of walltime perhaps?). This is very odd since I don't have