Re: [slurm-users] Slurm 19.05 X11-forwarding

2020-02-28 Thread Pär Lundö
Hi all, First off, thank you all for all of your quick replies and suggestions on how to solve this problem of mine. With some additional help from Tina Friedrich I did a test proposed by her: First step: ”ssh -X ”. Second step: ”ssh -X localhost”. Third step: ”srun —x11 ” (It should read a

[slurm-users] Hybrid compiling options

2020-02-28 Thread Brian Andrus
All, Wanted to reach out for input on how folks compile slurm when you have a hybrid cluster. Scenario: you have 4 node types: A) CPU only B) GPU Only C) CPU+IB D) GPU+IB So, you can compile slurm with/without IB support and/or with/without GPU support. Including either option creates a

Re: [slurm-users] Nodelist dependent environment setup ?

2020-02-28 Thread Sajid Ali
Hi Ole, Thanks a lot for sharing the resource! Our biggest concern is the case where a user asks for 2 nodes and one of those is a Cascade Lake node and the other one is a Haswell node. At this point, the environment have modules that work on both, hence my preference for reading the slurm

[slurm-users] Nodelist dependent environment setup ?

2020-02-28 Thread Sajid Ali
Hi Slurm-developers/users, If I install multiple versions of a software library, each optimized for a different partition based on CPU architecture, how would I automatically load the right version of the software based on the nodes allocated to the job? Ideally I'd want to store the modules for each cpu

[slurm-users] Question about determining pre-empted jobs

2020-02-28 Thread Jeffrey R. Lang
I need your help. We have had a request to generate a report showing the number of jobs by date showing pre-empted jobs. We used sacct to try to gather the data but we only found a few jobs with the state "PREEMPTED". Scanning the slurmd logs we find there are a lot of jobs that show

Re: [slurm-users] Slurm 19.05 X11-forwarding

2020-02-28 Thread Pär Lundö
Hi Marcus, You are correct, but it doesn't show anything regarding the X11-forwarding. Thank you for your input! Best regards, Pär Lundö From: "slurm-users" Sent: 28 feb. 2020 15:57 To: "slurm-users@lists.schedmd.com" Subject: Re: [slurm-users] Slurm 19.05

Re: [slurm-users] How to show state of CLOUD nodes

2020-02-28 Thread Carter, Allan
Thanks, that was very useful. The key takeaways for me are: Set “PrivateData=cloud”. Documentation states that the default is that everything’s public and that those options make things private. Apparently except for this case, which allows regular users to see nodes that are powered down. Set

Re: [slurm-users] Slurm 19.05 X11-forwarding

2020-02-28 Thread Marcus Wagner
Hi Pär, yes, you can use -v or e.g. -vvv together with srun. I'm not sure, though, if that shows anything X11-related, but you might try. Best Marcus On 2/28/20 3:45 PM, Pär Lundö wrote: Hi everyone Thank you for your support. I’ve done a few changes and done some further testing but it has

Re: [slurm-users] Slurm 19.05 X11-forwarding

2020-02-28 Thread Pär Lundö
Hi everyone Thank you for your support. I’ve done a few changes and done some further testing but it has not solved my problem. Regardless of the settings for the sshd I can’t get it to work. I am able to use SSH to the node directly with the ”-X” argument and get the X11 forwarding to work. Is

[slurm-users] slurm 18.08.3 on CentOS 6.18: error: _slurm_cgroup_destroy

2020-02-28 Thread AMU
Hello, on an old CentOS 6.10 machine, I've installed slurm 18.08.3 from sources, and tried to set up a simple configuration (attached slurm.conf). After starting slurmctld and slurmd, sinfo shows everything okay, but at the first submission with sbatch, I got errors and the node becomes

Re: [slurm-users] Problem with configuration CPU/GPU partitions

2020-02-28 Thread Renfro, Michael
When I made similar queues, and only wanted my GPU jobs to use up to 8 cores per GPU, I set Cores=0-7 and 8-15 for each of the two GPU devices in gres.conf. Have you tried reducing those values to Cores=0 and Cores=20? > On Feb 27, 2020, at 9:51 PM, Pavel Vashchenkov wrote: > > External Email

Re: [slurm-users] How to show state of CLOUD nodes

2020-02-28 Thread Kirill 'kkm' Katsnelson
I'm running clusters entirely in Google Cloud. I'm not sure I'm understanding the issue--do the nodes disappear from view entirely only when they fail to power up by ResumeTimeout? Failures of this kind are happening in GCE when resources are momentarily unavailable, but the nodes are still there,

Re: [slurm-users] Question about SacctMgr....

2020-02-28 Thread Bjørn-Helge Mevik
Ole Holm Nielsen writes: > You may use the (undocumented) format=... option to select only the A while ago, after meticulous study of the man page, I discovered that the format option is not actually undocumented, it is just very well hidden. :) All that "man sacctmgr" says about it is GLOBAL

Re: [slurm-users] Question about SacctMgr....

2020-02-28 Thread Ole Holm Nielsen
sacctmgr show association You may use the (undocumented) format=... option to select only the columns you want, for example: sacctmgr show assoc format=user,account,qos Usage of the format option is only given in the Examples section of the sacctmgr page
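As a rough illustration of the format=... suggestion above, the following Python sketch groups the output of `sacctmgr show assoc format=user,account,qos` by user. It assumes the command is run with sacctmgr's `--noheader` and `--parsable2` options so that fields are separated by `|`; the function names and the sample users and accounts (`alice`, `bob`, `physics`, `chem`) are hypothetical and not taken from the thread.

```python
import subprocess
from collections import defaultdict


def parse_assoc(text):
    """Map each user to a sorted list of the accounts it is associated with.

    Expects '|'-separated lines in the order user|account|qos, as produced
    by `sacctmgr --noheader --parsable2 show assoc format=user,account,qos`.
    """
    accounts = defaultdict(set)
    for line in text.splitlines():
        if not line.strip():
            continue
        fields = line.split("|")
        user, account = fields[0], fields[1]
        if user:  # rows describing an account itself have an empty user field
            accounts[user].add(account)
    return {user: sorted(accts) for user, accts in accounts.items()}


def fetch_assoc():
    """Run sacctmgr on a host with Slurm accounting and return its raw output."""
    cmd = ["sacctmgr", "--noheader", "--parsable2",
           "show", "assoc", "format=user,account,qos"]
    return subprocess.run(cmd, check=True, capture_output=True, text=True).stdout


# Hypothetical sample output, used here so the sketch runs without Slurm.
SAMPLE = (
    "|root|normal\n"
    "root|root|normal\n"
    "|physics|normal\n"
    "alice|physics|normal\n"
    "alice|chem|normal,long\n"
    "bob|chem|normal\n"
)

if __name__ == "__main__":
    for user, accts in sorted(parse_assoc(SAMPLE).items()):
        print(f"{user}: {', '.join(accts)}")
```

On a real cluster, `parse_assoc(fetch_assoc())` would replace the sample data; the exact columns available depend on the site's accounting setup.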

Re: [slurm-users] Question about SacctMgr....

2020-02-28 Thread Marcus Boden
Hi, you're looking for 'associations' between users, accounts and their limits. Try `sacctmgr show assoc [tree]` Best, Marcus On 20-02-28 09:38, Matthias Krawutschke wrote: > Dear Slurm-User, > > > > I have a simple question about User and Account – Management on SLURM. > > > > How can I

[slurm-users] Question about SacctMgr....

2020-02-28 Thread Matthias Krawutschke
Dear Slurm-User, I have a simple question about User and Account – Management on SLURM. How can I find/print out which User is associated with which account? I can list accounts and Users, but not in combination. I have not found this in the documentation. Best regards….