Hi Rob,
Thank you for helping me with this!
I've rebuilt our environment in slurm 22.05.9 on a VM and verified that
slurm would prohibit a job run on a unpermitted partition
JOBID PARTITION NAME USER ST TIME NODES
NODELIST(REASON)
1 test test
Wow, snazzy!
Looks very good. My compliments.
Brian Andrus
On 3/12/2024 11:24 AM, Victoria Hobson via slurm-users wrote:
Our website has gone through some much needed change and we'd love for
you to explore it!
The new SchedMD.com is equipped with the latest information about
Slurm, your favo
Just wanted to share some slurm utilities that we've written at Harvard
FASRC that maybe useful to the community.
seff-account: https://github.com/fasrc/seff-account Creates job
statistics summaries for users and accounts similar to what seff and
seff-array does.
showq: https://github.com/f
I really struggle to see the point of k8s for large computational workloads.
It adds a lot of complexity, and I don’t see what benefit it brings.
If you really want to run containerised workloads as batch jobs on AWS, for
example, then it’s a great deal simpler to do so using AWS Batch and ECS
Hello,
I haven't played with slurm in k8s but I did attend this talk :
https://fosdem.org/2024/schedule/event/fosdem-2024-2590-kubernetes-and-hpc-bare-metal-bros/
Which shows at least someone was able to do so and maybe it'll be worth
to talk to her about it. I wanted to ask her for the cod
Hi Alan,
Your topic is indeed my PhD thesis (defended late november). It consists
in building autoscaling HPC infrastructure in the cloud (in a compute
node provisioning point of view). In this work I show that kubernetes
default controllers are not well designed for autoscaling containerized
I'm a little late to this party but would love to establish contact with others
using slurm in Kubernetes.
I recently joined a research institute in Vienna (IIASA) and I'm getting to
grips with slurm and Kubernetes (my previous role was data engineering /
fintech). My current setup sounds like