[slurm-users] SLURM , maximum scalable instance is which one

2023-10-29 Thread John Joseph
Dear All, 
I would like to know what the largest scaled-up instance of SLURM is so far.
On which website can I find information about the most highly scaled SLURM
instance and other popular setups using SLURM?
Thanks 
Joseph John 


[slurm-users] Selecting only a subset of GPU's from all available GPU's

2023-10-29 Thread Minulakshmi S
I'm submitting jobs to a cluster via the SLURM scheduler, and let's say I have
access to 8 GPUs in the same node of my cluster. They are GPUs of type
A, B, C, D, E, F, G, and H. I would like to submit a job that requests the use
of a GPU of type A or B or C but NOT of type D/E/F/G/H. So I need some type of
OR logic with the --gres flag.

E.g., when I request a GPU of type A, I can run sbatch --gres=gpu:TypeA:1. I
need to pass a subset of GPU types and let Slurm schedule the job on one of
the GPUs from this allowed list.



*Regards*

*Minulakshmi S*


[slurm-users] Couldn't find the specified plugin name for auth/munge looking at all files

2023-10-29 Thread C
Hello,

I need to use SLURM for a project. I installed it following this quick start
guide ( https://ibmimaster.cs.uni-tuebingen.de/quickstart_admin.html ). First I
just want to run it on one cluster.

- I did steps 1 to 7 and created the slurm user with my slurm binaries as its
home dir
- created the necessary dirs for log and spool files
- created slurm.conf with the easy configurator
- munge was also installed via the apt package manager (using Ubuntu 20)

I do not know exactly what step 8 means or what the library_location is, so
I skipped this one.

When running slurmd or slurmctld, I get the following error:
root@me:# slurmd
slurmd: error: Couldn't find the specified plugin name for auth/munge looking at all files
slurmd: error: cannot find auth plugin for auth/munge
slurmd: error: cannot create auth context for auth/munge
slurmd: fatal: failed to initialize auth plugin

root@me:# slurmctld
slurmctld: error: Couldn't find the specified plugin name for auth/munge looking at all files
slurmctld: error: cannot find auth plugin for auth/munge
slurmctld: error: cannot create auth context for auth/munge
slurmctld: fatal: failed to initialize auth plugin

Any idea how I could fix this? I'd be very thankful.

Kind regards
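
One check that may help narrow this down, sketched under assumptions the
thread does not confirm: the error usually means that no auth_munge.so file
exists in Slurm's plugin directory. A common cause, though not verified here,
is that Slurm was configured before the MUNGE development headers (libmunge-dev
on Ubuntu) were installed, so the plugin was never built. The function name and
the default path below are illustrative; pass the PluginDir from your own
slurm.conf instead.

```shell
# Report whether a Slurm plugin directory contains the munge auth plugin.
# ASSUMPTION: /usr/local/lib/slurm is only the usual location for a source
# build with the default prefix; your PluginDir may differ.
has_auth_munge() {
    plugin_dir="${1:-/usr/local/lib/slurm}"
    if [ -f "${plugin_dir}/auth_munge.so" ]; then
        echo "found: ${plugin_dir}/auth_munge.so"
        return 0
    fi
    echo "missing: ${plugin_dir}/auth_munge.so"
    return 1
}
```

Calling has_auth_munge with no argument checks the assumed default path. If the
file is reported missing, installing the MUNGE headers and then re-running
configure, make, and make install for Slurm is the step most likely worth
trying.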


[slurm-users] Sinfo options not working in SLURM 23.11

2023-10-29 Thread Deepak J
Hello,

I am working with SLURM version 23.11.

The sinfo options are not working properly (-a, --all, -o, -m, etc.).



E.g., sinfo gives me the output below:

45637@inv456748703$ sinfo
PARTITION  AVAIL  TIMELIMIT  NODES  STATE  NODELIST
FPGA*      up     infinite   1      idle   FPGA01



Also, sinfo --help gives the same result:

45637@inv456748703$ sinfo ---help
PARTITION  AVAIL  TIMELIMIT  NODES  STATE  NODELIST
FPGA*      up     infinite   1      idle   FPGA01



Any pointers will help.


Regards,

DJ
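
One thing that might be worth ruling out (an assumption, nothing in this thread
confirms it) is that sinfo resolves to an alias, shell function, or wrapper
script that drops its arguments, which would explain identical output for every
option. The helper name below is hypothetical; it only wraps the standard
command -v builtin.

```shell
# Show what the shell will actually run for a given command name.
# command -v prints an alias definition, a function name, or a file path.
describe_command() {
    resolved="$(command -v "$1")" || { echo "$1: not found"; return 1; }
    echo "$1 resolves to: ${resolved}"
}
```

Running describe_command sinfo in the affected shell shows whether the name
points at the Slurm binary or at something else.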


Re: [slurm-users] Sinfo options not working in SLURM 23.11

2023-10-29 Thread Loris Bennett
Hello Deepak,

Deepak J  writes:

> Hello ,
>
> I am working on SLURM 23.11 version.
>
> sinfo  option commands are not working properly  (-a , --all , -o , -m etc)
>
> e.g : sinfo is giving me below
>
> 45637@inv456748703$sinfo
> PARTITION  AVAIL  TIMELIMIT  NODES  STATE  NODELIST
> FPGA*      up     infinite   1      idle   FPGA01
>
> also sinfo --help gives same result
>
> 45637@inv456748703$ sinfo ---help
> PARTITION  AVAIL  TIMELIMIT  NODES  STATE  NODELIST
> FPGA*      up     infinite   1      idle   FPGA01
>
> Any pointers will help.

Why do you think that the output above is wrong?

Cheers,

Loris

> Regards,
>
> DJ
>
-- 
Dr. Loris Bennett (Herr/Mr)
ZEDAT, Freie Universität Berlin