From the NVIDIA docs re: MPS:
On systems with a mix of Volta / pre-Volta GPUs, if the MPS server is set to enumerate any Volta GPU, it will discard all pre-Volta GPUs. In other words, the MPS server will either operate only on the Volta GPUs and expose Volta capabilities, or operate only on pre-Volta GPUs. I'd be curious what happens if you change the ordering (RTX then V100) in the gres.conf -- would the RTX work with MPS and the V100 would not? > On Nov 13, 2020, at 07:23 , Holger Badorreck <h.badorr...@lzh.de> wrote: > > Hello, > > I have a heterogeneous GPU Node with one V100 and two RTX cards. When I > request resources with --gres=mps:100, always the V100 is chosen, and jobs > are waiting if the V100 is completely allocated, while RTX cards are free. If > I use --gres=gpu:1, also the RTX cards are used. Is something wrong with the > configuration or is it another problem? > > The node configuration in slurm.conf: > NodeName=node1 CPUs=48 RealMemory=128530 Sockets=1 CoresPerSocket=24 > ThreadsPerCore=2 Gres=gpu:v100:1,gpu:rtx:2,mps:600 State=UNKNOWN > > gres.conf: > Name=gpu Type=v100 File=/dev/nvidia0 > Name=gpu Type=rtx File=/dev/nvidia1 > Name=gpu Type=rtx File=/dev/nvidia2 > Name=mps Count=200 File=/dev/nvidia0 > Name=mps Count=200 File=/dev/nvidia1 > Name=mps Count=200 File=/dev/nvidia2 > > Best regards, > Holger