I can't run a CUDA job with Slurm. I ran the command slurmd -Dcvvv and got the following errors:
slurmd -Dcvvv
slurmd: debug2: hwloc_topology_init
slurmd: debug2: hwloc_topology_load
slurmd: debug: CPUs:4 Boards:1 Sockets:2 CoresPerSocket:2 ThreadsPerCore:1
slurmd: error: You are using cons_res or gang scheduling with Fastschedule=0 and node configuration differs from hardware. The node configuration used will be what is in the slurm.conf because of the bitmaps the slurmctld must create before the slurmd registers.
    CPUs=1:4(hw) Boards=1:1(hw) SocketsPerBoard=1:2(hw) CoresPerSocket=1:2(hw) ThreadsPerCore=1:1(hw)
slurmd: Message aggregation disabled
slurmd: debug: init: Gres GPU plugin loaded
slurmd: error: can't stat gres.conf file /etc/slurm/gres.conf, assuming zero resource counts
slurmd: topology NONE plugin loaded
slurmd: route default plugin loaded
slurmd: CPU frequency setting not configured for this node
slurmd: debug: Resource spec: No specialized cores configured by default on this node
slurmd: Resource spec: Reserved system memory limit not configured for this node
slurmd: debug: task affinity plugin loaded
slurmd: debug: auth plugin for Munge (http://code.google.com/p/munge/) loaded
slurmd: debug: spank: opening plugin stack /etc/slurm/plugstack.conf
slurmd: Munge cryptographic signature plugin loaded
slurmd: Warning: Core limit is only 0 KB
slurmd: slurmd version 15.08.2 started
slurmd: debug: Job accounting gather LINUX plugin loaded
slurmd: debug: job_container none plugin loaded
slurmd: debug: switch NONE plugin loaded
slurmd: slurmd started on Thu, 05 Nov 2015 14:14:02 -0500
slurmd: CPUs=1 Boards=1 Sockets=1 Cores=1 Threads=1 Memory=3015 TmpDisk=39803 Uptime=19733 CPUSpecList=(null)

Any idea?

Best regards
