I can’t run cuda job with slurm. I run the command slurmd -Dcvv and I get
the next errors.

 

slurmd -Dcvvv

slurmd: debug2: hwloc_topology_init

slurmd: debug2: hwloc_topology_load

slurmd: debug:  CPUs:4 Boards:1 Sockets:2 CoresPerSocket:2 ThreadsPerCore:1

slurmd: error: You are using cons_res or gang scheduling with Fastschedule=0
and node configuration differs from hardware.  The node configuration used
will be what is in the slurm.conf because of the bitmaps the slurmctld must
create before the slurmd registers.

   CPUs=1:4(hw) Boards=1:1(hw) SocketsPerBoard=1:2(hw)
CoresPerSocket=1:2(hw) ThreadsPerCore=1:1(hw)

slurmd: Message aggregation disabled

slurmd: debug:  init: Gres GPU plugin loaded

slurmd: error: can't stat gres.conf file /etc/slurm/gres.conf, assuming zero
resource counts

slurmd: topology NONE plugin loaded

slurmd: route default plugin loaded

slurmd: CPU frequency setting not configured for this node

slurmd: debug:  Resource spec: No specialized cores configured by default on
this node

slurmd: Resource spec: Reserved system memory limit not configured for this
node

slurmd: debug:  task affinity plugin loaded

slurmd: debug:  auth plugin for Munge (http://code.google.com/p/munge/)
loaded

slurmd: debug:  spank: opening plugin stack /etc/slurm/plugstack.conf

slurmd: Munge cryptographic signature plugin loaded

slurmd: Warning: Core limit is only 0 KB

slurmd: slurmd version 15.08.2 started

slurmd: debug:  Job accounting gather LINUX plugin loaded

slurmd: debug:  job_container none plugin loaded

slurmd: debug:  switch NONE plugin loaded

slurmd: slurmd started on Thu, 05 Nov 2015 14:14:02 -0500

slurmd: CPUs=1 Boards=1 Sockets=1 Cores=1 Threads=1 Memory=3015
TmpDisk=39803 Uptime=19733 CPUSpecList=(null)

 

Any idea?

best regards

Reply via email to