Hi,

We've been using jobacct_gather/cgroup for quite some time and haven't had any 
issues (I think). We do see some lengthy job cleanup times when there are lots 
of small jobs completing at once, maybe that is due to the cgroup plugin. At 
SLUG19 a slurm dev presented information that the jobacct_gather/cgroup plugin 
has quite the performance hit and that jobacct_gather/linux should be set 
instead. 

Can someone help me with the difference between these two gather plugins? If 
one were to switch to jobacct_gather/linux, what are the cons? Do you lose some 
job resource usage information?

Checking out the docs again on schedmd site regarding the jobacct_gather 
plugins I see:

cgroup — Gathers information from Linux cgroup infrastructure and adds this 
information to the standard rusage information also gathered for each job. 
(Experimental, not to be used in production.)

I don't believe I saw that before: "Experimental" ! Hah.

Thanks!

Best,
Chris
 
-- 
Christopher Coffey
High-Performance Computing
Northern Arizona University
928-523-1167
 
 

Reply via email to