Hello again,
if I run a production cluster with automatic optimization disabled, am I
losing some performance?
Thanks in advance!
Nathan Harper писал 2014-07-29 12:39:
Hi,
Are you building the RPMs? I had the same problem (after building
and installing the RPMs) a few weeks ago, and it w
On 22/08/14 04:43, Jesse Stroik wrote:
> We recently noticed sporadic performance inconsistencies on one of our
> clusters.
What distro is this? Are you using cgroups?
cheers,
Chris
--
Christopher SamuelSenior Systems Administrator
VLSCI - Victorian Life Sciences Computation Initiat
Yes, but we aren't specifying it for all of these jobs. In the config we
have:
---
TaskPlugin=task/affinity
TaskPluginParam=Sched
SelectTypeParameters=CR_CPU_Memory,CR_CORE_DEFAULT_DIST_BLOCK
---
And we typically suggest "--cpu_bind=core --distribution=block:block"
for srun
Hi Jesse,
Just a shot in the dark, but do you use task affinity or CPU binding?
Cheers,
--
Kilian
We ended up working around our needs by writing a program that provided
users with the appropriate settings.
It may be something to consider for future releases of slurm to be able
to automatically use any available and valid account given a
user-partition request, or allow administrators to
Slurmites,
We recently noticed sporadic performance inconsistencies on one of our
clusters. We discovered that if we restarted slurmd in an interactive
shell, we observed correct performance.
To track down the cause, we ran:
(1) single-node linpack
(2) dual node mp_linpack
(3) mpptest
On a
W dniu czwartek, 21 sierpnia 2014 Antony Cleave
napisał(a):
>
> Is it possible to store the job submission script and the environment
> variables passed to it in the account database or log this data
> automatically to /path/to/spylog/.log files in SLURM?
>
> I'm interested in analysing what th
Is it possible to store the job submission script and the environment
variables passed to it in the account database or log this data
automatically to /path/to/spylog/.log files in SLURM?
I'm interested in analysing what the cluster is used for over time and
this would be a good start in w
No, slurmctld isn't running. Now. It was when I started, but I suspect I
made at least one mod too many to slurm.conf. When I try to start
slurmctld, I get these in slurmctld.log:
[2014-08-21T09:30:09.626] debug2: No ApbasilTimeout configured (65534)
[2014-08-21T09:30:09.630] debug2: No ApbasilTime