[slurm-dev] Re: Disabling automatic optimization

2014-08-21 Thread Vsevolod Nikonorov
Hello again, if I run a production cluster with automatic optimization disabled, am I losing some performance? Thanks in advance! Nathan Harper писал 2014-07-29 12:39: Hi, Are you building the RPMs?  I had the same problem (after building and installing the RPMs) a few weeks ago, and it w

[slurm-dev] Re: Intel MPI Performance inconsistency (and workaround)

2014-08-21 Thread Christopher Samuel
On 22/08/14 04:43, Jesse Stroik wrote: > We recently noticed sporadic performance inconsistencies on one of our > clusters. What distro is this? Are you using cgroups? cheers, Chris -- Christopher SamuelSenior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiat

[slurm-dev] Re: Intel MPI Performance inconsistency (and workaround)

2014-08-21 Thread Jesse Stroik
Yes, but we aren't specifying it for all of these jobs. In the config we have: --- TaskPlugin=task/affinity TaskPluginParam=Sched SelectTypeParameters=CR_CPU_Memory,CR_CORE_DEFAULT_DIST_BLOCK --- And we typically suggest "--cpu_bind=core --distribution=block:block" for srun

[slurm-dev] Re: Intel MPI Performance inconsistency (and workaround)

2014-08-21 Thread Kilian Cavalotti
Hi Jesse, Just a shot in the dark, but do you use task affinity or CPU binding? Cheers, -- Kilian

[slurm-dev] Re: Account / partition association on heterogeneous clusters

2014-08-21 Thread Jesse Stroik
We ended up working around our needs by writing a program that provided users with the appropriate settings. It may be something to consider for future releases of slurm to be able to automatically use any available and valid account given a user-partition request, or allow administrators to

[slurm-dev] Intel MPI Performance inconsistency (and workaround)

2014-08-21 Thread Jesse Stroik
Slurmites, We recently noticed sporadic performance inconsistencies on one of our clusters. We discovered that if we restarted slurmd in an interactive shell, we observed correct performance. To track down the cause, we ran: (1) single-node linpack (2) dual node mp_linpack (3) mpptest On a

[slurm-dev] Re: Storing the job submission script in the accounting database

2014-08-21 Thread Marcin Stolarek
W dniu czwartek, 21 sierpnia 2014 Antony Cleave napisał(a): > > Is it possible to store the job submission script and the environment > variables passed to it in the account database or log this data > automatically to /path/to/spylog/.log files in SLURM? > > I'm interested in analysing what th

[slurm-dev] Storing the job submission script in the accounting database

2014-08-21 Thread Antony Cleave
Is it possible to store the job submission script and the environment variables passed to it in the account database or log this data automatically to /path/to/spylog/.log files in SLURM? I'm interested in analysing what the cluster is used for over time and this would be a good start in w

[slurm-dev] Re: Error: Unable to contact slurm controller

2014-08-21 Thread Gerry Creager - NOAA Affiliate
No, slurmctld isn't running. Now. It was when I started, but I suspect I made at least one mod too many to slurm.conf. When I try to start slurmctld, I get these in slurmctld.log: [2014-08-21T09:30:09.626] debug2: No ApbasilTimeout configured (65534) [2014-08-21T09:30:09.630] debug2: No ApbasilTime