Re: [slurm-users] Rolling upgrade of compute nodes

2022-05-29 Thread Stephan Roth
Hi Byron, If you have the means to set up a test environment to try the upgrade first, I recommend to do it. The upgrade from 19.05 to 20.11 worked for two clusters I maintain with a similar NFS based setup, except we keep the Slurm configuration separated from the Slurm software accessible

Re: [slurm-users] Rolling upgrade of compute nodes

2022-05-29 Thread Christopher Samuel
On 5/29/22 3:09 pm, byron wrote:  This is the first time I've done an upgrade of slurm and I had been hoping to do a rolling upgrade as opposed to waiting for all the jobs to finish on all the compute nodes and then switching across but I dont see how I can do it with this setup.  Does any on

[slurm-users] Regarding acct_gather.conf

2022-05-29 Thread pravin pawar
Hello all, we are trying to collect the lustre data. following is the configuration of acct_gatger.conf:- cat /etc/slurm/acct_gather.conf ProfileHDF5Dir=/home/slurm-hdf5 ProfileHDF5Default=lustre slurm.conf entries:- cat /etc/slurm/slurm.conf | grep lustre JobAcctGatherFrequency=energy=5,lustr

[slurm-users] Rolling upgrade of compute nodes

2022-05-29 Thread byron
Hi I'm currently doing an upgrade from 19.05 to 20.11. All of our compute nodes have the same install of slurm NFS mounted. The system has been setup so that all the start scripts and configuration files point to the default installation which is a soft link to the most recent installation of sl