and
Active Directory for 6.5 years without issue. We’ve always added users
to sacctmgr so that we could track usage by research group or class,
so we never used a default account for all users.
*From: *Richard Chang via slurm-users
*Date: *Saturday, February 3, 2024 at 11:41 PM
*To: *slurm-us
Hi,
I am a little new to this, so please pardon my ignorance.
I have configured slurm in my cluster and it works fine with local
users. But I am not able to get it working with LDAP/SSSD authentication.
User logins using ssh are working fine. An LDAP user can login to the
login, slurmctld
data. There are many ways to do that, but those designs fall under
MariaDB and not Slurm.
Brian Andrus
On 11/1/2022 6:49 PM, Richard Chang wrote:
Does it mean it is best to use a single slurmdbd host in my case?
My primary slurmctld is the backup slurmdbd host, and my worry is if
the primary slur
[1] https://slurm.schedmd.com/publications.html
On 10/31/2022 9:20 PM, Richard Chang wrote:
Hi,
Just for my info, I would like to know what happens when SlurmDBD
loses connection to the backend Database, for ex, MariaDB.
Does it cache the accounting info and keep them till the DB comes
back up ?, or doe
Hello Greg,
I have a two node set up. node1 is primary slurmctld + backup slurmdbd
and node2 is primary slurmdbd + backup slurmctld and mysql database host.
My concern is if node 2 goes down, then the backup slurmdbd will take
over, then what will happen ?
I have read that slurmctld can
Hi,
Just for my info, I would like to know what happens when SlurmDBD loses
connection to the backend Database, for ex, MariaDB.
Does it cache the accounting info and keep them till the DB comes back
up ?, or does it panic and shut down ?
Thank you,
RC.
This is 21.08
Than you,
RC
On 10/31/2022 11:05 AM, Chris Samuel wrote:
On 27/10/22 11:30 pm, Richard Chang wrote:
Yes, the system is a HPE Cray EX, and I am trying to use
switch/hpe_slingshot.
Which version of Slurm are you using Richard?
All the best,
Chris
Hi,
I have two dedicated nodes for slurm, node1 and node2.
I have created the following.
*Role*
*SlurmCTLD*
*SlurmDBD*
*Mariadb Server for accounting storage*
*Primary*
Node1
Node2
Node2
*Backup*
Node2
Yes, the system is a HPE Cray EX, and I am trying to use
switch/hpe_slingshot.
RC
On 10/28/2022 11:21 AM, Ole Holm Nielsen wrote:
On 10/28/22 07:35, Richard Chang wrote:
I have observed that when I specify a switch type in the slurm.conf
file and that particular switch type is not present
Hi,
I have observed that when I specify a switch type in the slurm.conf file
and that particular switch type is not present in the slurmctld node,
slurmctld panics and shuts down. Is this expected ? My slurmctld doesn't
have the switch type, but the computes have that switch type. how can I
Hi,
Is there a thumb rule for the size of the directory that is NFS
exported, and to be used as StateSaveLocation.
I have a two node Slurmctld setup and both will mount an NFS exported
directory as the state save location.
Let me know your thoughts.
Thanks & regards,
RC
not.
-Paul Edmon-
On 8/2/2022 9:30 AM, Richard Chang wrote:
Hi,
I am new to SLURM, so please bear with me.
I need to understand whether the Server/Node running the slurmctld
daemon will need access to the Parallel file system, and if it will
need all the SW run time libraries installed
Hi,
I am new to SLURM, so please bear with me.
I need to understand whether the Server/Node running the slurmctld
daemon will need access to the Parallel file system, and if it will need
all the SW run time libraries installed, as in the compute nodes.
The users will login to the
Hi,
I am new to SLURM and I am still trying to understand stuff. There is
ample documentation available that teaches you how to set it up quickly.
Pardon me if this was asked before, I was not able to find anything
pointing to this.
I am trying to figure out if there is something like
14 matches
Mail list logo