[slurm-users] Re: SLURM configuration for LDAP users

2024-02-05 Thread Richard Chang via slurm-users
and Active Directory for 6.5 years without issue. We’ve always added users to sacctmgr so that we could track usage by research group or class, so we never used a default account for all users. From: Richard Chang via slurm-users Date: Saturday, February 3, 2024 at 11:41 PM To: slurm-us

[slurm-users] SLURM configuration for LDAP users

2024-02-03 Thread Richard Chang via slurm-users
Hi, I am a little new to this, so please pardon my ignorance. I have configured Slurm on my cluster and it works fine with local users, but I am not able to get it working with LDAP/SSSD authentication. User logins over ssh work fine. An LDAP user can log in to the login, slurmctld

Re: [slurm-users] SlurmDBD losing connection to the backend MariaDB

2022-11-02 Thread Richard Chang
data. There are many ways to do that, but those designs fall under MariaDB and not Slurm. Brian Andrus On 11/1/2022 6:49 PM, Richard Chang wrote: Does it mean it is best to use a single slurmdbd host in my case? My primary slurmctld is the backup slurmdbd host, and my worry is if the primary slur

Re: [slurm-users] SlurmDBD losing connection to the backend MariaDB

2022-11-01 Thread Richard Chang
[1] https://slurm.schedmd.com/publications.html On 10/31/2022 9:20 PM, Richard Chang wrote: Hi, Just for my info, I would like to know what happens when SlurmDBD loses its connection to the backend database, for example MariaDB. Does it cache the accounting info and keep it until the DB comes back up, or doe

Re: [slurm-users] [EXTERNAL] SlurmDBD losing connection to the backend MariaDB

2022-11-01 Thread Richard Chang
Hello Greg, I have a two-node setup. node1 is the primary slurmctld + backup slurmdbd, and node2 is the primary slurmdbd + backup slurmctld and the MySQL database host. My concern is that if node2 goes down, the backup slurmdbd will take over, and then what will happen? I have read that slurmctld can

[slurm-users] SlurmDBD losing connection to the backend MariaDB

2022-10-31 Thread Richard Chang
Hi, Just for my info, I would like to know what happens when SlurmDBD loses its connection to the backend database, for example MariaDB. Does it cache the accounting info and keep it until the DB comes back up, or does it panic and shut down? Thank you, RC.

Re: [slurm-users] Switch setting in slurm.conf breaks slurmctld if the switch type is not there in slurmcrld node

2022-10-31 Thread Richard Chang
This is 21.08. Thank you, RC On 10/31/2022 11:05 AM, Chris Samuel wrote: On 27/10/22 11:30 pm, Richard Chang wrote: Yes, the system is an HPE Cray EX, and I am trying to use switch/hpe_slingshot. Which version of Slurm are you using, Richard? All the best, Chris

[slurm-users] What happens if slurmdbd loses connection to mysql

2022-10-30 Thread Richard Chang
Hi, I have two dedicated nodes for Slurm, node1 and node2. I have created the following.
Role     SlurmCTLD  SlurmDBD  MariaDB server for accounting storage
Primary  Node1      Node2     Node2
Backup   Node2

Re: [slurm-users] Switch setting in slurm.conf breaks slurmctld if the switch type is not there in slurmcrld node

2022-10-28 Thread Richard Chang
Yes, the system is an HPE Cray EX, and I am trying to use switch/hpe_slingshot. RC On 10/28/2022 11:21 AM, Ole Holm Nielsen wrote: On 10/28/22 07:35, Richard Chang wrote: I have observed that when I specify a switch type in the slurm.conf file and that particular switch type is not present

[slurm-users] Switch setting in slurm.conf breaks slurmctld if the switch type is not there in slurmcrld node

2022-10-27 Thread Richard Chang
Hi, I have observed that when I specify a switch type in the slurm.conf file and that particular switch type is not present on the slurmctld node, slurmctld panics and shuts down. Is this expected? My slurmctld node doesn't have the switch type, but the compute nodes do. How can I

[slurm-users] Ideal NFS exported StateSaveLocation size.

2022-10-23 Thread Richard Chang
Hi, Is there a rule of thumb for the size of an NFS-exported directory that is to be used as the StateSaveLocation? I have a two-node slurmctld setup, and both nodes will mount an NFS-exported directory as the state save location. Let me know your thoughts. Thanks & regards, RC
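One way to ground a sizing estimate is to measure how much space an existing state directory actually uses. The following is only a minimal sketch under stated assumptions: the helper name dir_size_bytes and the example path are hypothetical, and the result describes one cluster at one moment, not an official sizing rule.

```python
import os


def dir_size_bytes(path):
    """Return the total size in bytes of regular files under path.

    Hypothetical helper for illustration; it is not part of Slurm.
    """
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            try:
                # Skip symlinks so a linked file is not counted twice.
                if not os.path.islink(full):
                    total += os.path.getsize(full)
            except OSError:
                # A file may disappear between listing and stat; ignore it.
                continue
    return total
```

For example, calling dir_size_bytes("/var/spool/slurmctld") (an assumed path; use whatever StateSaveLocation is set to in slurm.conf) on a running controller at a busy time gives a baseline to which headroom can be added.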

Re: [slurm-users] Does the slurmctld node need access to Parallel File system and Runtime libraries of the SW in the Compute nodes.

2022-08-02 Thread Richard Chang
not. -Paul Edmon- On 8/2/2022 9:30 AM, Richard Chang wrote: Hi, I am new to SLURM, so please bear with me. I need to understand whether the server/node running the slurmctld daemon needs access to the parallel file system, and whether it needs all the software runtime libraries installed

[slurm-users] Does the slurmctld node need access to Parallel File system and Runtime libraries of the SW in the Compute nodes.

2022-08-02 Thread Richard Chang
Hi, I am new to SLURM, so please bear with me. I need to understand whether the server/node running the slurmctld daemon needs access to the parallel file system, and whether it needs all the software runtime libraries installed, as on the compute nodes. The users will log in to the

[slurm-users] Software and Config for Job submission host only

2022-05-12 Thread Richard Chang
Hi, I am new to SLURM and I am still trying to understand stuff. There is ample documentation available that teaches you how to set it up quickly. Pardon me if this was asked before; I was not able to find anything pointing to this. I am trying to figure out if there is something like