Re: [slurm-users] Setup for backup slurmctld

2020-02-26 Thread Brian Andrus
I would say so. Certainly, if you have many nodes and/or many jobs being submitted, you will see an impact, but in my experience comparing Slurm to SGE, Slurm has much less overhead to cause as much impact. Brian Andrus On 2/26/2020 1:05 PM, Joshua Baker-LePain wrote: On Wed, 26 Feb 2020

Re: [slurm-users] Setup for backup slurmctld

2020-02-26 Thread Joshua Baker-LePain
On Wed, 26 Feb 2020 at 12:56pm, Brian Andrus wrote Any shared filesystem that both systems can get to will work. I have done it with NFS, Gluster, appliances (NetApp), etc. Being in a separate datacenter is fine, but you will see some latency, which you likely already addressed if you are

Re: [slurm-users] Setup for backup slurmctld

2020-02-26 Thread Brian Andrus
Any shared filesystem that both systems can get to will work. I have done it with NFS, Gluster, appliances (NetApp), etc. Being in a separate datacenter is fine, but you will see some latency, which you likely already addressed if you are pysically splitting a network like that. Also, very

[slurm-users] Setup for backup slurmctld

2020-02-26 Thread Joshua Baker-LePain
We're planning the migration of our moderately sized cluster (~400 nodes, 40K jobs/day) from SGE to slurm. We'd very much like to have a backup slurmctld, and it'd be even better if our backup slurmctld could be in a separate data center from the primary (though they'd still be on the same

[slurm-users] Slurm 17.11 and configuring backfill and oversubscribe to allow concurrent processes

2020-02-26 Thread Robert Kudyba
We run Bright 8.1 and Slurm 17.11. We are trying to allow for multiple concurrent jobs to run on our small 4 node cluster. Based on https://community.brightcomputing.com/question/5d6614ba08e8e81e885f1991?action=artikel=14=410=en=slurm+%2526%252334%253Bgang+scheduling%2526%252334%253B and

Re: [slurm-users] Slurm version 20.02.0 is now available

2020-02-26 Thread Dean Schulze
So it sounds like the simplest approach would be to remove libcons_common from the make file and copy cons_common.[ch] into my project and provide my own implementations in the appropriate functions in cons_common.c. On Wed, Feb 26, 2020 at 6:12 AM Jeffrey T Frey wrote: > Did you reuse the

[slurm-users] Question on how to make slurm aware of a CVMFS revision

2020-02-26 Thread Klein, Dennis
Sry, if this is a double post, but I have the impression my first email was discarded, because I was not yet subscribed to the list - it did not show up in the list archive. From: Klein, Dennis Sent: Tuesday, February 25, 2020 4:56 PM To: slurm-us...@schedmd.com

Re: [slurm-users] Slurm version 20.02.0 is now available

2020-02-26 Thread Jeffrey T Frey
Did you reuse the 20.02 select/cons_res/Makefile.{in,am} in your plugin's source? You probably will have to re-model your plugin after the select/cray_aries plugin if you need to override those two functions (it also defines its own select_p_job_begin() and doesn't link against

Re: [slurm-users] Slurm 19.05 X11-forwarding

2020-02-26 Thread Pär Lundö
Hi, Thank you for your quick replies. Please bear with me as I am a newbie regarding Slurm and Linux. My hostname is not a FQDN and I´m running Slurm on a local node (slurmctld and slurmuser are the same) just to verify that the X11-forwarding is working (amongst other things). The output of