Re: [slurm-users] Parse error on starting slurmctld/slurmd

2019-05-26 Thread Priya Mishra
This is the configuration file :

# slurm.conf file generated by configurator easy.html.
# Put this file on all nodes of your cluster.
# See the slurm.conf man page for more information.
#
SlurmctldHost=LAPTOP-KKM6C1SN
#
#MailProg=/bin/mail
MpiDefault=none
#MpiParams=ports=#-#
ProctrackType=proctrack/pgid
ReturnToService=1
SlurmctldPidFile=/var/run/slurm-llnl/slurmctld.pid
#SlurmctldPort=6817
SlurmdPidFile=/var/run/slurm-llnl/slurmd.pid
#SlurmdPort=6818
SlurmdSpoolDir=/var/lib/slurm-llnl/slurmd
SlurmUser=priya
#SlurmdUser=root
StateSaveLocation=/var/lib/slurm-llnl/slurmctld
SwitchType=switch/none
TaskPlugin=task/none
#
#
# TIMERS
#KillWait=30
#MinJobAge=300
#SlurmctldTimeout=120
#SlurmdTimeout=300
#
#
# SCHEDULING
FastSchedule=1
SchedulerType=sched/backfill
SelectType=select/cons_res
SelectTypeParameters=CR_Core
#
#
# LOGGING AND ACCOUNTING
AccountingStorageType=accounting_storage/none
ClusterName=cluster
#JobAcctGatherFrequency=30
JobAcctGatherType=jobacct_gather/none
#SlurmctldDebug=info
SlurmctldLogFile=/var/log/slurm-llnl/slurmctld.log
#SlurmdDebug=info
SlurmdLogFile=/var/log/slurm-llnl/slurmd.log
#
#
# COMPUTE NODES
NodeName=LAPTOP-KKM6C1SN CPUs=1 State=UNKNOWN
PartitionName=debug Nodes=LAPTOP-KKM6C1SN Default=YES MaxTime=INFINITE
State=UP


Re: [slurm-users] Slurm Install on Remote System

2019-05-26 Thread Chris Samuel
On Saturday, 25 May 2019 11:40:26 PM PDT Benson Muite wrote:

> This has been a helpful thread. In addition to virtual machines,
> containers (such as Singularity) may be a possible way to setup your own
> SLURM installation.

I'd strongly recommend against that, if you want to be able to trial things 
like cgroups you'll want a VM at least.

-- 
  Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA






Re: [slurm-users] Parse error on starting slurmctld/slurmd

2019-05-26 Thread Chris Samuel
On Sunday, 26 May 2019 6:01:01 AM PDT Priya Mishra wrote:

> slurmctld: error: _parse_next_key: Parsing error at unrecognized key:
> SlurmctldHost slurmctld: error: Parse error in file
> /etc/slurm-llnl/slurm.conf line 5: "SlurmctldHost=LAPTOP-KKM6C1SN"
> slurmctld: fatal: Unable to process configuration file

You probably will need to post your slurm.conf to get a better idea, a parse 
error like that seems unusual.  It _might_ be that Slurm doesn't like the 
hyphen in the hostname, but I would find that unlikely, could there be extra 
characters, or the difference between DOS & Linux line ending conventions?

See this for more on that last point:   https://kb.iu.edu/d/acux

All the best,
Chris
-- 
  Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA






Re: [slurm-users] Using cgroups to hide GPUs on a shared controller/node

2019-05-26 Thread Steven Dick
What operating system are you running?
Modern versions of systemd automatically put login sessions into their
own cgroup which are themselves in a "user" group.
When slurm is running parallel to this, it makes its own slurm cgroup.

It should be possible to have something at boot modify the systemd
user cgroup to remove the gpus.

Alternately, there may be options in pam-slurm-adopt to automatically
move login sessions to a default cgroup.  (I can't remember without
looking how easy this would be.)



[slurm-users] Parse error on starting slurmctld/slurmd

2019-05-26 Thread Priya Mishra
I am using Ubuntu-18.04 on Windows Subsystem for Linux. I have installed
slurm using the ubuntu package (slurm-wlm). I get the the following error
while starting slurmctld /slurmd:

slurmctld: error: _parse_next_key: Parsing error at unrecognized key:
SlurmctldHost slurmctld: error: Parse error in file
/etc/slurm-llnl/slurm.conf line 5: "SlurmctldHost=LAPTOP-KKM6C1SN"
slurmctld: fatal: Unable to process configuration file
The hostname provided in the configuration file is the output of 'hostname
-s' . What maybe causing the problem?

Thanks,
Priya


Re: [slurm-users] Slurm Install on Remote System

2019-05-26 Thread Priya Mishra
Installing slurm without root access is proving to be difficult. I will
explore the other options suggested in this thread.
Thanks,
Priya

>
>
>


Re: [slurm-users] Slurm Install on Remote System

2019-05-26 Thread John Hearns
Think of system administrators like grumpy bears in their caves.
They will growl at you and make fierce noises.
Btu bring them cookies and they will roll over and let their tummies be
tickled.

On Sun, 26 May 2019 at 05:25, Raymond Wan  wrote:

>
>
> On 25/5/2019 7:37 PM, John Hearns wrote:
>  > > On Sat, 25 May 2019 at 11:38, Priya Mishra
>  > > <26priy...@gmail.com > wrote:
>  > >
>  > > I am trying to install slurm on a remote system for
>  > > which I don't have root access.
>  > > After following the instructions on the official
> website
> > OK, I am going to stick my neck out here.
> > You say a 'remote system' - is this a single server? If it
> > is, for what purpose do you need Slurm?
> > If you want to schedule some tasks to run one after the
> > other, simply start a screen session then put the takss into
> > a script.
>
>
> Let me add to John's reply by saying you can then use a
> (user-level) cron job and/or the "at" command.  Or, create a
> pipeline using something like Snakemake (which is like a
> Makefile).
>
> I'm not knowledgeable enough to know whether or not you'd
> (the OP) be successful, but even if you were, your system
> administrator might start asking why you are doing what
> you're doing.  :-)
>
> Ray
>
>
>
>


Re: [slurm-users] Slurm Install on Remote System

2019-05-26 Thread John Hearns
Priya, you could set up a cluster on Aamazon or another cloud for testing.
Please have a look at this
https://elasticluster.readthedocs.io/en/latest/

If you want to set up some virtual machines on your own laptop or server,
Google for  vagrant  slurm   There are several vagrant recipes on the net.
Please keep posting - we are all very happy to help here.


On Sun, 26 May 2019 at 07:43, Benson Muite 
wrote:

>
> On 5/26/19 7:26 AM, Chris Samuel wrote:
> > On Saturday, 25 May 2019 6:56:58 AM PDT Priya Mishra wrote:
> >
> >> Yes,it is a single server. I am an undergraduate student . I am doing a
> >> project on job scheduling in clusters and needed slurm for the same. So
> I
> >> started with a single system and once I am able to run slurm
> successfully
> >> on one system , I will expand to include more systems.
> > I suspect you're going to find this very hard without having root access
> as you
> > will need to ensure you have the same munge keys across all systems (as
> well
> > as the same usernames and UNIX UID's for them).
> >
> > You might find that firing up a little cluster in a set of VM's makes
> life a lot
> > easier for you as you'll be able to control the whole environment.
> >
> > All the best,
> > Chris
> This has been a helpful thread. In addition to virtual machines,
> containers (such as Singularity) may be a possible way to setup your own
> SLURM installation.
>
>