I'll put in a plug for Slurm. I'd say it's got the features you need and it scales up really well.
Accounting: it can write to file, but you're best off configuring the database backend (MySQL, Mariadb). Then there's both the built-in sacct tool and (if you're so inclined) queries directly against the database. It can also be set up to track job resource usage. CGROUPS: I've only used it in our test cluster, but I found it pretty straight forward. There's both cgroup limits and scheduler-enforced limits. As far a setting up easy shell access, we've written a wrapper that uses the "srun --pty /bin/bash" command to drop users into a shell on a compute node. If you've the budget, we've found the support offered by SchedMD to be a good value as well. The community is pretty helpful if that's sufficient for your environment. I'd used Torque/Moab as well as Slurm/Moab in past. Those worked well, but we've found that the pure Slurm answer has been more than satisfactory for our cluster (a smallish, ~2000 core capability compute system). Hope that helps. Michael On Tue, Jul 14, 2015 at 2:40 PM, John Hearns <[email protected]> wrote: > > > We are investigating options for a cluster/cloud scenario where I need > to keep track of CPU, > > >memory, disk used during the runs. This > "keeping track" should be available via command line tools >(preferably in > JSON/XML/CSV output that I can easily parse). > > Does Apache Mesos suit your needs? > > > http://opensource.com/business/14/9/open-source-datacenter-computing-apache-mesos > > I feel the days of us dinosaurs who write 'batch scripts' in shell and use > qsub/bsub to submit them to the almighty 'batch scheduler' are numbered. > The stateless mice with their http whiskers are running around our feet. > > ##################################################################################### > Scanned by MailMarshal - M86 Security's comprehensive email content > security solution. > > ##################################################################################### > Any views or opinions presented in this email are solely those of the > author and do not necessarily represent those of the company. Employees of > XMA Ltd are expressly required not to make defamatory statements and not to > infringe or authorise any infringement of copyright or any other legal > right by email communications. Any such communication is contrary to > company policy and outside the scope of the employment of the individual > concerned. The company will not accept any liability in respect of such > communication, and the employee responsible will be personally liable for > any damages or other liability arising. XMA Limited is registered in > England and Wales (registered no. 2051703). Registered Office: Wilford > Industrial Estate, Ruddington Lane, Wilford, Nottingham, NG11 7EP > _______________________________________________ > Beowulf mailing list, [email protected] sponsored by Penguin Computing > To change your subscription (digest mode or unsubscribe) visit > http://www.beowulf.org/mailman/listinfo/beowulf >
_______________________________________________ Beowulf mailing list, [email protected] sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
