[slurm-dev] Re: Struggling with QOS?

2016-09-28 Thread Lachlan Musicman
Ah! Reading the docs is a killer. Leads to embarrassing admissions. AccountingStorageEnforce=Associations should be set to limits or qos. What is the difference between QOS and limits? Since my qos has a limit...I'm confused. I presume I should set AccountingStorageEnforce=qos, but I have no

[slurm-dev] Struggling with QOS?

2016-09-28 Thread Lachlan Musicman
Hi, After some fun incidents with accidental monopolization of the cluster, we decided to enforce some QOS. I read the documentation. Thus far in the setup, the only thing I've done that's even close is assigning "share" values when I set up each association. The cluster had a QOS called

[slurm-dev] Re: Slurmctld auto restart and kill running job, why ?

2016-09-28 Thread John DeSantis
Christopher, Yes, it does restart - but that's how we've configured logrotate. John DeSantis On 09/28/2016 07:55 PM, Christopher Samuel wrote: > > On 29/09/16 01:16, John DeSantis wrote: > >> We get the same snippet when our logrotate takes

[slurm-dev] Re: Slurmctld auto restart and kill running job, why ?

2016-09-28 Thread Christopher Samuel
On 29/09/16 01:16, John DeSantis wrote: > We get the same snippet when our logrotate takes action against the > cltdlog: Does your slurmctld restart then too? -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email:

[slurm-dev] auto detect Node definition details

2016-09-28 Thread Tus
Hi All, Is there a way to auto-detect the node details that go in slurm.conf? If I just have the NodeName in there, can Slurm get the basic info (e.g. CPUs, sockets, threads)? Thanks

[slurm-dev] Re: Slurm web dashboards

2016-09-28 Thread Lachlan Musicman
I can't see the dashing fork? -- The most dangerous phrase in the language is, "We've always done it this way." - Grace Hopper On 28 September 2016 at 17:46, Nathan Harper wrote: > *I rather liked the look of this, based on > dashing.io

[slurm-dev] Re: Send notification email

2016-09-28 Thread Eckert, Phil
If I understand your question, you can set it in the slurm.conf file; the default is: MailProg = /usr/bin/mail From: Fanny Pagés Díaz > Reply-To: slurm-dev > Date: Wednesday, September 28, 2016 at 11:45

[slurm-dev] Send notification email

2016-09-28 Thread Fanny Pagés Díaz
I need to send notification emails from Slurm using a mail server other than the standard one. Can anyone help me?

[slurm-dev] Re: Slurmctld auto restart and kill running job, why ?

2016-09-28 Thread John DeSantis
Christopher, >> [2016-09-26T08:02:16.582] Terminate signal (SIGINT or SIGTERM) >> received > > So that's some external process sending one of those two signals > to slurmctld, it's not something it's choosing to do at all. We've > never seen

[slurm-dev] Re: Slurm web dashboards

2016-09-28 Thread Nathan Harper
*I rather liked the look of this, based on dashing.io https://github.com/julcollas/dashing-slurm/blob/master/README.md* *Sadly dashing.io is not being supported, and this looks two years old now.* There is

[slurm-dev] Re: Invalid Protocol Version

2016-09-28 Thread Christopher Samuel
On 28/09/16 16:25, Barbara Krasovec wrote: > Yes, this worked! Thank you very much for your help! My pleasure! -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: sam...@unimelb.edu.au Phone: +61 (0)3 903 55545

[slurm-dev] Re: Invalid Protocol Version

2016-09-28 Thread Barbara Krasovec
On 09/28/2016 01:50 AM, Christopher Samuel wrote: > > On 27/09/16 23:54, Barbara Krasovec wrote: > >> The version of the client and server is the same. I guess the problem is >> in the slurmctld state file, where the slurm protocol version of some >> worker nodes must be wrong. > > I suspect