On 10/02/15 17:14, Mark Hahn wrote: > it's also worth pointing out that you can explicitly > tell the OOM killer to lay off a process (/proc/$pid/oom_*). > sshd does this to itself, for instance - it would make sense for a > scheduler daemon to do so as well.
The Slurm daemon that lives on the nodes already does this (code contributed by NUDT in China in 2009). * set_oomadj.c - prevent slurmd/slurmstepd from being killed by the * kernel OOM killer -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: [email protected] Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci _______________________________________________ Beowulf mailing list, [email protected] sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
