On Thu, Dec 13, 2012 at 11:55 AM, William Stein <wst...@gmail.com> wrote: > On Wed, Dec 12, 2012 at 11:30 PM, William Stein <wst...@gmail.com> wrote: >> On Wed, Dec 12, 2012 at 7:00 PM, Anne Schilling <a...@math.ucdavis.edu> >> wrote: >>> Hi William, >>> >>> Is combinat.math.washington.edu out once again? The machine does not >>> seem to respond. >> >> It's not crashed due to a memory error, since it responds to ping >> requests. However, I can't ssh into it, which can happen when too >> many people run jobs at once (and the vm.overcommit ratio is too big, >> and there isn't enough swap). I'll get the sysadmins to reboot the >> machine tomorrow morning, then tighten up the vm.overcommit, and add >> more swap. > > The machine is back up. > The vm.overcommit stuff looks fine -- it's the same settings as on > sage.math, etc. > > Later today, I'm going to add an extra 96GB of swap to the little 32GB > of swap currently there; this should help with stability a lot.
I changed the vm.overcommit settings to: vm.overcommit_memory=2 vm.overcommit_ratio=60 and I've added the swap, so now there's 134GB of swap: root@combinat:/etc# free total used free shared buffers cached Mem: 198068436 20060256 178008180 0 585484 4144288 -/+ buffers/cache: 15330484 182737952 Swap: 134537208 0 134537208 Let me know if there's any trouble. I can buy and install another disk so we have more swap, if people feel that is a good idea (there is definitely plenty of room in the grant for this.) -- William > > William > >> >> -- William >> >>> I saw that several people were running heavy computations on it for the >>> last week and until yesterday, everything seemed fine. >> >> >> >>> >>> Thanks, >>> >>> Anne >>> >>> On 12/6/12 10:36 AM, William Stein wrote: >>>> Hi, >>>> >>>> After moving memory around, memtest86 (and the BIOS memtest) detected >>>> no errors. If people can try to stress test >>>> combinat.math.washington.edu for the next 24 hours (especially with >>>> large-memory computations), that would be very useful! >>>> >>>> William >>>> >>>> On Wed, Dec 5, 2012 at 3:25 PM, William Stein <wst...@gmail.com> wrote: >>>>> Hi, >>>>> >>>>> Andrew Ohana and I swapped two chips around and are currently running >>>>> memtest86 on >>>>> combinat.math. I'll check on the results tomorrow at about noon. >>>>> >>>>> -- William >>>>> >>>>> On Tue, Dec 4, 2012 at 10:49 PM, Andrew Mathas >>>>> <andrew.mat...@sydney.edu.au> wrote: >>>>>> Yes, thank you for taking care of this William. I am sure you have better >>>>>> things to do! >>>>>> >>>>>> Andrew >>>>>> >> >> >> >> -- >> William Stein >> Professor of Mathematics >> University of Washington >> http://wstein.org > > > > -- > William Stein > Professor of Mathematics > University of Washington > http://wstein.org -- William Stein Professor of Mathematics University of Washington http://wstein.org -- You received this message because you are subscribed to the Google Groups "sage-combinat-devel" group. To post to this group, send email to sage-combinat-devel@googlegroups.com. To unsubscribe from this group, send email to sage-combinat-devel+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/sage-combinat-devel?hl=en.