On Wed, Dec 12, 2012 at 11:30 PM, William Stein <wst...@gmail.com> wrote:
> On Wed, Dec 12, 2012 at 7:00 PM, Anne Schilling <a...@math.ucdavis.edu> wrote:
>> Hi William,
>>
>> Is combinat.math.washington.edu out once again? The machine does not
>> seem to respond.
>
> It's not crashed due to a memory error, since it responds to ping
> requests. However, I can't ssh into it, which can happen when too
> many people run jobs at once (and the vm.overcommit ratio is too big,
> and there isn't enough swap). I'll get the sysadmins to reboot the
> machine tomorrow morning, then tighten up the vm.overcommit, and add
> more swap.

The machine is back up. The vm.overcommit stuff looks fine -- it's
the same settings as on sage.math, etc. Later today, I'm going to
add an extra 96GB of swap to the little 32GB of swap currently there;
this should help with stability a lot.

William

>
> -- William
>
>> I saw that several people were running heavy computations on it for the
>> last week and until yesterday, everything seemed fine.
>
>>
>> Thanks,
>>
>> Anne
>>
>> On 12/6/12 10:36 AM, William Stein wrote:
>>> Hi,
>>>
>>> After moving memory around, memtest86 (and the BIOS memtest) detected
>>> no errors. If people can try to stress test
>>> combinat.math.washington.edu for the next 24 hours (especially with
>>> large-memory computations), that would be very useful!
>>>
>>> William
>>>
>>> On Wed, Dec 5, 2012 at 3:25 PM, William Stein <wst...@gmail.com> wrote:
>>>> Hi,
>>>>
>>>> Andrew Ohana and I swapped two chips around and are currently running
>>>> memtest86 on combinat.math. I'll check on the results tomorrow at
>>>> about noon.
>>>>
>>>> -- William
>>>>
>>>> On Tue, Dec 4, 2012 at 10:49 PM, Andrew Mathas
>>>> <andrew.mat...@sydney.edu.au> wrote:
>>>>> Yes, thank you for taking care of this William. I am sure you have better
>>>>> things to do!
>>>>>
>>>>> Andrew
>>>>>
>
> --
> William Stein
> Professor of Mathematics
> University of Washington
> http://wstein.org

--
William Stein
Professor of Mathematics
University of Washington
http://wstein.org

--
You received this message because you are subscribed to the Google Groups "sage-combinat-devel" group.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
To unsubscribe from this group, send email to sage-combinat-devel+unsubscr...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/sage-combinat-devel?hl=en.