On Thu, Dec 13, 2012 at 11:55 AM, William Stein <wst...@gmail.com> wrote:
> On Wed, Dec 12, 2012 at 11:30 PM, William Stein <wst...@gmail.com> wrote:
>> On Wed, Dec 12, 2012 at 7:00 PM, Anne Schilling <a...@math.ucdavis.edu> wrote:
>>> Hi William,
>>>
>>> Is combinat.math.washington.edu out once again? The machine does not
>>> seem to respond.
>>
>> It hasn't crashed from a memory error, since it still responds to ping
>> requests.  However, I can't ssh into it, which can happen when too
>> many people run jobs at once (and vm.overcommit_ratio is too high,
>> and there isn't enough swap).  I'll get the sysadmins to reboot the
>> machine tomorrow morning, then tighten up the vm.overcommit settings
>> and add more swap.
>
> The machine is back up.
> The vm.overcommit stuff looks fine -- it's the same settings as on
> sage.math, etc.
>
> Later today, I'm going to add an extra 96GB of swap to the little 32GB
> of swap currently there; this should help with stability a lot.

I changed the vm.overcommit settings to:

vm.overcommit_memory=2
vm.overcommit_ratio=60

and I've added the swap, so now there's about 128GB of swap (134537208 kB):

root@combinat:/etc# free
             total       used       free     shared    buffers     cached
Mem:     198068436   20060256  178008180          0     585484    4144288
-/+ buffers/cache:   15330484  182737952
Swap:    134537208          0  134537208
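
For anyone wondering what these settings mean in practice: with
vm.overcommit_memory=2 the kernel refuses allocations once the total
committed address space would exceed roughly swap + RAM * overcommit_ratio / 100.
Here is a rough sketch of that arithmetic using the numbers from the free
output above. The function names are made up for illustration, and the sketch
assumes no huge pages are reserved (nothing in this thread says either way).

```python
# Figures in kB, copied from the `free` output above.
MEM_TOTAL_KB = 198068436
SWAP_TOTAL_KB = 134537208
OVERCOMMIT_RATIO = 60  # vm.overcommit_ratio


def commit_limit_kb(mem_total_kb, swap_total_kb, overcommit_ratio, hugetlb_kb=0):
    """Approximate the commit limit under vm.overcommit_memory=2.

    Rule: swap + (RAM - huge pages) * overcommit_ratio / 100.
    hugetlb_kb=0 is an assumption; adjust it if huge pages are reserved.
    """
    return swap_total_kb + (mem_total_kb - hugetlb_kb) * overcommit_ratio // 100


def kb_to_gib(kb):
    """Convert the kB units that `free` prints to GiB."""
    return kb / (1024 * 1024)


if __name__ == "__main__":
    limit = commit_limit_kb(MEM_TOTAL_KB, SWAP_TOTAL_KB, OVERCOMMIT_RATIO)
    print(f"swap:         {kb_to_gib(SWAP_TOTAL_KB):.1f} GiB")
    print(f"commit limit: {limit} kB ({kb_to_gib(limit):.1f} GiB)")
```

The value the kernel actually enforces, and how much is currently committed,
show up as CommitLimit and Committed_AS in /proc/meminfo, so that is the
place to check if jobs start failing with out-of-memory errors.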

Let me know if there's any trouble.  I can buy and install another disk
so we have more swap, if people feel that is a good idea (there is
definitely plenty of room in the grant for this).

 -- William

>
> William
>
>>
>>  -- William
>>
>>> I saw that several people were running heavy computations on it for the
>>> last week and until yesterday, everything seemed fine.
>>
>>
>>
>>>
>>> Thanks,
>>>
>>> Anne
>>>
>>> On 12/6/12 10:36 AM, William Stein wrote:
>>>> Hi,
>>>>
>>>> After moving memory around, memtest86 (and the BIOS memtest) detected
>>>> no errors.  If people can try to stress test
>>>> combinat.math.washington.edu for the next 24 hours (especially with
>>>> large-memory computations), that would be very useful!
>>>>
>>>> William
>>>>
>>>> On Wed, Dec 5, 2012 at 3:25 PM, William Stein <wst...@gmail.com> wrote:
>>>>> Hi,
>>>>>
>>>>> Andrew Ohana and I swapped two chips around and are currently running
>>>>> memtest86 on combinat.math.  I'll check on the results tomorrow at
>>>>> about noon.
>>>>>
>>>>>  -- William
>>>>>
>>>>> On Tue, Dec 4, 2012 at 10:49 PM, Andrew Mathas
>>>>> <andrew.mat...@sydney.edu.au> wrote:
>>>>>> Yes, thank you for taking care of this William. I am sure you have better
>>>>>> things to do!
>>>>>>
>>>>>> Andrew
>>>>>>
>>
>>
>>
>> --
>> William Stein
>> Professor of Mathematics
>> University of Washington
>> http://wstein.org

-- 
You received this message because you are subscribed to the Google Groups 
"sage-combinat-devel" group.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
To unsubscribe from this group, send email to 
sage-combinat-devel+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/sage-combinat-devel?hl=en.
