Re: [sage-combinat-devel] Terminated jobs on combinat server?

2013-06-18 Thread Florent Hivert
  Hi Andrew,

  I definitely didn't kill them, and the machine hasn't been rebooted in 
  2 months.  Your jobs might have used too much memory and been killed 
  by the Linux kernel. 
 
  Thanks William, that's probably what happened. I'll try and improve my 
 memory management!

I'm sorry your computation where stopped.

I have some extremely CPU intensive computations running on combinat. I
designed it to run on the CPU cache so that it needs very few memory: It
actually runs on 32 core but take less than a MB of memory. I'm watching it on
a regular basis and I remember seeing one of your code which had allocated
mode than 30 GB. If there were only one it shouldn't pose any problem on
combinat. However, there are some virtual machine running and several other
memory eating processes.

Cheers,

Florent

-- 
You received this message because you are subscribed to the Google Groups 
sage-combinat-devel group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sage-combinat-devel+unsubscr...@googlegroups.com.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
Visit this group at http://groups.google.com/group/sage-combinat-devel.
For more options, visit https://groups.google.com/groups/opt_out.




Re: [sage-combinat-devel] Terminated jobs on combinat server?

2013-06-18 Thread Anne Schilling
Hi Andrew,

I also had a large job running on combinat. I just checked and it also seems to 
have
disappeared. Too bad! So it might be good to find out what happened!?

Best,

Anne

On 6/17/13 11:29 PM, Florent Hivert wrote:
   Hi Andrew,
 
 I definitely didn't kill them, and the machine hasn't been rebooted in 
 2 months.  Your jobs might have used too much memory and been killed 
 by the Linux kernel. 

 Thanks William, that's probably what happened. I'll try and improve my 
 memory management!
 
 I'm sorry your computation where stopped.
 
 I have some extremely CPU intensive computations running on combinat. I
 designed it to run on the CPU cache so that it needs very few memory: It
 actually runs on 32 core but take less than a MB of memory. I'm watching it on
 a regular basis and I remember seeing one of your code which had allocated
 mode than 30 GB. If there were only one it shouldn't pose any problem on
 combinat. However, there are some virtual machine running and several other
 memory eating processes.
 
 Cheers,
 
 Florent

-- 
You received this message because you are subscribed to the Google Groups 
sage-combinat-devel group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sage-combinat-devel+unsubscr...@googlegroups.com.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
Visit this group at http://groups.google.com/group/sage-combinat-devel.
For more options, visit https://groups.google.com/groups/opt_out.




Re: [sage-combinat-devel] Terminated jobs on combinat server?

2013-06-18 Thread William Stein
We could repartition the new /scratch disk to provide a bigger swap...?
On Jun 18, 2013 2:00 AM, Anne Schilling a...@math.ucdavis.edu wrote:

 Hi Andrew,

 I also had a large job running on combinat. I just checked and it also
 seems to have
 disappeared. Too bad! So it might be good to find out what happened!?

 Best,

 Anne

 On 6/17/13 11:29 PM, Florent Hivert wrote:
Hi Andrew,
 
  I definitely didn't kill them, and the machine hasn't been rebooted in
  2 months.  Your jobs might have used too much memory and been killed
  by the Linux kernel.
 
  Thanks William, that's probably what happened. I'll try and improve my
  memory management!
 
  I'm sorry your computation where stopped.
 
  I have some extremely CPU intensive computations running on combinat. I
  designed it to run on the CPU cache so that it needs very few memory: It
  actually runs on 32 core but take less than a MB of memory. I'm watching
 it on
  a regular basis and I remember seeing one of your code which had
 allocated
  mode than 30 GB. If there were only one it shouldn't pose any problem on
  combinat. However, there are some virtual machine running and several
 other
  memory eating processes.
 
  Cheers,
 
  Florent

 --
 You received this message because you are subscribed to the Google Groups
 sage-combinat-devel group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to sage-combinat-devel+unsubscr...@googlegroups.com.
 To post to this group, send email to sage-combinat-devel@googlegroups.com.
 Visit this group at http://groups.google.com/group/sage-combinat-devel.
 For more options, visit https://groups.google.com/groups/opt_out.




-- 
You received this message because you are subscribed to the Google Groups 
sage-combinat-devel group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sage-combinat-devel+unsubscr...@googlegroups.com.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
Visit this group at http://groups.google.com/group/sage-combinat-devel.
For more options, visit https://groups.google.com/groups/opt_out.




Re: [sage-combinat-devel] Terminated jobs on combinat server?

2013-06-18 Thread Anne Schilling
Yes, this might be a good idea!

Anne

On 6/18/13 6:24 AM, William Stein wrote:
 We could repartition the new /scratch disk to provide a bigger swap...?
 
 On Jun 18, 2013 2:00 AM, Anne Schilling a...@math.ucdavis.edu 
 mailto:a...@math.ucdavis.edu wrote:
 
 Hi Andrew,
 
 I also had a large job running on combinat. I just checked and it also 
 seems to have
 disappeared. Too bad! So it might be good to find out what happened!?
 
 Best,
 
 Anne
 
 On 6/17/13 11:29 PM, Florent Hivert wrote:
Hi Andrew,
 
  I definitely didn't kill them, and the machine hasn't been rebooted in
  2 months.  Your jobs might have used too much memory and been killed
  by the Linux kernel.
 
  Thanks William, that's probably what happened. I'll try and improve my
  memory management!
 
  I'm sorry your computation where stopped.
 
  I have some extremely CPU intensive computations running on combinat. I
  designed it to run on the CPU cache so that it needs very few memory: It
  actually runs on 32 core but take less than a MB of memory. I'm 
 watching it on
  a regular basis and I remember seeing one of your code which had 
 allocated
  mode than 30 GB. If there were only one it shouldn't pose any problem on
  combinat. However, there are some virtual machine running and several 
 other
  memory eating processes.
 
  Cheers,
 
  Florent
 
 --
 You received this message because you are subscribed to the Google Groups 
 sage-combinat-devel group.
 To unsubscribe from this group and stop receiving emails from it, send an 
 email to sage-combinat-devel+unsubscr...@googlegroups.com 
 mailto:sage-combinat-devel%2bunsubscr...@googlegroups.com.
 To post to this group, send email to sage-combinat-devel@googlegroups.com 
 mailto:sage-combinat-devel@googlegroups.com.
 Visit this group at http://groups.google.com/group/sage-combinat-devel.
 For more options, visit https://groups.google.com/groups/opt_out.
 
 
 -- 
 You received this message because you are subscribed to the Google Groups 
 sage-combinat-devel group.
 To unsubscribe from this group and stop receiving emails from it, send an 
 email to sage-combinat-devel+unsubscr...@googlegroups.com.
 To post to this group, send email to sage-combinat-devel@googlegroups.com.
 Visit this group at http://groups.google.com/group/sage-combinat-devel.
 For more options, visit https://groups.google.com/groups/opt_out.

-- 
You received this message because you are subscribed to the Google Groups 
sage-combinat-devel group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sage-combinat-devel+unsubscr...@googlegroups.com.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
Visit this group at http://groups.google.com/group/sage-combinat-devel.
For more options, visit https://groups.google.com/groups/opt_out.




Re: [sage-combinat-devel] Terminated jobs on combinat server?

2013-06-17 Thread William Stein
On Mon, Jun 17, 2013 at 4:25 PM, Andrew Mathas
andrew.mat...@sydney.edu.au wrote:
 Hi All,

 I had some (very long and quite intensive but niced) jobs running on the
 combinat server but they have all been terminated. I'm not sure whether this
 was done accidentally or because they were hogging resources or for some
 other reason (although I was monitoring them and they seemed OK). This
 calculation has been all that I have had time for sagewise for the past few
 months:( Unfortunately I have lost some of the calculation but luckily I can
 restart them within a few days of when they died.

 Before restarting them I thought that I should check to see if I was
 breaking the usage guidelines or etiquette. Please advise!

I definitely didn't kill them, and the machine hasn't been rebooted in
2 months.  Your jobs might have used too much memory and been killed
by the Linux kernel.


 Cheers,
 Andrew


 --
 You received this message because you are subscribed to the Google Groups
 sage-combinat-devel group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to sage-combinat-devel+unsubscr...@googlegroups.com.
 To post to this group, send email to sage-combinat-devel@googlegroups.com.
 Visit this group at http://groups.google.com/group/sage-combinat-devel.
 For more options, visit https://groups.google.com/groups/opt_out.





-- 
William Stein
Professor of Mathematics
University of Washington
http://wstein.org

-- 
You received this message because you are subscribed to the Google Groups 
sage-combinat-devel group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sage-combinat-devel+unsubscr...@googlegroups.com.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
Visit this group at http://groups.google.com/group/sage-combinat-devel.
For more options, visit https://groups.google.com/groups/opt_out.




Re: [sage-combinat-devel] Terminated jobs on combinat server?

2013-06-17 Thread Andrew Mathas


On Tuesday, 18 June 2013 11:26:13 UTC+10, William Stein wrote:

 I definitely didn't kill them, and the machine hasn't been rebooted in 
 2 months.  Your jobs might have used too much memory and been killed 
 by the Linux kernel. 

 Thanks William, that's probably what happened. I'll try and improve my 
memory management!

Andrew 

-- 
You received this message because you are subscribed to the Google Groups 
sage-combinat-devel group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sage-combinat-devel+unsubscr...@googlegroups.com.
To post to this group, send email to sage-combinat-devel@googlegroups.com.
Visit this group at http://groups.google.com/group/sage-combinat-devel.
For more options, visit https://groups.google.com/groups/opt_out.