On Mon, 2018-02-05 at 10:15 +1300, Ben Caradoc-Davies wrote:
> On 05/02/18 09:49, Stephen P. Molnar wrote:
> > They seem to happen when I am
> > running  four or more apps at the same time.
> 
> I would never expect to see orphaned inodes except after a system
> crash 
> or kernel memory corruption. How did you test your CPU and RAM? Do
> you 
> see any other symptoms such as segfaults that could suggest memory 
> problems under concurrent load? How long have you seen this problem?
> I 
> see you are using ext4; are the inodes on these filesystems? ext4 is 
> very well tested and robust.
> 
> My preferred memory test for my 4-core (8-thread) Kaby Lake i7 is to
> run 
> concurrent "memtester" instances equal to the number of cores (4 in
> my 
> case), concurrent with "stress" equal to the number of cores ("stress
> -c 
> 4" in my case). This workout detected memory problems not found by
> other 
> tools such as "memtest86+" or "mprime -t".
> 
> Other hardware issues to consider are overheating (addressed with
> better 
> cooling and thermald) and power supply problems which may only be 
> evident at load. Is your system prime stable (i.e. runs with "mprime
> -t" 
> (with AVX disabled) for many hours)? How do you monitor system
> temperature?
> 
> Kind regards,
> 
Thanks for your reply.

As a matter of fact I did get an overheating warning from the OS when
running a rather large organic molecule on the Orca package with 8
threads.  I upgraded the CPU cooler to a Hyper 212 EVO and the problem
went away.

-- 
Stephen P. Molnar, Ph.D.
Consultant
www.molecular-modeling.net
(614)312-7528 (c)
Skype: smolnar1

Reply via email to