On Tue, 2011-08-09 at 15:56 +0200, Simon Morvan wrote: > Le 09/08/2011 15:09, Ben Hutchings a écrit : > > On Tue, 2011-08-09 at 14:55 +0200, Simon Morvan wrote: > >> Le 09/08/2011 14:50, Ben Hutchings a écrit : > >>> On Tue, 2011-08-09 at 12:22 +0200, Simon Morvan wrote: > >>>> We're getting random crash of the system (panics, GPF). Stack trace is > >>>> always different. > >>> Can you check that the power supply is sufficient for all these disks? > >> Do you have recommendations ? I haven't found so much information on how > >> to estimate the power need. Currently this is a 600W power supply (FWIW: > >> Cooler Master Silent Pro M - 600W) > > Many motherboards have a voltage monitoring chip, which you should be > > able to read with the 'sensors' command from the 'lm-sensors' package. > > This should show whether the actual voltages are being pulled down > > because the power supply is overloaded. You would need to actually make > > all the hard drives active while checking this. > it8720-isa-0228 > Adapter: ISA adapter > Vcore: +1.33 V (min = +0.78 V, max = +1.50 V) > Vdram: +1.50 V (min = +1.42 V, max = +1.57 V) > +3.3V: +3.30 V (min = +3.14 V, max = +3.47 V) > *+5V: +4.92 V (min = +4.76 V, max = +5.24 V)* > +12V: +12.36 V (min = +11.41 V, max = +12.62 V) > in5: +2.70 V (min = +0.00 V, max = +4.08 V) > 5VSB: +4.92 V (min = +4.76 V, max = +5.24 V) > Vbat: +3.25 V > CPU Fan: 0 RPM (min = 0 RPM) > Sys Fan: 0 RPM (min = 0 RPM) > Sys Fan: 0 RPM (min = 0 RPM) > fan5: 0 RPM (min = 0 RPM) > temp1: +44.0°C (low = +127.0°C, high = +127.0°C) sensor = > thermistor > CPU Temp: +59.0°C (low = +127.0°C, high = +127.0°C) sensor = > thermal diode > temp3: +54.0°C (low = +127.0°C, high = +127.0°C) sensor = > thermistor > cpu0_vid: +0.513 V > > Assuming the sensors.conf is OK (which is not 100% sure for that > Gigabyte GA-890GPA-UD3H mobo, AFAIK), do you think the 4.92V for the 5V > is too low ?
No, that's entirely normal. Aside from Vcore, the BIOS appears to set warning limits of 5% below and above the nominal values, and all voltages are within those limits. The temperatures also look fine. > I was hdparm'ing some disks while, compiling a kernel and a raid5 sync > was inprogress at that time (and it freezed, of course) So, new theory required. Given you said you're not using ECC memory, can you test it with memtest86+ for a few hours? Ben.
signature.asc
Description: This is a digitally signed message part