Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers

2014-01-19 Thread Bryan Fisher
Thank you very much Andreas!

Kind regards,

Bryan


-Original Message-
From: Andreas Cadhalpun [mailto:andreas.cadhal...@googlemail.com] 
Sent: 17 January 2014 09:51 PM
To: Bryan Fisher
Cc: 707...@bugs.debian.org; 733...@bugs.debian.org; Antoine Beaupré; 
tagg...@debian.org; d...@fifthhorseman.net; jroll...@finestructure.net
Subject: Re: Bug#707178: Breakin - stress-test and hardware diagnostics tool - 
Please see if you are able to assist to an issue we are having now for more 
than a month on 3 servers

Hi Bryan,

On 17.01.2014 10:13, Bryan Fisher wrote:
> I was hoping that maybe you could assist me in the issue that I am 
> getting with server h/w please.
>
> Attached is a screenshot of what happens when I insert a USB key to 
> copy the Breakin log file. It also indicates that 'Failid - Other 
> tests have errors, tuning on ID light..' would it be possible if you 
> could point me in a direction to find the fault please?

The error message (repeated 3 times) I read from the screenshot is:
kernel: [ 3009.877308] sd 9:0:0:0: [sdb] No Caching mode page present
kernel: [ 3009.877311] sd 9:0:0:0: [sdb] Assuming drive cache: write through

This reminds me of bug #733565 [1], which is about a request to silence these 
error messages.
I have seen similar messages and they seem to be totally harmless and have 
nothing to do with hardware failure.

Best regards,
Andreas


1: http://bugs.debian.org/733565


--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org



Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers

2014-01-17 Thread Bryan Fisher
Good day,

My name is Bryan Fisher, and I work for a company called Pinnacle Africa in 
South Africa, Cape Town.

I was hoping that maybe you could assist me in the issue that I am getting with 
server h/w please.

Attached is a screenshot of what happens when I insert a USB key to copy the 
Breakin log file. It also indicates that 'Failid - Other tests have errors, 
tuning on ID light..' would it be possible if you could point me in a direction 
to find the fault please?

I have run multiple memtest and it passes I have run burin in test in windows 
it passes, I have ran Sandra test & diagnostics and it passes while monitoring 
voltages and system temperatures it is all stable.

However, the server works for a few weeks, couple of months and it starts 
getting issues like reboots, freezing up and basically becomes unstable. We 
have changed mainboards, all ram modules, tested the PSU but the issue still 
remains.

This is the server H/W below that is in use;
ECC REG RAM - 8GB x4 modules
X9SRI-F - mainboard
E5-2620V2 2.1GB 6C Ivy bridge
1U Dual Xeon Chassis CSE-813MTQ-600CB

Please advise if you are able to assist or even just tell where the issue might 
be.

I got you e-mail address from here;
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=707178


Thank you kindly in advance! Hope to hearfrom you ASAP

Kind regards,


Bryan Fisher | Server Specialist
Cape Town | Pinnacle Africa
Direct: +27 21 5500 357 | Fax: +27 21 551 3444