Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers

2014-01-19 Thread Bryan Fisher
Thank you very much Andreas!

Kind regards,

Bryan


-Original Message-
From: Andreas Cadhalpun [mailto:andreas.cadhal...@googlemail.com] 
Sent: 17 January 2014 09:51 PM
To: Bryan Fisher
Cc: 707...@bugs.debian.org; 733...@bugs.debian.org; Antoine Beaupré; 
tagg...@debian.org; d...@fifthhorseman.net; jroll...@finestructure.net
Subject: Re: Bug#707178: Breakin - stress-test and hardware diagnostics tool - 
Please see if you are able to assist to an issue we are having now for more 
than a month on 3 servers

Hi Bryan,

On 17.01.2014 10:13, Bryan Fisher wrote:
 I was hoping that maybe you could assist me in the issue that I am 
 getting with server h/w please.

 Attached is a screenshot of what happens when I insert a USB key to 
 copy the Breakin log file. It also indicates that 'Failid - Other 
 tests have errors, tuning on ID light..' would it be possible if you 
 could point me in a direction to find the fault please?

The error message (repeated 3 times) I read from the screenshot is:
kernel: [ 3009.877308] sd 9:0:0:0: [sdb] No Caching mode page present
kernel: [ 3009.877311] sd 9:0:0:0: [sdb] Assuming drive cache: write through

This reminds me of bug #733565 [1], which is about a request to silence these 
error messages.
I have seen similar messages and they seem to be totally harmless and have 
nothing to do with hardware failure.

Best regards,
Andreas


1: http://bugs.debian.org/733565


--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers

2014-01-17 Thread Bryan Fisher
Good day,

My name is Bryan Fisher, and I work for a company called Pinnacle Africa in 
South Africa, Cape Town.

I was hoping that maybe you could assist me in the issue that I am getting with 
server h/w please.

Attached is a screenshot of what happens when I insert a USB key to copy the 
Breakin log file. It also indicates that 'Failid - Other tests have errors, 
tuning on ID light..' would it be possible if you could point me in a direction 
to find the fault please?

I have run multiple memtest and it passes I have run burin in test in windows 
it passes, I have ran Sandra test  diagnostics and it passes while monitoring 
voltages and system temperatures it is all stable.

However, the server works for a few weeks, couple of months and it starts 
getting issues like reboots, freezing up and basically becomes unstable. We 
have changed mainboards, all ram modules, tested the PSU but the issue still 
remains.

This is the server H/W below that is in use;
ECC REG RAM - 8GB x4 modules
X9SRI-F - mainboard
E5-2620V2 2.1GB 6C Ivy bridge
1U Dual Xeon Chassis CSE-813MTQ-600CB

Please advise if you are able to assist or even just tell where the issue might 
be.

I got you e-mail address from here;
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=707178


Thank you kindly in advance! Hope to hearfrom you ASAP

Kind regards,


Bryan Fisher | Server Specialist
Cape Town | Pinnacle Africa
Direct: +27 21 5500 357 | Fax: +27 21 551 3444




Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers

2014-01-17 Thread Daniel Kahn Gillmor
Hi Bryan--

On 01/17/2014 04:13 AM, Bryan Fisher wrote:

 My name is Bryan Fisher, and I work for a company called Pinnacle Africa in 
 South Africa, Cape Town.
 
 I was hoping that maybe you could assist me in the issue that I am getting 
 with server h/w please.

I think you're asking about something underlated to what
http://bugs.debian.org/707178 is talking about.  The people that you've
e-mailed don't have anything to do with the breakin project, and we
can't support your organization's hardware at any rate.

I recommend you follow up with your hardware vendor (or retain other
local technical staff), and explain to them the errors that you're
having.  But your post is off-topic for the discussion of
http://bugs.debian.org/707178.

Regards,

--dkg



signature.asc
Description: OpenPGP digital signature


Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers

2014-01-17 Thread Andreas Cadhalpun

Hi Bryan,

On 17.01.2014 10:13, Bryan Fisher wrote:

I was hoping that maybe you could assist me in the issue that I am
getting with server h/w please.

Attached is a screenshot of what happens when I insert a USB key to copy
the Breakin log file. It also indicates that ‘Failid – Other tests have
errors, tuning on ID light..’ would it be possible if you could point me
in a direction to find the fault please?


The error message (repeated 3 times) I read from the screenshot is:
kernel: [ 3009.877308] sd 9:0:0:0: [sdb] No Caching mode page present
kernel: [ 3009.877311] sd 9:0:0:0: [sdb] Assuming drive cache: write through

This reminds me of bug #733565 [1], which is about a request to silence 
these error messages.
I have seen similar messages and they seem to be totally harmless and 
have nothing to do with hardware failure.


Best regards,
Andreas


1: http://bugs.debian.org/733565


--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org