Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers
Thank you very much Andreas! Kind regards, Bryan -Original Message- From: Andreas Cadhalpun [mailto:andreas.cadhal...@googlemail.com] Sent: 17 January 2014 09:51 PM To: Bryan Fisher Cc: 707...@bugs.debian.org; 733...@bugs.debian.org; Antoine Beaupré; tagg...@debian.org; d...@fifthhorseman.net; jroll...@finestructure.net Subject: Re: Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers Hi Bryan, On 17.01.2014 10:13, Bryan Fisher wrote: > I was hoping that maybe you could assist me in the issue that I am > getting with server h/w please. > > Attached is a screenshot of what happens when I insert a USB key to > copy the Breakin log file. It also indicates that 'Failid - Other > tests have errors, tuning on ID light..' would it be possible if you > could point me in a direction to find the fault please? The error message (repeated 3 times) I read from the screenshot is: kernel: [ 3009.877308] sd 9:0:0:0: [sdb] No Caching mode page present kernel: [ 3009.877311] sd 9:0:0:0: [sdb] Assuming drive cache: write through This reminds me of bug #733565 [1], which is about a request to silence these error messages. I have seen similar messages and they seem to be totally harmless and have nothing to do with hardware failure. Best regards, Andreas 1: http://bugs.debian.org/733565 -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org
Bug#707178: Breakin - stress-test and hardware diagnostics tool - Please see if you are able to assist to an issue we are having now for more than a month on 3 servers
Good day, My name is Bryan Fisher, and I work for a company called Pinnacle Africa in South Africa, Cape Town. I was hoping that maybe you could assist me in the issue that I am getting with server h/w please. Attached is a screenshot of what happens when I insert a USB key to copy the Breakin log file. It also indicates that 'Failid - Other tests have errors, tuning on ID light..' would it be possible if you could point me in a direction to find the fault please? I have run multiple memtest and it passes I have run burin in test in windows it passes, I have ran Sandra test & diagnostics and it passes while monitoring voltages and system temperatures it is all stable. However, the server works for a few weeks, couple of months and it starts getting issues like reboots, freezing up and basically becomes unstable. We have changed mainboards, all ram modules, tested the PSU but the issue still remains. This is the server H/W below that is in use; ECC REG RAM - 8GB x4 modules X9SRI-F - mainboard E5-2620V2 2.1GB 6C Ivy bridge 1U Dual Xeon Chassis CSE-813MTQ-600CB Please advise if you are able to assist or even just tell where the issue might be. I got you e-mail address from here; http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=707178 Thank you kindly in advance! Hope to hearfrom you ASAP Kind regards, Bryan Fisher | Server Specialist Cape Town | Pinnacle Africa Direct: +27 21 5500 357 | Fax: +27 21 551 3444