Re: PowerEdge 2800: megaraid/scsi errors (PERC 4e/di)

Eric Wood Wed, 10 Nov 2010 12:18:22 -0800

I disable the onboard Perc4/Di and put in a Perc4/DC which I purchased used) 
with a new scsi cable.  The system booted the RAID-5 array just fine. OS is 
RHEL 3.   However, I'm still getting frequent i/o errors:


Nov 10 14:02:09 dauphin_fly kernel:  I/O error: dev 08:07, sector 19312

megamgr still tells me for all three ST336753LC (Rev. DX10) drives:

No Predictive Failures
Media Errors: 0
Other Erros: 0

I'll shut down and remove the memory chip and battery modules.   Maybe I'm 
unlucky now with onboard and add-in controllers both having bad memory 
chips.  Ouch!

-Eric Wood
CTO
International Plastics, Inc.


----- Original Message ----- 
From: "Marek Ondrej"
Sent: Wednesday, November 10, 2010 2:54 AM
Subject: Re: PowerEdge 2800: megaraid/scsi errors (PERC 4e/di)


> hi,
>
> It's 99% bad cache memory module on controller. You can put it directly 
> into server memory bank
> and test it with memcheck. To repair you need to buy "ESG-X,DIMM, 256, 
> 400M, 32X72, 8, 240, ROMB"
> for cca. 55€. I have already repaired three PE1850s PERC 4e with the same 
> symptoms as you wrote.
>
> Marek
>
>
> Am Thursday 05 August 2010 schrieb Marc Petitmermet:
>> Dear all
>>
>> We have two identical PowerEdge 2800 (I know, 5 years old). Because it 
>> took the Dell Support people/ contractors so very long to set up 
>> everything (fibre channels switch, EMC CX300, custom drivers, etc.) to 
>> get it finally working, the system is more or less unchanged since the 
>> beginning. One of those PowerEdge 2800 is now acting up. I see messages 
>> like:
>>
>> megaraid: aborting-12854 cmd=2a <c=2 t=0 I=0>
>> megaraid abort: [255:128], driver owner
>> megaraid: resetting the host...
>> megaraid: 2 outstanding commands. Max wait 180 sec
>> etc.
>> scsi0 (0:0): rejecting I/O to offline device
>> etc.
>>
>> When I look at the RAID controller everything seems to be fine:
>> - Logical Drive, RAID 1, Size 34680MB, Stripes 2, StrSz 64KB, 
>> Drive-State: optimal
>> Battery:
>> - Battery Backup Module: present
>> - Battery Pack: present
>> - Temperature: good
>> - Voltage: good
>> - fast charging: in progress
>> - No of Cycles: 50
>>
>> What do the above errors mean? Are the disks failing or is this an other 
>> hardware issue? I booted from a Redhat CD in linux rescue mode and I 
>> could fsck all partitions without any problems at all.
>>
>> Any advise would be greatly appreciated.
>>
>> Regards,
>> Marc
>>
>>
>> Some more details about the hardware/software:
>> - Redhat Enterprise Linux 4.5 (2.6.9-22.0.2.ELsmp #1 SMP Thu Jan 5 
>> 17:11:56 EST 2006 x86_64 x86_64 x86_64 GNU/Linux)
>> - PERC 4e/di standard FW 521S DRAM=256MB (SDRAM)
>> - RAID 1; 2 x Seagate Cheetah 15K.4, Firmware D402
>>
>> _______________________________________________
>> Linux-PowerEdge mailing list
>> Linux-PowerEdge@dell.com
>> https://lists.us.dell.com/mailman/listinfo/linux-poweredge
>> Please read the FAQ at http://lists.us.dell.com/faq
>>
>
>
>
> _______________________________________________
> Linux-PowerEdge mailing list
> Linux-PowerEdge@dell.com
> https://lists.us.dell.com/mailman/listinfo/linux-poweredge
> Please read the FAQ at http://lists.us.dell.com/faq 

_______________________________________________
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq

Re: PowerEdge 2800: megaraid/scsi errors (PERC 4e/di)

Reply via email to