I disable the onboard Perc4/Di and put in a Perc4/DC which I purchased used) with a new scsi cable. The system booted the RAID-5 array just fine. OS is RHEL 3. However, I'm still getting frequent i/o errors:
Nov 10 14:02:09 dauphin_fly kernel: I/O error: dev 08:07, sector 19312 megamgr still tells me for all three ST336753LC (Rev. DX10) drives: No Predictive Failures Media Errors: 0 Other Erros: 0 I'll shut down and remove the memory chip and battery modules. Maybe I'm unlucky now with onboard and add-in controllers both having bad memory chips. Ouch! -Eric Wood CTO International Plastics, Inc. ----- Original Message ----- From: "Marek Ondrej" Sent: Wednesday, November 10, 2010 2:54 AM Subject: Re: PowerEdge 2800: megaraid/scsi errors (PERC 4e/di) > hi, > > It's 99% bad cache memory module on controller. You can put it directly > into server memory bank > and test it with memcheck. To repair you need to buy "ESG-X,DIMM, 256, > 400M, 32X72, 8, 240, ROMB" > for cca. 55€. I have already repaired three PE1850s PERC 4e with the same > symptoms as you wrote. > > Marek > > > Am Thursday 05 August 2010 schrieb Marc Petitmermet: >> Dear all >> >> We have two identical PowerEdge 2800 (I know, 5 years old). Because it >> took the Dell Support people/ contractors so very long to set up >> everything (fibre channels switch, EMC CX300, custom drivers, etc.) to >> get it finally working, the system is more or less unchanged since the >> beginning. One of those PowerEdge 2800 is now acting up. I see messages >> like: >> >> megaraid: aborting-12854 cmd=2a <c=2 t=0 I=0> >> megaraid abort: [255:128], driver owner >> megaraid: resetting the host... >> megaraid: 2 outstanding commands. Max wait 180 sec >> etc. >> scsi0 (0:0): rejecting I/O to offline device >> etc. >> >> When I look at the RAID controller everything seems to be fine: >> - Logical Drive, RAID 1, Size 34680MB, Stripes 2, StrSz 64KB, >> Drive-State: optimal >> Battery: >> - Battery Backup Module: present >> - Battery Pack: present >> - Temperature: good >> - Voltage: good >> - fast charging: in progress >> - No of Cycles: 50 >> >> What do the above errors mean? Are the disks failing or is this an other >> hardware issue? I booted from a Redhat CD in linux rescue mode and I >> could fsck all partitions without any problems at all. >> >> Any advise would be greatly appreciated. >> >> Regards, >> Marc >> >> >> Some more details about the hardware/software: >> - Redhat Enterprise Linux 4.5 (2.6.9-22.0.2.ELsmp #1 SMP Thu Jan 5 >> 17:11:56 EST 2006 x86_64 x86_64 x86_64 GNU/Linux) >> - PERC 4e/di standard FW 521S DRAM=256MB (SDRAM) >> - RAID 1; 2 x Seagate Cheetah 15K.4, Firmware D402 >> >> _______________________________________________ >> Linux-PowerEdge mailing list >> Linux-PowerEdge@dell.com >> https://lists.us.dell.com/mailman/listinfo/linux-poweredge >> Please read the FAQ at http://lists.us.dell.com/faq >> > > > > _______________________________________________ > Linux-PowerEdge mailing list > Linux-PowerEdge@dell.com > https://lists.us.dell.com/mailman/listinfo/linux-poweredge > Please read the FAQ at http://lists.us.dell.com/faq _______________________________________________ Linux-PowerEdge mailing list Linux-PowerEdge@dell.com https://lists.us.dell.com/mailman/listinfo/linux-poweredge Please read the FAQ at http://lists.us.dell.com/faq