On Jan 22, 2012, at 4:41 PM, Ross Walker <rswwal...@gmail.com> wrote:

> On Jan 22, 2012, at 10:00 AM, Boris Epstein <borepst...@gmail.com> wrote:
> 
>> Jan 22 09:17:53 nrims-bs kernel: 3w-9xxx: scsi6: AEN: ERROR (0x04:0x0026):
>> Drive ECC error reported:port=4, unit=0.
>> Jan 22 09:17:53 nrims-bs kernel: 3w-9xxx: scsi6: AEN: ERROR (0x04:0x002D):
>> Source drive error occurred:port=4, unit=0.
>> Jan 22 09:17:53 nrims-bs kernel: 3w-9xxx: scsi6: AEN: ERROR (0x04:0x0004):
>> Rebuild failed:unit=0.
>> Jan 22 09:17:53 nrims-bs kernel: 3w-9xxx: scsi6: AEN: INFO (0x04:0x003B):
>> Rebuild paused:unit=0.
> 
> From 3ware's site:
> 004h Rebuild failed
> 
> The 3ware RAID controller was unable to complete a rebuild operation. This 
> error can be caused by drive errors on either the source or the destination 
> of the rebuild. However, due to ATA drives' ability to reallocate sectors on 
> write errors, the rebuild failure is most likely caused by the source drive 
> of the rebuild detecting some sort of read error. The default operation of 
> the 3ware RAID controller is to abort a rebuild if an error is encountered. 
> If it is desired to continue on error, you can set the Continue on Source 
> Error During Rebuild policy for the unit on the Controller Settings page in 
> 3DM.
> 
> 026h Drive ECC error reported
> 
> This AEN may be sent when a drive returns the ECC error response to an 3ware 
> RAID controller command. The AEN may or may not be associated with a host 
> command. Internal operations such as Background Media Scan post this AEN 
> whenever drive ECC errors are detected.
> 
> Drive ECC errors are an indication of a problem with grown defects on a 
> particular drive. For redundant arrays, this typically means that dynamic 
> sector repair would be invoked (see AEN 023h). For non-redundant arrays 
> (JBOD, RAID 0 and degraded arrays), drive ECC errors result in the 3ware RAID 
> controller returning failed status to the associated host command.
> 
> Sounds awfully like a hardware error on one of the drives. Replace the failed 
> drive and try rebuilding.
> 

This error code does not bode well.

02Dh Source drive error occurred

If an error is encountered during a rebuild operation, this AEN is generated if 
the error was on a source drive of the rebuild. Knowing if the error occurred 
on the source or the destination of the rebuild is useful for troubleshooting.



It's possible the whole RAID6 is corrupt.

-Ross


_______________________________________________
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos

Reply via email to