Re: [zfs-discuss] zfs corruption w/ sil3114 sata controllers

Neal Pollack Mon, 29 Oct 2007 21:43:53 -0800

Edward Saipetch wrote:
> Neal Pollack wrote:
>> Ed Saipetch wrote:
>>> Hello,
>>>
>>> I'm experiencing major checksum errors when using a syba silicon 
>>> image 3114 based pci sata controller w/ nonraid firmware.  I've 
>>> tested by copying data via sftp and smb.  With everything I've 
>>> swapped out, I can't fathom this being a hardware problem.  
>>
>> I can.  But I suppose it could also be in some unknown way a driver 
>> issue.
>> Even before ZFS, I've had numerous situations where various si3112 
>> and 3114 chips
>> would corrupt data on UFS and PCFS, with very simple  copy and checksum
>> test scripts, doing large bulk transfers.
>>
>> Si chips are best used to clean coffee grinders.  Go buy a real SATA 
>> controller.
>>
>> Neal
> I have no problem ponying up money for a better SATA controller.  I 
> saw a bunch of blog posts that people were successful using the card 
> so I thought maybe I had a bad card with corrupt firmware nvram.  Is 
> it worth trying to trace down the bug?


Of course it is.  File a bug so someone on the SATA team can study it.

> If this type of corruption exists, nobody should be using this card.  
> As a side note, what SATA cards are people having luck with?

A lot of people are happy with the 8 port PCI SATA card made by 
SuperMicro that has the Marvell chip on it.
Don't buy other marvell cards on ebay, because Marvell dumped a ton of 
cards that ended up with an earlier
rev of the silicon that can corrupt data.  But all the cards made by 
SuperMicro and sold by them have the c rev
or later silicon and work great.

That said, I wish someone would investigate the Silicon Image issues, 
but there are only so many engineers,
with so little time.
>>
>>> There have been quite a few blog posts out there with people having 
>>> a similar config and not having any problems.
>>>
>>> Here's what I've done so far:
>>> 1. Changed solaris releases from S10 U3 to NV 75a
>>> 2. Switched out motherboards and cpus from AMD sempron to a Celeron D
>>> 3. Switched out memory to use completely different dimms
>>> 4. Switched out sata drives (2-3 250gb hitachi's and seagates in 
>>> RAIDZ, 3x400GB seagates RAIDZ and 1x250GB hitachi with no raid)
>>>
>>> Here's output of a scrub and the status (ignore the date and time, I 
>>> haven't reset it on this new motherboard) and please point me in the 
>>> right direction if I'm barking up the wrong tree.
>>>
>>> # zpool scrub tank
>>> # zpool status
>>>   pool: tank
>>>  state: ONLINE
>>> status: One or more devices has experienced an error resulting in data
>>>         corruption.  Applications may be affected.
>>> action: Restore the file in question if possible.  Otherwise restore 
>>> the
>>>         entire pool from backup.
>>>    see: http://www.sun.com/msg/ZFS-8000-8A
>>>  scrub: scrub completed with 140 errors on Sat Sep 15 02:07:35 2007
>>> config:
>>>
>>>         NAME        STATE     READ WRITE CKSUM
>>>         tank        ONLINE       0     0   293
>>>           c0d1      ONLINE       0     0   293
>>>
>>> errors: 140 data errors, use '-v' for a list
>>>  
>>>  
>>> This message posted from opensolaris.org
>>> _______________________________________________
>>> zfs-discuss mailing list
>>> zfs-discuss@opensolaris.org
>>> http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
>>>   
>>
>

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Re: [zfs-discuss] zfs corruption w/ sil3114 sata controllers

Reply via email to