On 05/30/2012 10:53 PM, Richard Elling wrote:
> On May 30, 2012, at 1:07 PM, Sašo Kiselkov wrote:
> 
>> On 05/25/2012 08:40 PM, Richard Elling wrote:
>>> See the soluion at https://www.illumos.org/issues/644
>>> -- richard
>>
>> And predictably, I'm back with another n00b question regarding this
>> array. I've put a pair of LSI-9200-8e controllers in the server and
>> attached the cables to the enclosure to each of the HBAs. As a result
>> (why?) I'm getting some really strange behavior:
>>
>> * piss poor performance (around 5MB/s per disk tops)
>> * fmd(1M) running one core at near 100% saturation each time something
>>   writes or reads from the pool
>> * using fmstat I noticed that its the eft module receiving hundreds of
>>   fault reports every second
>> * fmd is flooded by multipath failover ereports like:
>>
>> ...
>> May 29 21:11:44.9408 ereport.io.scsi.cmd.disk.tran
>> May 29 21:11:44.9423 ereport.io.scsi.cmd.disk.tran
>> May 29 21:11:44.8474 ereport.io.scsi.cmd.disk.recovered
>> May 29 21:11:44.9455 ereport.io.scsi.cmd.disk.tran
>> May 29 21:11:44.9457 ereport.io.scsi.cmd.disk.dev.rqs.derr
>> May 29 21:11:44.9462 ereport.io.scsi.cmd.disk.tran
>> May 29 21:11:44.9527 ereport.io.scsi.cmd.disk.tran
>> May 29 21:11:44.9535 ereport.io.scsi.cmd.disk.dev.rqs.derr
>> May 29 21:11:44.6362 ereport.io.scsi.cmd.disk.recovered
>> ...
>>
>>
>>
>> I suspect that multipath is something not exactly very happy with my
>> Toshiba disks, but I have no idea what to do to make it work at least
>> somehow acceptably. I tried messing with scsi_vhci.conf to try and set
>> load-balance="none", change the scsi-vhci-failover-override for the
>> Toshiba disks to f_asym_lsi, flashing the latest as well as old firmware
>> in the cards, reseating them to other PCI-e slots, removing one cable
>> and even removing one whole HBA, unloading the eft fmd module etc, but
>> nothing helped so far and I'm sort of out of ideas. Anybody else got an
>> idea on what I might try?
> 
> Those ereports are consistent with faulty cabling. You can trace all of the
> cables and errors using tools like lsiutil, sg_logs, kstats, etc. 
> Unfortunately,
> it is not really possible to get into this level of detail over email, and it 
> can
> consume many hours.
>  -- richard

That's actually a pretty good piece of information for me! I will try
changing my cabling to see if I can get the errors to go away. Thanks
again for the suggestions!

Cheers
--
Saso
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to