On May 30, 2012, at 1:07 PM, Sašo Kiselkov wrote:

> On 05/25/2012 08:40 PM, Richard Elling wrote:
>> See the soluion at https://www.illumos.org/issues/644
>> -- richard
> 
> And predictably, I'm back with another n00b question regarding this
> array. I've put a pair of LSI-9200-8e controllers in the server and
> attached the cables to the enclosure to each of the HBAs. As a result
> (why?) I'm getting some really strange behavior:
> 
> * piss poor performance (around 5MB/s per disk tops)
> * fmd(1M) running one core at near 100% saturation each time something
>   writes or reads from the pool
> * using fmstat I noticed that its the eft module receiving hundreds of
>   fault reports every second
> * fmd is flooded by multipath failover ereports like:
> 
> ...
> May 29 21:11:44.9408 ereport.io.scsi.cmd.disk.tran
> May 29 21:11:44.9423 ereport.io.scsi.cmd.disk.tran
> May 29 21:11:44.8474 ereport.io.scsi.cmd.disk.recovered
> May 29 21:11:44.9455 ereport.io.scsi.cmd.disk.tran
> May 29 21:11:44.9457 ereport.io.scsi.cmd.disk.dev.rqs.derr
> May 29 21:11:44.9462 ereport.io.scsi.cmd.disk.tran
> May 29 21:11:44.9527 ereport.io.scsi.cmd.disk.tran
> May 29 21:11:44.9535 ereport.io.scsi.cmd.disk.dev.rqs.derr
> May 29 21:11:44.6362 ereport.io.scsi.cmd.disk.recovered
> ...
> 
> 
> 
> I suspect that multipath is something not exactly very happy with my
> Toshiba disks, but I have no idea what to do to make it work at least
> somehow acceptably. I tried messing with scsi_vhci.conf to try and set
> load-balance="none", change the scsi-vhci-failover-override for the
> Toshiba disks to f_asym_lsi, flashing the latest as well as old firmware
> in the cards, reseating them to other PCI-e slots, removing one cable
> and even removing one whole HBA, unloading the eft fmd module etc, but
> nothing helped so far and I'm sort of out of ideas. Anybody else got an
> idea on what I might try?

Those ereports are consistent with faulty cabling. You can trace all of the
cables and errors using tools like lsiutil, sg_logs, kstats, etc. Unfortunately,
it is not really possible to get into this level of detail over email, and it 
can
consume many hours.
 -- richard

--
ZFS Performance and Training
richard.ell...@richardelling.com
+1-760-896-4422







_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to