Re: [zfs-discuss] scrubing/resilvering - controller problem

2009-10-08 Thread Roch Bourbonnais


You might try setting zfs_scrub_limit to 1 or 2 and attach a customer  
service record to :


6494473 ZFS needs a way to slow down resilvering

-r


Le 7 oct. 09 à 06:14, John a écrit :


Hi,

We are running b118, with a LSI 3801 controller which is connected  
to 44 drives (yes it's a lot behind a single controller). We also  
use a pair of ssd connected to another controller for read cache.
Everything works fine and we achieve acceptable performance for our  
needs.
However, during scrubbing or resilvering operations, it seems ZFS  
generates so much traffic that it overwhelmes the controller. The  
controller then logs the following errors:


Oct  6 07:30:04 nas101 scsi: [ID 107833 kern.warning] WARNING: / 
p...@0,0/pci8086,6...@4/pci1000,3...@0/s...@16,0 (sd19):

Oct  6 07:30:04 nas101incomplete read- retrying


Is there anything that can be done to slow down zfs operations such  
as resilvering/scrubbing? We tried tuning  zfs:zfs_vdev_max_pending  
but it did not really help.
This is a bit frustrating because this configuration works well to  
serve data. It's just too aggressive when the kernel accesses drives  
for some operations.



Iostat looks liks this:
   r/sw/s   kr/s   kw/s wait actv wsvc_t asvc_t  %w  %b s/w h/w  
trn tot device
9681.5   37.3 116403.3   35.1  0.0 30.20.03.1   0 1000   0   
21   8  29 c9
 420.31.5 5058.21.4  0.0  1.30.03.1   0  44   0
0   0   0 c9t8d0




Any help would be appreciated.

thanks,

JJ
--
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss




smime.p7s
Description: S/MIME cryptographic signature
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


[zfs-discuss] scrubing/resilvering - controller problem

2009-10-06 Thread John
Hi,

We are running b118, with a LSI 3801 controller which is connected to 44 drives 
(yes it's a lot behind a single controller). We also use a pair of ssd 
connected to another controller for read cache.
Everything works fine and we achieve acceptable performance for our needs.
However, during scrubbing or resilvering operations, it seems ZFS generates so 
much traffic that it overwhelmes the controller. The controller then logs the 
following errors:

Oct  6 07:30:04 nas101 scsi: [ID 107833 kern.warning] WARNING: 
/p...@0,0/pci8086,6...@4/pci1000,3...@0/s...@16,0 (sd19):
Oct  6 07:30:04 nas101incomplete read- retrying


Is there anything that can be done to slow down zfs operations such as 
resilvering/scrubbing? We tried tuning  zfs:zfs_vdev_max_pending but it did not 
really help.
This is a bit frustrating because this configuration works well to serve data. 
It's just too aggressive when the kernel accesses drives for some operations.


Iostat looks liks this:
r/sw/s   kr/s   kw/s wait actv wsvc_t asvc_t  %w  %b s/w h/w trn tot 
device
 9681.5   37.3 116403.3   35.1  0.0 30.20.03.1   0 1000   0  21   8  29 
c9
  420.31.5 5058.21.4  0.0  1.30.03.1   0  44   0   0   0   0 
c9t8d0



Any help would be appreciated.

thanks,

JJ
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss