Filippo Giunchedi has uploaded a new change for review. https://gerrit.wikimedia.org/r/315103
Change subject: raid: increase check_hpssacli timeout ...................................................................... raid: increase check_hpssacli timeout When swift HP boxes are busy the command can take a long time to complete. Sometimes the SCSI command is aborted as well, this can be seen in the logs with CDB 0x1201 (Inquiry) [512075.863854] hpsa 0000:08:00.0: CDB 12018000600000000000000000000000 was aborted with status 0x0 Change-Id: Ie27b001f2b73b78d79aee6ff2cb9a92c2f405ef4 --- M modules/raid/manifests/init.pp 1 file changed, 1 insertion(+), 1 deletion(-) git pull ssh://gerrit.wikimedia.org:29418/operations/puppet refs/changes/03/315103/1 diff --git a/modules/raid/manifests/init.pp b/modules/raid/manifests/init.pp index 41cc3c2..660949a 100644 --- a/modules/raid/manifests/init.pp +++ b/modules/raid/manifests/init.pp @@ -80,7 +80,7 @@ nrpe::monitor_service { 'raid_hpssacli': description => 'HP RAID', nrpe_command => '/usr/local/lib/nagios/plugins/check_hpssacli', - timeout => 40, # can take > 10s on servers with lots of disks + timeout => 50, # can take > 10s on servers with lots of disks } $get_raid_status_hpssacli = '/usr/local/lib/nagios/plugins/get-raid-status-hpssacli' -- To view, visit https://gerrit.wikimedia.org/r/315103 To unsubscribe, visit https://gerrit.wikimedia.org/r/settings Gerrit-MessageType: newchange Gerrit-Change-Id: Ie27b001f2b73b78d79aee6ff2cb9a92c2f405ef4 Gerrit-PatchSet: 1 Gerrit-Project: operations/puppet Gerrit-Branch: production Gerrit-Owner: Filippo Giunchedi <fgiunch...@wikimedia.org> _______________________________________________ MediaWiki-commits mailing list MediaWiki-commits@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits