Filippo Giunchedi has submitted this change and it was merged.

Change subject: swift: check high load average on backend machines
......................................................................


swift: check high load average on backend machines

use load average as a proxy for an xfs bug we get into from time to time on
backend machines.

Note: this is a bandaid to at least have notification whenever this occurs, at
the moment there's no such indication.

RT #8249

Change-Id: I4e1c263951060060d528299cdd04386ebc56ace3
---
M manifests/swift.pp
1 file changed, 4 insertions(+), 0 deletions(-)

Approvals:
  Filippo Giunchedi: Verified; Looks good to me, approved



diff --git a/manifests/swift.pp b/manifests/swift.pp
index 3d058d9..27ad5c4 100644
--- a/manifests/swift.pp
+++ b/manifests/swift.pp
@@ -313,6 +313,10 @@
             }
         }
         include nrpe
+        nrpe::monitor_service { 'load_average':
+            description  => 'very high load average, likely xfs',
+            nrpe_command => '/usr/lib/nagios/plugins/check_load -w 80,80,80 -c 
200,100,100',
+        }
 
         # RT-2593. Moved here from nrpe_local.cfg
         monitor_swift_daemon { [

-- 
To view, visit https://gerrit.wikimedia.org/r/157672
To unsubscribe, visit https://gerrit.wikimedia.org/r/settings

Gerrit-MessageType: merged
Gerrit-Change-Id: I4e1c263951060060d528299cdd04386ebc56ace3
Gerrit-PatchSet: 2
Gerrit-Project: operations/puppet
Gerrit-Branch: production
Gerrit-Owner: Filippo Giunchedi <fgiunch...@wikimedia.org>
Gerrit-Reviewer: Alexandros Kosiaris <akosia...@wikimedia.org>
Gerrit-Reviewer: Andrew Bogott <abog...@wikimedia.org>
Gerrit-Reviewer: Filippo Giunchedi <fgiunch...@wikimedia.org>
Gerrit-Reviewer: jenkins-bot <>

_______________________________________________
MediaWiki-commits mailing list
MediaWiki-commits@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits

Reply via email to