Filippo Giunchedi has submitted this change and it was merged. Change subject: swift: check high load average on backend machines ......................................................................
swift: check high load average on backend machines use load average as a proxy for an xfs bug we get into from time to time on backend machines. Note: this is a bandaid to at least have notification whenever this occurs, at the moment there's no such indication. RT #8249 Change-Id: I4e1c263951060060d528299cdd04386ebc56ace3 --- M manifests/swift.pp 1 file changed, 4 insertions(+), 0 deletions(-) Approvals: Filippo Giunchedi: Verified; Looks good to me, approved diff --git a/manifests/swift.pp b/manifests/swift.pp index 3d058d9..27ad5c4 100644 --- a/manifests/swift.pp +++ b/manifests/swift.pp @@ -313,6 +313,10 @@ } } include nrpe + nrpe::monitor_service { 'load_average': + description => 'very high load average, likely xfs', + nrpe_command => '/usr/lib/nagios/plugins/check_load -w 80,80,80 -c 200,100,100', + } # RT-2593. Moved here from nrpe_local.cfg monitor_swift_daemon { [ -- To view, visit https://gerrit.wikimedia.org/r/157672 To unsubscribe, visit https://gerrit.wikimedia.org/r/settings Gerrit-MessageType: merged Gerrit-Change-Id: I4e1c263951060060d528299cdd04386ebc56ace3 Gerrit-PatchSet: 2 Gerrit-Project: operations/puppet Gerrit-Branch: production Gerrit-Owner: Filippo Giunchedi <fgiunch...@wikimedia.org> Gerrit-Reviewer: Alexandros Kosiaris <akosia...@wikimedia.org> Gerrit-Reviewer: Andrew Bogott <abog...@wikimedia.org> Gerrit-Reviewer: Filippo Giunchedi <fgiunch...@wikimedia.org> Gerrit-Reviewer: jenkins-bot <> _______________________________________________ MediaWiki-commits mailing list MediaWiki-commits@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits