[Betacluster-alerts] ** PROBLEM alert - deployment-cache-upload04/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-cache-upload04 Address: 10.68.18.109 State: CRITICAL Date/Time: Thu 11 Aug 04:49:32 UTC 2016 Additional Info: CRITICAL - Socket timeout after 10 seconds ___ Betacluster-alerts mailing list Betac

[Betacluster-alerts] Host DOWN alert for deployment-parsoid05!

2016-08-10 Thread shinken
Notification Type: PROBLEM Host: deployment-parsoid05 State: DOWN Address: 10.68.16.120 Info: CRITICAL - Host Unreachable (10.68.16.120) Date/Time: Thu 11 Aug 02:31:07 UTC 2016 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org ht

[Betacluster-alerts] Host UP alert for deployment-parsoid05!

2016-08-10 Thread shinken
Notification Type: RECOVERY Host: deployment-parsoid05 State: UP Address: 10.68.16.120 Info: PING OK - Packet loss = 0%, RTA = 0.99 ms Date/Time: Thu 11 Aug 02:26:43 UTC 2016

[Betacluster-alerts] ** PROBLEM alert - deployment-puppetmaster/Long lived cherry-picks on puppetmaster is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: Long lived cherry-picks on puppetmaster Host: deployment-puppetmaster Address: 10.68.16.63 State: CRITICAL Date/Time: Wed 10 Aug 19:51:08 UTC 2016 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] Host UP alert for beta-cluster!

2016-08-10 Thread shinken
Notification Type: RECOVERY Host: beta-cluster State: UP Address: en.wikipedia.beta.wmflabs.org Info: PING OK - Packet loss = 0%, RTA = 0.67 ms Date/Time: Wed 10 Aug 18:49:53 UTC 2016

[Betacluster-alerts] Host DOWN alert for beta-cluster!

2016-08-10 Thread shinken
Notification Type: PROBLEM Host: beta-cluster State: DOWN Address: en.wikipedia.beta.wmflabs.org Info: PING CRITICAL - Packet loss = 100% Date/Time: Wed 10 Aug 18:26:45 UTC 2016

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 10 Aug 16:52:15 UTC 2016 Additional Info: Server answer:

[Betacluster-alerts] ** RECOVERY alert - deployment-redis02/SSH is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: OK Date/Time: Wed 10 Aug 16:46:15 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 10 Aug 16:36:15 UTC 2016 Additional Info: Server answer:

[Betacluster-alerts] ** RECOVERY alert - deployment-redis02/SSH is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: OK Date/Time: Wed 10 Aug 16:30:15 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** PROBLEM alert - deployment-changeprop/Puppet run is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-changeprop Address: 10.68.16.88 State: CRITICAL Date/Time: Wed 10 Aug 16:21:47 UTC 2016 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 10 Aug 15:20:15 UTC 2016 Additional Info: Server answer:

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-upload04/Puppet staleness is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-cache-upload04 Address: 10.68.18.109 State: CRITICAL Date/Time: Wed 10 Aug 15:05:15 UTC 2016 Additional Info: CRITICAL: 11.11% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-eventlogging04/Puppet run is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-eventlogging04 Address: 10.68.23.204 State: OK Date/Time: Wed 10 Aug 14:52:17 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-eventlogging04/Puppet run is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-eventlogging04 Address: 10.68.23.204 State: CRITICAL Date/Time: Wed 10 Aug 14:22:17 UTC 2016 Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-redis02/SSH is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: OK Date/Time: Wed 10 Aug 12:08:15 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 10 Aug 11:38:14 UTC 2016 Additional Info: Server answer:

[Betacluster-alerts] ** RECOVERY alert - deployment-redis02/SSH is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: OK Date/Time: Wed 10 Aug 11:26:14 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 10 Aug 09:46:15 UTC 2016 Additional Info: Server answer:

[Betacluster-alerts] ** RECOVERY alert - deployment-redis02/SSH is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: OK Date/Time: Wed 10 Aug 09:40:14 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 10 Aug 09:20:14 UTC 2016 Additional Info: Server answer:

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki03/App Server Main HTTP Response is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki03 Address: 10.68.17.55 State: OK Date/Time: Wed 10 Aug 08:52:19 UTC 2016 Additional Info: HTTP OK: HTTP/1.1 200 OK - 44522 bytes in 1.403 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki01/App Server Main HTTP Response is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki01 Address: 10.68.17.170 State: OK Date/Time: Wed 10 Aug 08:49:00 UTC 2016 Additional Info: HTTP OK: HTTP/1.1 200 OK - 44527 bytes in 0.973 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-mediawiki02/App Server Main HTTP Response is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: App Server Main HTTP Response Host: deployment-mediawiki02 Address: 10.68.16.127 State: OK Date/Time: Wed 10 Aug 08:48:38 UTC 2016 Additional Info: HTTP OK: HTTP/1.1 200 OK - 44529 bytes in 5.633 second response time

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Wed 10 Aug 08:06:16 UTC 2016 Additional Info: HTTP CRITICAL: HTTP/1.1 301 TLS Redirect - string 'Wikipedia' not found on 'http://e

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Main page is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Wed 10 Aug 08:05:08 UTC 2016 Additional Info: HTTP CRITICAL: HTTP/1.1 301 TLS Redirect - string 'Wikipedia' not found on 'http://en.wikip

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki01/App Server Main HTTP Response is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki01 Address: 10.68.17.170 State: CRITICAL Date/Time: Wed 10 Aug 07:14:01 UTC 2016 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 3230 bytes in 0.080 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki02/App Server Main HTTP Response is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki02 Address: 10.68.16.127 State: CRITICAL Date/Time: Wed 10 Aug 07:13:33 UTC 2016 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 3232 bytes in 1.078 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki03/App Server Main HTTP Response is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki03 Address: 10.68.17.55 State: CRITICAL Date/Time: Wed 10 Aug 07:12:20 UTC 2016 Additional Info: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 3230 bytes in 0.076 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-redis02/SSH is OK **

2016-08-10 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: OK Date/Time: Wed 10 Aug 07:11:15 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** PROBLEM alert - deployment-redis02/SSH is CRITICAL **

2016-08-10 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-redis02 Address: 10.68.16.231 State: CRITICAL Date/Time: Wed 10 Aug 07:01:16 UTC 2016 Additional Info: Server answer: