[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Tue 02 Aug 03:43:27 UTC 2016 Additional Info: CRITICAL: 22.22% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Mobile Main page is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Tue 02 Aug 02:50:15 UTC 2016 Additional Info: HTTP OK: HTTP/1.1 200 OK - 32491 bytes in 0.983 second response time

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Main page is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Tue 02 Aug 02:49:10 UTC 2016 Additional Info: HTTP OK: HTTP/1.1 200 OK - 44821 bytes in 1.919 second response time

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-text04/Puppet run is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-cache-text04 Address: 10.68.18.103 State: OK Date/Time: Tue 02 Aug 02:32:30 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-upload04/Puppet run is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-cache-upload04 Address: 10.68.18.109 State: OK Date/Time: Tue 02 Aug 02:13:42 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-elastic06/SSH is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-elastic06 Address: 10.68.17.186 State: OK Date/Time: Tue 02 Aug 02:11:07 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** RECOVERY alert - deployment-elastic06/SSH is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: SSH Host: deployment-elastic06 Address: 10.68.17.186 State: OK Date/Time: Tue 02 Aug 02:00:06 UTC 2016 Additional Info: SSH OK - OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7 (protocol 2.0)

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Main page is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Tue 02 Aug 01:59:08 UTC 2016 Additional Info: HTTP CRITICAL: HTTP/1.1 301 TLS Redirect - string 'Wikipedia' not found on

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-upload04/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-cache-upload04 Address: 10.68.18.109 State: CRITICAL Date/Time: Tue 02 Aug 01:58:43 UTC 2016 Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Mobile Main page is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Mobile Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Tue 02 Aug 01:55:15 UTC 2016 Additional Info: HTTP CRITICAL: HTTP/1.1 301 TLS Redirect - string 'Wikipedia' not found on

[Betacluster-alerts] ** PROBLEM alert - deployment-elastic06/SSH is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: SSH Host: deployment-elastic06 Address: 10.68.17.186 State: CRITICAL Date/Time: Tue 02 Aug 01:35:06 UTC 2016 Additional Info: Server answer:

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-text04/Puppet run is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-cache-text04 Address: 10.68.18.103 State: OK Date/Time: Tue 02 Aug 01:21:28 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-upload04/Puppet run is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-cache-upload04 Address: 10.68.18.109 State: OK Date/Time: Tue 02 Aug 01:07:42 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-upload04/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-cache-upload04 Address: 10.68.18.109 State: CRITICAL Date/Time: Tue 02 Aug 00:52:42 UTC 2016 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Tue 02 Aug 00:16:27 UTC 2016 Additional Info: CRITICAL: 44.44% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-cache-text04/Puppet run is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-cache-text04 Address: 10.68.18.103 State: OK Date/Time: Mon 01 Aug 23:50:28 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-text04/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-cache-text04 Address: 10.68.18.103 State: CRITICAL Date/Time: Mon 01 Aug 22:45:27 UTC 2016 Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-cache-upload04/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-cache-upload04 Address: 10.68.18.109 State: CRITICAL Date/Time: Mon 01 Aug 22:41:41 UTC 2016 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-changeprop/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-changeprop Address: 10.68.16.88 State: CRITICAL Date/Time: Mon 01 Aug 21:42:48 UTC 2016 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-aqs01 Address: 10.68.18.237 State: CRITICAL Date/Time: Mon 01 Aug 21:31:46 UTC 2016 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-db01/Puppet run is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-db01 Address: 10.68.21.154 State: OK Date/Time: Mon 01 Aug 21:14:11 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-db01/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-db01 Address: 10.68.21.154 State: CRITICAL Date/Time: Mon 01 Aug 21:09:11 UTC 2016 Additional Info: CRITICAL: 20.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-stream/Free space - all mounts is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Free space - all mounts Host: deployment-stream Address: 10.68.17.106 State: OK Date/Time: Mon 01 Aug 21:04:17 UTC 2016 Additional Info: OK: All targets OK

[Betacluster-alerts] ** RECOVERY alert - deployment-ms-be01/Puppet staleness is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet staleness Host: deployment-ms-be01 Address: 10.68.16.24 State: OK Date/Time: Mon 01 Aug 20:44:24 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [3600.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-parsoid09/Puppet run is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: Puppet run Host: deployment-parsoid09 Address: 10.68.20.142 State: OK Date/Time: Mon 01 Aug 19:11:34 UTC 2016 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-parsoid09/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-parsoid09 Address: 10.68.20.142 State: CRITICAL Date/Time: Mon 01 Aug 19:06:35 UTC 2016 Additional Info: CRITICAL: 16.67% of data above the critical threshold [0.0]

[Betacluster-alerts] Host DOWN alert for deployment-parsoid08!

2016-08-01 Thread shinken
Notification Type: PROBLEM Host: deployment-parsoid08 State: DOWN Address: 10.68.18.117 Info: CRITICAL - Host Unreachable (10.68.18.117) Date/Time: Mon 01 Aug 18:57:56 UTC 2016 ___ Betacluster-alerts mailing list Betacluster-alerts@lists.wikimedia.org

[Betacluster-alerts] ** RECOVERY alert - Generic Beta Cluster/English Wikipedia Main page is OK **

2016-08-01 Thread shinken
Notification Type: RECOVERY Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: OK Date/Time: Mon 01 Aug 18:43:11 UTC 2016 Additional Info: HTTP OK: HTTP/1.1 200 OK - 44838 bytes in 1.291 second response time

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki02/App Server Main HTTP Response is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: App Server Main HTTP Response Host: deployment-mediawiki02 Address: 10.68.16.127 State: CRITICAL Date/Time: Mon 01 Aug 18:38:42 UTC 2016 Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] ** PROBLEM alert - Generic Beta Cluster/English Wikipedia Main page is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: English Wikipedia Main page Host: Generic Beta Cluster Address: en.wikipedia.beta.wmflabs.org State: CRITICAL Date/Time: Mon 01 Aug 18:38:20 UTC 2016 Additional Info: CRITICAL - Socket timeout after 10 seconds

[Betacluster-alerts] ** PROBLEM alert - deployment-mediawiki02/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-mediawiki02 Address: 10.68.16.127 State: CRITICAL Date/Time: Mon 01 Aug 18:23:31 UTC 2016 Additional Info: CRITICAL: 30.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-sca01/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-sca01 Address: 10.68.20.183 State: CRITICAL Date/Time: Mon 01 Aug 14:59:14 UTC 2016 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-sca02/Puppet run is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet run Host: deployment-sca02 Address: 10.68.20.153 State: CRITICAL Date/Time: Mon 01 Aug 14:58:28 UTC 2016 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-ms-be01/Puppet staleness is CRITICAL **

2016-08-01 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-ms-be01 Address: 10.68.16.24 State: CRITICAL Date/Time: Mon 01 Aug 08:29:24 UTC 2016 Additional Info: CRITICAL: 22.22% of data above the critical threshold [43200.0]