[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Tue 12 Sep 05:36:47 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Tue 12 Sep 05:01:49 UTC 2017 Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Tue 12 Sep 04:56:47 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Tue 12 Sep 04:51:49 UTC 2017 Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Tue 12 Sep 04:46:47 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Puppet staleness is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet staleness Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Tue 12 Sep 02:24:33 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [43200.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-salt02/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-salt02 Address: 10.68.17.58 State: OK Date/Time: Mon 11 Sep 20:59:47 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0] ___ Betacluster-alerts mailing list

[Betacluster-alerts] ** RECOVERY alert - deployment-zookeeper02/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-zookeeper02 Address: 10.68.18.75 State: OK Date/Time: Mon 11 Sep 20:57:25 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-zookeeper02/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-zookeeper02 Address: 10.68.18.75 State: CRITICAL Date/Time: Mon 11 Sep 20:22:26 UTC 2017 Additional Info: CRITICAL: 66.67% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-salt02/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-salt02 Address: 10.68.17.58 State: CRITICAL Date/Time: Mon 11 Sep 20:19:47 UTC 2017 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mira/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: OK Date/Time: Mon 11 Sep 19:35:53 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-ores-redis-01/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: OK Date/Time: Mon 11 Sep 19:03:13 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mira/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: CRITICAL Date/Time: Mon 11 Sep 18:55:51 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-ores-redis-01/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-ores-redis-01 Address: 10.68.22.248 State: CRITICAL Date/Time: Mon 11 Sep 18:23:14 UTC 2017 Additional Info: CRITICAL: 33.33% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-mira/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: OK Date/Time: Mon 11 Sep 17:34:54 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-mira/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-mira Address: 10.68.20.135 State: CRITICAL Date/Time: Mon 11 Sep 16:54:52 UTC 2017 Additional Info: CRITICAL: 50.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-logstash2/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: OK Date/Time: Mon 11 Sep 15:49:20 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-logstash2/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-logstash2 Address: 10.68.16.147 State: CRITICAL Date/Time: Mon 11 Sep 15:14:22 UTC 2017 Additional Info: CRITICAL: 40.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-puppetmaster02/Long lived cherry-picks on puppetmaster is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Long lived cherry-picks on puppetmaster Host: deployment-puppetmaster02 Address: 10.68.21.200 State: OK Date/Time: Mon 11 Sep 14:35:08 UTC 2017 Additional Info: OK: Less than 100.00% above the threshold [0.0]

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Mon 11 Sep 13:35:48 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Mon 11 Sep 13:30:48 UTC 2017 Additional Info: WARNING: 20.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** RECOVERY alert - deployment-imagescaler02/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-imagescaler02 Address: 10.68.18.233 State: OK Date/Time: Mon 11 Sep 13:30:29 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is WARNING **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: WARNING Date/Time: Mon 11 Sep 12:42:14 UTC 2017 Additional Info: WARNING: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: CRITICAL Date/Time: Mon 11 Sep 12:27:15 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** RECOVERY alert - deployment-aqs01/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: OK Date/Time: Mon 11 Sep 12:10:58 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-aqs01/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-aqs01 Address: 10.68.18.237 State: CRITICAL Date/Time: Mon 11 Sep 11:36:00 UTC 2017 Additional Info: CRITICAL: 60.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-tin/Free space - all mounts is WARNING **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-tin Address: 10.68.21.205 State: WARNING Date/Time: Mon 11 Sep 11:17:15 UTC 2017 Additional Info: WARNING: deployment-prep.deployment-tin.diskspace._mnt.byte_percentfree (No valid datapoints

[Betacluster-alerts] ** RECOVERY alert - deployment-imagescaler02/Puppet errors is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Puppet errors Host: deployment-imagescaler02 Address: 10.68.18.233 State: OK Date/Time: Mon 11 Sep 10:04:31 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Free space - all mounts is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Free space - all mounts Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Mon 11 Sep 10:02:04 UTC 2017 Additional Info: CRITICAL: deployment-prep.deployment-kafka01.diskspace.root.byte_percentfree (<100.00%)

[Betacluster-alerts] ** RECOVERY alert - Graphite Labs/Mediawiki Error Rate is OK **

2017-09-11 Thread shinken
Notification Type: RECOVERY Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: OK Date/Time: Mon 11 Sep 08:54:49 UTC 2017 Additional Info: OK: Less than 1.00% above the threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - deployment-kafka01/Puppet errors is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Puppet errors Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL Date/Time: Mon 11 Sep 06:57:17 UTC 2017 Additional Info: CRITICAL: 100.00% of data above the critical threshold [0.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is WARNING **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: WARNING Date/Time: Mon 11 Sep 06:09:46 UTC 2017 Additional Info: WARNING: 80.00% of data above the warning threshold [1.0]

[Betacluster-alerts] ** PROBLEM alert - Graphite Labs/Mediawiki Error Rate is CRITICAL **

2017-09-11 Thread shinken
Notification Type: PROBLEM Service: Mediawiki Error Rate Host: Graphite Labs Address: graphite-labs.wikimedia.org State: CRITICAL Date/Time: Mon 11 Sep 06:04:47 UTC 2017 Additional Info: CRITICAL: 20.00% of data above the critical threshold [10.0]
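
The notifications in this listing all share one flattened layout: a fixed sequence of "Label: value" fields on a single line. As a rough sketch only, and not anything shipped with Shinken or used by this list, a line of that shape could be split into its fields as follows. The function name parse_notification and the FIELDS list are hypothetical; the labels are taken from the messages above, and the approach assumes no field value itself contains one of those labels followed by a colon and a space.

```python
import re

# Field labels exactly as they appear in the notifications above (assumed fixed).
FIELDS = [
    "Notification Type", "Service", "Host", "Address",
    "State", "Date/Time", "Additional Info",
]

# Match any known label followed by ": "; the capture group makes
# re.split() keep the labels in its result.
_LABEL_RE = re.compile("(" + "|".join(re.escape(f) for f in FIELDS) + "): ")


def parse_notification(line):
    """Split one flattened notification line into a {label: value} dict."""
    parts = _LABEL_RE.split(line.strip())
    # parts is [text before first label, label1, value1, label2, value2, ...]
    return {
        parts[i]: parts[i + 1].strip()
        for i in range(1, len(parts) - 1, 2)
    }


if __name__ == "__main__":
    sample = (
        "Notification Type: PROBLEM Service: Puppet staleness "
        "Host: deployment-kafka01 Address: 10.68.21.219 State: CRITICAL "
        "Date/Time: Tue 12 Sep 02:24:33 UTC 2017 Additional Info: "
        "CRITICAL: 100.00% of data above the critical threshold [43200.0]"
    )
    for label, value in parse_notification(sample).items():
        print(label, "=>", value)
```

Because the split keys on the known labels rather than on every colon, timestamps such as 02:24:33 and the "CRITICAL:" prefix inside Additional Info stay within their values. Truncated messages (for example those ending in "(No valid datapoints") are returned as-is, with whatever text survived.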