[prometheus-users] Re: Need help in promtool test - its failing for one alert or the other

2023-09-19 Thread radhamani...@gmail.com
exp:[ 0: Labels:{alert_owner="alerts-infra", alertname="MySQLReplicationDown", severity="High"} Annotations:{description="MySQL replication is broken. \n"} ], got:[] On Tuesday, September 19, 2023

[prometheus-users] Re: Need help in promtool test - its failing for one alert or the other

2023-09-19 Thread radhamani...@gmail.com
aracter in series sequence: '*' On Tuesday, September 19, 2023 at 1:38:12 PM UTC-7 radhamani...@gmail.com wrote: > Thankyou Brian for helping.. > > I am passing multiple values as , values: '1 1 1 1 1 0 0 0 0 -1 -1 -1 -1 > -1 -1' Is this not correct? &g

[prometheus-users] Re: Need help in promtool test - its failing for one alert or the other

2023-09-19 Thread radhamani...@gmail.com
t; > https://prometheus.io/docs/prometheus/latest/configuration/unit_testing_rules/#series > > On Tuesday, 19 September 2023 at 20:49:39 UTC+1 radhamani...@gmail.com > wrote: > >> Hi, >> I have below prometheus unit test >> >> I tried a lot of manipulations on input_se

[prometheus-users] Need help in promtool test - its failing for one alert or the other

2023-09-19 Thread radhamani...@gmail.com
Hi, I have below prometheus unit test I tried a lot of manipulations on input_series. But it is always throwing failure. It only evaluates the first value in the input series. Unit Testing: tests/infra_db_alerts-tests.yml FAILED: alertname: MySQLReplicationDown, time: 10m, exp

[prometheus-users] Re: got: [] in promtool unit test

2023-04-13 Thread radhamani...@gmail.com
rvice", > severity="Critical"} > Annotations:{description="Service memory has been at over > 90% for 1 minute. Service: network-service, DnsName: . \n"} > ] > > You can then modify your expectations to make them match what

[prometheus-users] got: [] in promtool unit test

2023-04-12 Thread radhamani...@gmail.com
Hi, I need help in this alert test. This is the alert rule I have, - alert: ServiceMemoryUsage90Percent expr: sum(container_memory_usage_bytes{job="cadvisor",container_label_io_rancher_stack_name!="", image=~"rancher.*"}) by (container_label_io_rancher_stack_name, instance, dns_name) /1024/10

[prometheus-users] Re: Getting deployment object label in Prometheus alert rules

2020-12-02 Thread radhamani...@gmail.com
(label_axway_com_team) kube_deployment_labels What is the issue with this command? On Wednesday, December 2, 2020 at 8:06:04 PM UTC-7 radhamani...@gmail.com wrote: > > Is there a way to show deployment object label in prometheus alert rules? > > I have a deplo

[prometheus-users] Getting deployment object label in Prometheus alert rules

2020-12-02 Thread radhamani...@gmail.com
Is there a way to show deployment object label in prometheus alert rules? I have a deployment object label set as teamname=teamA I want to show this label in KubeDeploymentReplicasMismatch alert. I could do the same for pods related alerts by modifying expression with, * on(namespace, pod) gro

[prometheus-users] what is the right way to setup alertmanager in Federation?

2020-11-24 Thread radhamani...@gmail.com
If we enable Alertmanagers in dedicated clusters, then how do we route the alerts to receiver ? Do we send the alerts to receivers directly from dedicated clusters? or should we route the alerts from dedicated clusters to central Federation Alertmanager? If yes, how to chain the alertmanagers?

[prometheus-users] Re: Inhibition with exceptions (negative matchers)

2020-11-23 Thread radhamani...@gmail.com
Hi, Did you find solution to this problem? I too have same issue. On Thursday, July 23, 2020 at 3:27:59 AM UTC-7 Denis Kranjčec wrote: > Hi, > > > We have a problem with inhibition rules because we need exceptions to > inhibition rules. > > > For example, we have 50 teams, and every team handles

[prometheus-users] Using jsonnet for grafana dashboard

2020-10-06 Thread radhamani...@gmail.com
I am trying jsonnet (my-custom-grafana.jsonnet) for grafana dashboard.I tried below code. local grafana = import 'grafonnet/grafana.libsonnet'; local dashboard = grafana.dashboard; local row = grafana.row; local prometheus = grafana.prometheus; local template = grafana.template; local graphPanel

[prometheus-users] Missing Kubernetes alerts in OpsGenie integration

2020-10-06 Thread radhamani...@gmail.com
Hi , I integrated Prometheus alerts to OpsGenie and MsTEAMS.I am seeing many alerts in MsTEAMS but not in OpsGenie. Alert such as KubeCronJobRunning is there in TEAMS but not in OpsGenie. I have enabled highest logging level "debug" in alertmanager but still no logs related to this alert in Aler

[prometheus-users] Using jsonnet to update Prometheus operator default rule

2020-10-01 Thread radhamani...@gmail.com
I am trying to override the Default rules in Kube-Prometheus-Stack. I have enabled default rules when Kube-Prometheus-Stack was deployed. I tried with jsonnet as shown in this example https://github.com/prometheus-operator/kube-prometheus/blob/master/docs/developing-prometheus-rules-and-grafa

[prometheus-users] multiple group_left in an expression

2020-09-10 Thread radhamani...@gmail.com
I want to add 3 metrics using two group_left.I currently have below expression..I also want to add *group_left* *(label_teamname) kube_pod_labels. *If I just add group_left to this command,it throws me aggregate group_left failed error. sum by (namespace, pod) ( max by(names

[prometheus-users] Re: How to do relabelling in Prometheus operator?

2020-09-01 Thread radhamani...@gmail.com
tor%2Fvalues.yaml&v=3> On Tuesday, September 1, 2020 at 5:07:22 PM UTC-7 radhamani...@gmail.com wrote: > > I want to do relabelling of some pod labels.But I am not sure which > configuration I need to update in Prometheus Operator. > Should I update the below config? >

[prometheus-users] How to do relabelling in Prometheus operator?

2020-09-01 Thread radhamani...@gmail.com
I want to do relabelling of some pod labels.But I am not sure which configuration I need to update in Prometheus Operator. Should I update the below config? alertmanager: config: Please advice.. -- You received this message because you are subscribed to the Google Groups "Prometheus Use

[prometheus-users] Looking for some Kubernetes custom alerts Prometheus rules

2020-08-28 Thread radhamani...@gmail.com
We are setting up missing alerts in our clusters that are not present in the default alert rules.I want to know if there are any alert rules already implemented for issues such as Unusal autoscaling activity ,Max capacity reached for the cluster,Failed Kubernetes jobs, Pods not running with des

[prometheus-users] Looking for some Kubernetes custom alerts configuration

2020-08-28 Thread radhamani...@gmail.com
We are setting up missing alerts in our clusters that are not present in the default alert rules.I want to know if there are any alert rules already implemented for issues such as Unusal autoscaling activity ,Max capacity reached for the cluster,Failed Kubernetes jobs, Pods not running with de

Re: [prometheus-users] Correlating different alerts to produce single alert

2020-08-27 Thread radhamani...@gmail.com
Thankyou very much..I will update you soon,, On Wednesday, August 26, 2020 at 10:40:42 PM UTC-7 Christian Hoffmann wrote: > Hi, > > On 8/27/20 2:19 AM, radhamani...@gmail.com wrote: > > If pods in two different namespaces go down,then we need to send a alert > > as an app

Re: [prometheus-users] Correlating different alerts to produce single alert

2020-08-26 Thread radhamani...@gmail.com
n"? I just wrote the pseudocode of the expr but I want to know if this expr simply works with just AND operator.. On Wednesday, August 26, 2020 at 2:19:13 PM UTC-7 Christian Hoffmann wrote: > Hi, > > On 8/26/20 11:00 PM, radhamani...@gmail.com wrote: > > I want to send alert b

[prometheus-users] Correlating different alerts to produce single alert

2020-08-26 Thread radhamani...@gmail.com
I want to send alert by doing some correlation based upon multiple alerts.For eg: if podA,podB,serviceA are all 100% down in two different namespaces(namespace1,namespace2),then I want to send alert like ApplicationA is down. Is this possible? How to do the correlation between different aler

[prometheus-users] How to add custom labels to the default Kubernetes rules?

2020-08-24 Thread radhamani...@gmail.com
How to add custom labels to the default Kubernetes rules in Prometheus-operator? I found we cannot update the existing default rules but we need to add new rules in the templates.how to add that? and what should be name the alerts?same name so it will override the defaults? -- You received th

[prometheus-users] What does count by job rule mean?

2020-08-24 Thread radhamani...@gmail.com
Targetdown alert rule shows before PromQL query, (count by(job, namespace, service) (up == 0) / count by(job, namespace, service) (up)) > 10