Re: [prometheus-users] Target Server - Which Prometheus Server Is Scraping

2022-02-15 Thread Stuart Clark
are scraping at the same time but there is no way the scrapes are happening every minute.  The scrape interval is 3m with a scrape time out of 2m45s. So are you seeing more frequent requests than you expect? How are you telling this? Do you have request logs & what do they say/record? -- St

Re: [prometheus-users] Target Server - Which Prometheus Server Is Scraping

2022-02-14 Thread Stuart Clark
target_server_123.  On target_server_123 is there something on this server that says Prom_server_a is scraping right now. Prom_server_b is scraping right now? The source IP address in any logs? -- Stuart Clark -- You received this message because you are subscribed to the Google Groups "Prome

Re: [prometheus-users] Target issue

2022-02-10 Thread Stuart Clark
tion. If that doesn't work it could be due to incorrect IP/port details being included in the configuration, firewalls not allowing access, missing routing information or applications not running & exposing a web server on the correct port. -- Stuart Clark

Re: [prometheus-users] Prometheus installation

2022-02-08 Thread Stuart Clark
Yes it is common to run Prometheus in a Docker container (for example inside Kubernetes) with an image being produced for this purpose. It is also common to run pairs for HA purposes, generally configured to run on different underlying hardware and/or availability zones. With regards to the

Re: [prometheus-users] Evaulating Datadog as a central metrics storage, conformance?

2022-02-04 Thread Stuart Clark
dn't think it worked for what we had/wanted (continuing with our existing Prometheus servers instead of replacing everything with their agent and ingesting Prometheus formatted metrics directly to Datadog). What are you looking for that your current setup doesn't give you? -- Stuart Clark

Re: [prometheus-users] change when week starts / change aligment?

2022-02-02 Thread Stuart Clark
ample at 1642982400 (Mon Jan 24 2022) and goes from there weekly? Yes. You need to ensure that you are sending the correct date/time point/range for the query, which defaults to "now" if not set at all. -- Stuart Clark

Re: [prometheus-users] Suggestion on reducing cardinality

2022-01-21 Thread Stuart Clark
code. Can i put a first aid any way to reduce the cardinality. (this metric) is too important to drop some thing like project wise etc by writing rewrite rule. if yes please advice how ? You could drop that label with metric relabeling. -- Stuart Clark
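A minimal sketch of what dropping a label during metric relabeling could look like. The job name, target address and label name are hypothetical and not taken from the thread. Note that if removing the label leaves two series with identical label sets, the scrape will report duplicates, so this only helps when the remaining labels still identify each series.

```yaml
scrape_configs:
  - job_name: example_app                        # hypothetical job name
    static_configs:
      - targets: ['app.example.internal:8080']   # hypothetical target
    metric_relabel_configs:
      # Applied after the scrape, before samples are stored:
      # remove the high-cardinality label from every scraped series.
      - action: labeldrop
        regex: request_id                        # hypothetical label name
```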

Re: [prometheus-users] Common tag on metrics

2022-01-21 Thread Stuart Clark
On 2022-01-21 10:35, ritesh patel wrote: Hello Team, How I can add common tags on metrics? Any idea how this can be achievable? You can add labels via relabeling, but in general that is not advised. What are you trying to achieve? -- Stuart Clark

Re: [prometheus-users] Related to TLS connection

2022-01-13 Thread Stuart Clark
targets to be using TLS (although that is generally good security practice to always use https). You just need to set the scheme in your job configuration to http. -- Stuart Clark
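For illustration, a scrape job that talks plain HTTP to its targets might look like the fragment below. The job name and target are hypothetical. `http` is already the default scheme, so the line mainly matters if a job was previously switched to `https`.

```yaml
scrape_configs:
  - job_name: plain_http_app                     # hypothetical job name
    scheme: http                                 # scrape without TLS (this is the default)
    static_configs:
      - targets: ['app.example.internal:8080']   # hypothetical target
```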

Re: [prometheus-users] metrics are not updated

2022-01-06 Thread Stuart Clark
interval set to between about 15 seconds and say a minute. -- Stuart Clark

Re: [prometheus-users] metrics are not updated

2022-01-06 Thread Stuart Clark
t updated, do you mean running a query in Prometheus doesn't reflect any metrics published by your application? What is the scrape interval for the job that scrapes Pushgateway? -- Stuart Clark

Re: [prometheus-users] Re: Adding a Label to a special Metric from the node exporter.

2021-12-26 Thread Stuart Clark
a recording rule as suggested. -- Stuart Clark

Re: [prometheus-users] Copyright

2021-12-25 Thread Stuart Clark
sing. -- Stuart Clark

Re: [prometheus-users] Prometheus resolve alert thrown when no metric is available

2021-12-24 Thread Stuart Clark
the 99th percentile is below 10ms or if the metric no longer exists. -- Stuart Clark

Re: [prometheus-users] Regarding no route to host error in Prometheus for oracle db exporter metrics

2021-12-21 Thread Stuart Clark
which is only available from within that VM. You'll need to expose the port and then configure Prometheus to use the IP address of VM 2. -- Stuart Clark

Re: [prometheus-users] same port/exporter, but different job labels

2021-12-13 Thread Stuart Clark
://www.robustperception.io/how-to-have-labels-for-machine-roles -- Stuart Clark

Re: [prometheus-users] Convert metrics in Open telemetry format to promethues exposition format

2021-12-13 Thread Stuart Clark
(or whatever they use) for what is available? Worst case the format is published so you could write a parser in the language of your choice. -- Stuart Clark

Re: [prometheus-users] Convert metrics in Open telemetry format to promethues exposition format

2021-12-13 Thread Stuart Clark
library: https://github.com/prometheus/client_python -- Stuart Clark

Re: [prometheus-users] Convert metrics in Open telemetry format to promethues exposition format

2021-12-13 Thread Stuart Clark
cation which listens for Open Telemetry data, stores it in memory and then returns the latest set of values when a Prometheus scrape is received. -- Stuart Clark

Re: [prometheus-users] Re: Timezone in prometheus dashboard

2021-12-12 Thread Stuart Clark
. Definitely not if you are instrumenting an application, with one of the few use-cases being connecting a third-party metrics system into Prometheus. -- Stuart Clark

Re: [prometheus-users] Re: Mutiple PromQLs for each expr in alert rules

2021-12-06 Thread Stuart Clark
labels & their values. -- Stuart Clark

Re: [prometheus-users] Re: Mutiple PromQLs for each expr in alert rules

2021-12-06 Thread Stuart Clark
scription: "[TEST] CPU load is > 80%\n VALUE = {{ $value }}\n  LABELS: {{ $labels }}" #rules for Are there any labels for container_cpu_usage_seconds_total & node_cpu_seconds_total which match? -- Stuart Clark

Re: [prometheus-users] How to differentiate similar metrics in grafana by different datasources?

2021-11-26 Thread Stuart Clark
afana variable). -- Stuart Clark

Re: [prometheus-users] How to differentiate similar metrics in grafana by different datasources?

2021-11-26 Thread Stuart Clark
nd the services >related to the specific job. > > >On Friday, November 26, 2021 at 3:27:17 PM UTC+1 Stuart Clark wrote: > >> On 26/11/2021 14:08, Mohammad Khoschnasar wrote: >> > Hi all, >> > >> > I'm a newbie in prometheus. So go easy on me please. >> >

Re: [prometheus-users] How to differentiate similar metrics in grafana by different datasources?

2021-11-26 Thread Stuart Clark
in your case you'd only have a single one as you only have one Prometheus server. You can then use PromQL to query your metrics. Each job automatically gets both instance & job labels added, so you'd be able to differentiate between your two apps. -- Stuart Clark

Re: [prometheus-users] How to run pod based on prometheus alert

2021-11-16 Thread Stuart Clark
ll app (using whatever language you wish) that listens for webhook notifications from Alertmanager and then executes your code when triggered. You can then set your alert routing to send the notification for the alert(s) you wish. -- Stuart Clark
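The routing half of that suggestion could be sketched in the Alertmanager configuration roughly as follows. The receiver names, alert name and webhook URL are all hypothetical, and the `matchers` syntax assumes a reasonably recent Alertmanager release. The small app that receives the webhook and starts the pod is not shown.

```yaml
route:
  receiver: default                # everything else goes here
  routes:
    # Send only the chosen alert to the webhook receiver.
    - matchers:
        - 'alertname = "RunCleanupJob"'   # hypothetical alert name
      receiver: job-runner
receivers:
  - name: default
  - name: job-runner
    webhook_configs:
      # Hypothetical URL of the small app that runs your code.
      - url: http://job-runner.example.internal:8080/alerts
```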

Re: [prometheus-users] Regarding timestamp in Prometheus using C++ library

2021-10-17 Thread Stuart Clark
the more usual case of instrumenting your own application they should not be used at all. Instead just set the gauge values or increment counters during the normal flow of your application and then Prometheus will pick up the most current values every time a scrape is initiated. -- Stuart Cl

Re: [prometheus-users] node_exporter save to file to transport stats

2021-10-13 Thread Stuart Clark
ning the textfile collector. -- Stuart Clark

Re: [prometheus-users] Re: Filter metrics based on timestamp

2021-10-13 Thread Stuart Clark
han making a string out of YmdHMS string values. -- Stuart Clark

Re: [prometheus-users] Prometheus is now available on AWS. Has anyone from the project been involved?

2021-10-04 Thread Stuart Clark
y". In addition to company contributions there are also a healthy number of individual contributors, which don't suffer from the same financial limitations (just people being willing to offer their time). -- Stuart Clark

Re: [prometheus-users] re: basic_auth: fails in prometheus.yml

2021-09-23 Thread Stuart Clark
for password Can anyone enlighten me on this ? We would need to see more of your config file to be sure, but it sounds like you have the wrong level of indentation. -- Stuart Clark
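As a point of comparison, a sketch of where `basic_auth` is expected to sit: at the same level as `job_name`, with `username` and the password setting nested one level beneath it. The job name, target, username and file path are hypothetical, since the original config is not shown in the thread.

```yaml
scrape_configs:
  - job_name: secured_app                        # hypothetical job name
    basic_auth:                                  # sibling of job_name, not nested under static_configs
      username: prometheus                       # hypothetical username
      password_file: /etc/prometheus/secured_app.password   # hypothetical path
    static_configs:
      - targets: ['app.example.internal:8443']   # hypothetical target
```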

Re: [prometheus-users] No Telemetry data is sending resolved alert

2021-09-20 Thread Stuart Clark
me know if there is way to fix this thing. I'm not quite clear what you are wanting to fix? Alerts will resolve if the alert query stops returning anything, which seems to be what you are saying is happening. What are you wanting to happen? -- Stuart Clark

Re: [prometheus-users] How to explain that almost all programs use http.Handle("/metrics", promhttp.Handler())

2021-09-16 Thread Stuart Clark
the virtual memory usage for things running on the same box, but the other two don't sound that useful to aggregate. -- Stuart Clark

Re: [prometheus-users] How to explain that almost all programs use http.Handle("/metrics", promhttp.Handler())

2021-09-16 Thread Stuart Clark
ics around the application, allowing you basic visibility without writing any code. Each metric is different, with labels used to split things as needed (e.g. for histograms). What is the issue you are seeing? -- Stuart Clark

Re: [prometheus-users] Is it possible to write data to Prometheus?

2021-09-16 Thread Stuart Clark
for specific use cases (e.g. cron jobs) rather than being designed to convert Prometheus to push based. There are other tools to allow systems behind firewalls, etc. to operate as well. What is the specific use case/issue with using Prometheus as a pull based mechanism? -- Stuart Clark

Re: [prometheus-users] Single timeseries for a label having different values

2021-09-15 Thread Stuart Clark
be varying over time, with labels able to group together related time series (e.g. different servers running the same software). -- Stuart Clark

Re: Private message regarding: [prometheus-users] AWS CloudWatch Exporter Multiply Regions

2021-09-06 Thread Stuart Clark
might show some errors. Regarding scrape intervals the maximum that can be used is generally 2-2.5 minutes, so you should reduce the interval from 5 minutes (due to staleness handling). Have you configured the exporter with the details of what metrics you are wanting to include? -- Stuart

Re: [prometheus-users] AWS CloudWatch Exporter Multiply Regions

2021-09-06 Thread Stuart Clark
within the different regions, but it might not be worth the extra cost/complexity. How frequently are you wanting to scrape those exporters? As CloudWatch isn't real-time I don't generally scrape them at a high frequency. -- Stuart Clark

Re: [prometheus-users] Re: Understanding the parameter endsAt and resend-delay.

2021-08-31 Thread Stuart Clark
In general I don't think you can. You can use the absent function, but that will also alert if the scrape fails (e.g. the host is down). You could also use one of the over_time functions, but that wouldn't be based on the last sample before scrapes stopped. However, taking a step back, why do you

Re: [prometheus-users] Alert History (Show on Grafana Dashboard)

2021-08-31 Thread Stuart Clark
On 2021-08-31 10:06, Brian Candler wrote: On Tuesday, 31 August 2021 at 09:40:38 UTC+1 Stuart Clark wrote: There is a specific Grafana datasource for Alertmanager which might be useful. Is that for the metrics returned by :9093/metrics ? If so, then that's mainly for aggregate alerting

Re: [prometheus-users] Alert History (Show on Grafana Dashboard)

2021-08-31 Thread Stuart Clark
of the alertmanager on table dashboard what should I do? There is a specific Grafana datasource for Alertmanager which might be useful. -- Stuart Clark

Re: [prometheus-users]  Prometheus version upgrade, with helm chart

2021-08-26 Thread Stuart Clark
What errors? On 26 August 2021 19:27:41 BST, James S wrote: >But I am running into errors when I upgrade to the latest versions, for >Prometheus, node-exporter and grafana > >On Thursday, August 26, 2021 at 2:24:46 PM UTC-4 sup...@gmail.com wrote: > >> Yes, just updating the image versions is

Re: [prometheus-users] Request help for pilot testing.

2021-08-26 Thread Stuart Clark
themselves rather than being broad experts on everything). -- Stuart Clark

Re: [prometheus-users] Request help for pilot testing.

2021-08-24 Thread Stuart Clark
monitoring solutions? Brds, Henriett Stuart Clark wrote (Saturday, 21 August 2021, 23:36:02 UTC+2): On 21/08/2021 21:44, Henriett Braunné Bokor wrote: there are many questions read below: - It can be installed in such a way that in case of an OS / HW, in case of o

Re: [prometheus-users] Re: Prometheus self scrape job taking too much memory

2021-08-23 Thread Stuart Clark
OOMKilled. So at the moment it isn't scraping anything other than itself via the /metrics endpoint? Is this a brand new service (i.e. no existing data stored on disk)? Is there anything querying the server (e.g. Grafana dashboards, etc.)? -- Stuart Clark

Re: [prometheus-users] How to check Credential Application availability using Blackbox

2021-08-23 Thread Stuart Clark
Each authenticated endpoint (or set if they share the same credentials) would have a job within the Prometheus configuration file. You can then set those credentials (for example supplying a username and password for basic auth). On 23 August 2021 05:16:34 BST, karthick MJ wrote: >Hello Team,

Re: [prometheus-users] "Correct" usage of pushgateway for short running daily jobs

2021-08-23 Thread Stuart Clark
> >Unfortunately I don't understand the reason when and why I would include an >instance when pushing to the pushgateway. > >Stuart Clark wrote on Sunday, 22 August 2021 at 22:09:58 UTC+2: > >> You don't want to include any sort of timestamp otherwise you won't be >> rep

Re: [prometheus-users] "Correct" usage of pushgateway for short running daily jobs

2021-08-22 Thread Stuart Clark
>pushgateway after some time. In my case I only push a handful of metrics >every day (so it's really little data) > > >Alerting: >Ideally I want to create alerts that tell me which instance of the job has >failed, so that I know for which date I need to re-run the job. >

Re: [prometheus-users] "Correct" usage of pushgateway for short running daily jobs

2021-08-22 Thread Stuart Clark
It sounds like you just need a label for the customer (X, Y or Z) and the job (A, B or C). You don't want anything about timestamps in the labels. On 22 August 2021 16:15:01 BST, David Leonhartsberger wrote: >I have read the documentation about prometheus and the pushgateway but I >still don't

Re: [prometheus-users] Request help for pilot testing.

2021-08-21 Thread Stuart Clark
to which group the given alarm belongs. For example: Linux server CPU load Linux group, B linux server CPU load DBA group, etc.… Sorry these don't read as questions. Could you maybe rephrase them as specific things you are wanting to ask? -- Stuart Clark

Re: [prometheus-users] Request help for pilot testing.

2021-08-21 Thread Stuart Clark
c.… Many thanks for your help. What are your actual questions? That just looks to be a set of requirements. Have you actually tried out Prometheus? -- Stuart Clark

Re: [prometheus-users] Different Prometheus metrics for different trials

2021-08-04 Thread Stuart Clark
different points, etc. Are you seeing wildly different results between your runs? -- Stuart Clark

Re: [prometheus-users] Re: regarding pprof

2021-07-23 Thread Stuart Clark
Hi All, In one of our production setups, we have configured prometheus HA on Virtual machines(node0, node1). I see that node0 prometheus takes around 5gb of ram and node1 takes just 1gb of ram. Are you performing queries on both nodes or just node0? -- Stuart Clark

Re: [prometheus-users] how to find metric that rarely never be queried?

2021-07-19 Thread Stuart Clark
situation of a scrape target producing thousands of useless time series. -- Stuart Clark

Re: [prometheus-users] how to find metric that rarely never be queried?

2021-07-19 Thread Stuart Clark
metrics you actually wanted to keep. -- Stuart Clark

Re: [prometheus-users] False Positive Alerts for node CPU Usage for one node

2021-07-08 Thread Stuart Clark
On 2021-07-08 16:07, James S wrote: We do not see any stress on the cluster and we do not see this behavior in GCP cloud monitoring. What does the graph of the metric look like? Is this a single or multiple CPU machine? On Thursday, July 8, 2021 at 9:50:37 AM UTC-4 Stuart Clark wrote

Re: [prometheus-users] False Positive Alerts for node CPU Usage for one node

2021-07-08 Thread Stuart Clark
ord: instance:node_cpu_utilisation:rate1m - expr: 1 - avg without (cpu, mode) (rate(node_cpu_seconds_total{job="node_exporter", mode="idle"}[1m])) What makes you say it is a false positive? What does the graph of that metric show? -- Stuart Clark

Re: [prometheus-users] Grafana Alerts: different conditions for different labels?

2021-07-07 Thread Stuart Clark
. Is this possible? That sounds like a Grafana specific question, so while there are probably some people on this mailing list who might know I'd also suggest posting to one of the Grafana lists... -- Stuart Clark

Re: [prometheus-users] Grafana Alerts: different conditions for different labels?

2021-07-07 Thread Stuart Clark
a single query: rate(my_counter[30s]), and create different conditions for mylabel=1, mylabel=2, mylabel=3. Is it possible? You can use a regex match (=~ instead of =) to match all three of the values in a single query. -- Stuart Clark
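To make the regex matcher concrete, here is a sketch of the expression placed inside a Prometheus rule file. The metric and label names come from the question; the group name and rule name are hypothetical, and the same `expr` could equally be pasted into a Grafana query field.

```yaml
groups:
  - name: mylabel_example                        # hypothetical group name
    rules:
      - record: mylabel:my_counter:rate30s       # hypothetical rule name
        # =~ takes a regular expression, so one selector
        # covers mylabel="1", "2" and "3".
        expr: rate(my_counter{mylabel=~"1|2|3"}[30s])
```

The result still carries the `mylabel` label, so each of the three values remains a separate series.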

Re: [prometheus-users] Scrape Interval not working for push gateway

2021-07-07 Thread Stuart Clark
. The maximum scrape interval for Prometheus should be no more than about 2 minutes, so I'd suggest adjusting your configuration to scrape the push gateway at that frequency (no problem with only pushing to the gateway every 15 minutes). -- Stuart Clark
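A sketch of a Pushgateway scrape job along those lines. The job name and host are hypothetical, 9091 is the Pushgateway's default port, and `honor_labels: true` is the usual setting so that the labels supplied by the pushing job are kept.

```yaml
scrape_configs:
  - job_name: pushgateway                        # hypothetical job name
    honor_labels: true                           # keep job/instance labels as pushed
    scrape_interval: 2m                          # stay at or below about 2 minutes
    static_configs:
      - targets: ['pushgateway.example.internal:9091']   # hypothetical host
```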

Re: [prometheus-users] Multiple Prometheus monitor the same targets, and each node's Prometheus will call the remote write, resulting in data duplication

2021-06-28 Thread Stuart Clark
Deduplication is handled by the remote write destination, such as Thanos. Usually there is the requirement to add an external label identifying the cluster and the replica to allow this to happen. On 28 June 2021 08:07:44 BST, ronghao li wrote: >Multiple Prometheus monitor the same targets,
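As an illustration of the external labels mentioned above, each Prometheus replica could carry a fragment like the one below. The label names, their values and the remote write URL are assumptions; the label names must match whatever the remote system is configured to deduplicate on.

```yaml
global:
  external_labels:
    cluster: prod-eu-1             # hypothetical: same on every replica in the cluster
    replica: prometheus-0          # hypothetical: differs per replica (e.g. prometheus-1)
remote_write:
  # Hypothetical endpoint of the remote write destination.
  - url: https://remote-storage.example.internal/api/v1/receive
```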

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-10 Thread Stuart Clark
On 10/06/2021 09:51, nina guo wrote: So whether we need to do some configuration for alert manager or it is enabled by default? Take a look at the documentation: https://github.com/prometheus/alertmanager#high-availability -- Stuart Clark

Re: [prometheus-users] How to show Gauge with specific label

2021-06-10 Thread Stuart Clark
On 10/06/2021 08:03, stav alfi wrote: Thanks! What does it mean to sum a Gauge? That very much depends on what the metric means. There are other aggregation operators available for when sum isn't appropriate - min, max, count, etc. -- Stuart Clark

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-10 Thread Stuart Clark
On 10/06/2021 07:53, nina guo wrote: So for alert manager, the alerts will be sent to every member of an Alertmanager cluster before firing. Is it correct? But when the alerts are fired, it will only have 1 notification eg. in  destination Email or Slack? That is correct. -- Stuart Clark

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-10 Thread Stuart Clark
and will continue to work even if cluster instances stop operating. -- Stuart Clark

Re: [prometheus-users] How to show Gauge with specific label

2021-06-10 Thread Stuart Clark
e specific label: "table_name". You can use aggregation operators to merge together different time series. For example "sum(singer_table_name_estimated_count) by (table_name)". -- Stuart Clark
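If the aggregated value is wanted as a series of its own, the same expression could be kept in a recording rule. This is only a sketch: the group and rule names are hypothetical, while the metric and label names are the ones quoted in the reply.

```yaml
groups:
  - name: singer_tables                          # hypothetical group name
    rules:
      - record: table_name:singer_table_name_estimated_count:sum   # hypothetical rule name
        # One output series per table_name; all other labels are summed away.
        expr: sum by (table_name) (singer_table_name_estimated_count)
```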

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-09 Thread Stuart Clark
p for each instance? I'm not sure if it is the correct way. Yes you would need to have separate configmaps for each set of targets. -- Stuart Clark

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-09 Thread Stuart Clark
PM UTC+8 Stuart Clark wrote: When doing autoscaling (not specifically with Prometheus but everything) you need to ensure that you don't have too many changes happening at once, otherwise you might start rejecting requests (if all instances are restarting at the same time

Re: [prometheus-users] Can I export all data of to other prometheus instance's tsdb?

2021-06-08 Thread Stuart Clark
here isn't really the need to have the older data when you create it, although you could do so. If you are thinking "scale out" in terms of clustering then that's not how Prometheus operates. -- Stuart Clark

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-07 Thread Stuart Clark
disruption budgets. For a pair of Prometheus servers I'd not want more than one change at once. For other systems I might go as far as N-1 changes at once. On Monday, 7 June 2021 at 11:05:20 UTC+1 Stuart Clark wrote: > On 2021-06-07 10:36, nina guo wrote: > > Autoscaling based on the p

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-07 Thread Stuart Clark
of the two instances rather than horizontal pod autoscaling). Similarly for queries it would be more memory you might need mostly rather than additional instances - if you are expecting massive query load maybe something like Thanos might be a good option? -- Stuart Clark

Re: [prometheus-users] install Prometheus with Prometheus Operator

2021-06-07 Thread Stuart Clark
. but if now the solution is with HA, the multiple pods are exactly scraping the same metrics, if then start autoscaling, it will break HA. Autoscaling of what? Both instances of Prometheus in a pair should be scraping the same metrics, so there shouldn't be any issues. -- Stuart Clark

Re: [prometheus-users] Why not to load balance traffic between Prometheus and its Alertmanagers?

2021-06-07 Thread Stuart Clark
to properly operate. If this isn't the case you could have alerts being resolved incorrectly or notifications sent at the wrong points. -- Stuart Clark

Re: [prometheus-users] Error: Unknown output 'rce'. Need help

2021-06-06 Thread Stuart Clark
a *systemd service file to easily run prometheus* I entered the command *sudo systemctl edit --full -force prometheus.service* I got this result: *Unknown output 'rce'.* Something went wrong. I ask for help. Do you need double dash for --force? -- Stuart Clark

Re: [prometheus-users] Prometheus reports Out of memory while loading segment

2021-06-02 Thread Stuart Clark
? It is really based on the number of time series, so 100 targets ingesting a single metric with no labels would be a lower impact than a single target which has hundreds of metrics, each with a selection of labels. -- Stuart Clark

Re: [prometheus-users] Re: level=error ts=2021-05-28T09:23:52.326Z caller=main.go:894 err="opening storage failed: lock DB directory: resource temporarily unavailable"

2021-06-02 Thread Stuart Clark
Prometheus crashes (or is forcibly destroyed) will be lost, but Prometheus tries to write to the WAL regularly to reduce that risk. When restarted (assuming the same storage is reattached) the WAL is read back into memory. -- Stuart Clark

Re: [prometheus-users] Re: level=error ts=2021-05-28T09:23:52.326Z caller=main.go:894 err="opening storage failed: lock DB directory: resource temporarily unavailable"

2021-06-02 Thread Stuart Clark
On 02/06/2021 09:22, nina guo wrote: If Prometheus deploys in k8s with multiple Pods, the Prometheus Pods are running independently, am I right? That is correct. -- Stuart Clark

Re: [prometheus-users] Prometheus reports Out of memory while loading segment

2021-06-02 Thread Stuart Clark
 the memory limit? Memory usage is controlled via the number of time series being scraped and the queries performed (there are a couple of settings to reject large queries), so in this case you could reduce the number of targets or use the configuration to reject some of the metrics. -- Stuart

Re: [prometheus-users] Prometheus reports Out of memory while loading segment

2021-06-01 Thread Stuart Clark
the amount of memory available or reduce the number of targets/metrics/time series being scraped. -- Stuart Clark

Re: [prometheus-users] Re: level=error ts=2021-05-28T09:23:52.326Z caller=main.go:894 err="opening storage failed: lock DB directory: resource temporarily unavailable"

2021-06-01 Thread Stuart Clark
(due to issues with what POSIX requests are supported) and so even with different directories for each instance it would not be advisable. -- Stuart Clark

Re: [prometheus-users] edit the default alertmanager global url like wechat pagerduty etc

2021-04-22 Thread Stuart Clark
alertmanager config is same like here https://hub.docker.com/r/prom/alertmanager If you aren't using them you don't need to do anything - just don't set any configuration for the receivers you don't need. -- Stuart Clark

Re: [prometheus-users] Context deadline exceeded

2021-04-21 Thread Stuart Clark
/scrape values? -- Stuart Clark

Re: [prometheus-users] Context deadline exceeded

2021-04-20 Thread Stuart Clark
> >On Tuesday, April 20, 2021 at 7:17:08 PM UTC+3 Stuart Clark wrote: > >> On 20/04/2021 16:45, barnyb...@gmail.com wrote: >> > Hello gentlemen. Help me figure out the case. >> > Previously, I used docker snmp-exporter on server 1(10.10.1.5) and >> > d

Re: [prometheus-users] Context deadline exceeded

2021-04-20 Thread Stuart Clark
s directly (with the ?target=IP parameter) do they both work? Does one take longer than the other? Could you have a firewall in the way from one of the two servers? -- Stuart Clark

Re: [prometheus-users] Alerts to office messenger(communicator)

2021-04-20 Thread Stuart Clark
programming language you wish. One thing to think of however is that an outage with that web server would cause alerts to stop appearing - so make sure you have things correctly monitored & a correct level of High Availability/isolation as needed. -- Stuart Clark

Re: [prometheus-users] Alerts to office messenger(communicator)

2021-04-20 Thread Stuart Clark
be such an application if it is a common system (for example there is one for MS Teams). If not, it would be a case of calling the correct APIs with the details sent as part of the webhook call. -- Stuart Clark

Re: [prometheus-users] HP iLo exporter Installation failure : https://github.com/n27051538/hpilo-exporter

2021-04-20 Thread Stuart Clark
="/METRICS" 5. ERROR OBSERVED. The exporter command you gave above suggests it is listening on port 9416, but the Prometheus job config lists 8082. I'd suggest matching the port numbers and trying again. -- Stuart Clark
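
To illustrate the port mismatch described above, here is a rough sketch of a scrape job whose target port matches the port the exporter listens on (9416). The job name and host name are hypothetical, and the original poster's other job settings are not shown in the snippet, so they are left out.

```yaml
scrape_configs:
  - job_name: hpilo                               # hypothetical job name
    static_configs:
      # Hypothetical host; the port is 9416 to match the exporter, not 8082.
      - targets: ['exporter-host.example:9416']
```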

Re: [prometheus-users] Prometheus dynamic alert rule.

2021-04-17 Thread Stuart Clark
Just remove the job selector from your query and it will alert for all jobs On 17 April 2021 10:40:57 BST, akshay sharma wrote: >In my setup, Prometheus is monitoring multiple nodes simultaneously, >say, >x,y z. >I want to raise alerts, once cpu utilization exceeds "a" value for each >of >the
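
As a sketch of what "remove the job selector" could look like in a rule file: the expression below has no `job="..."` matcher, so it is evaluated for every job that exposes the metric. It assumes node exporter's `node_cpu_seconds_total` is the CPU metric in use, and the threshold of 80 stands in for the poster's unspecified value "a"; the group and alert names are hypothetical.

```yaml
groups:
  - name: cpu-alerts                 # hypothetical group name
    rules:
      - alert: HighCpuUtilization    # hypothetical alert name
        # No job="..." matcher: every scraped job exposing this metric is covered.
        # Grouping by job and instance yields one alert per node.
        expr: |
          100 - (avg by (job, instance) (rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 80
        labels:
          severity: warning
```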

Re: [prometheus-users] Time Interval - Rules.yml -alert manager

2021-04-16 Thread Stuart Clark
What you are asking for is exactly what the "for" option does. It keeps periodically evaluating the expression and will only fire the alert if it constantly returns a result. You can set how often those checks happen, so for example if they were every minute "for: 5m" means 5 evaluations all
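
A minimal sketch of the "for" option with a one-minute evaluation interval, matching the example in the reply. The group name, alert name and expression are assumptions for illustration; `up == 0` is used only because `up` exists for every scraped target.

```yaml
groups:
  - name: example-rules              # hypothetical group name
    interval: 1m                     # how often the rules in this group are evaluated
    rules:
      - alert: InstanceDown          # hypothetical alert name
        expr: up == 0
        for: 5m                      # the expression must keep returning a result for 5 minutes before the alert fires
```

Until the `for` duration has elapsed the alert is pending rather than firing, and an evaluation that returns no result resets it.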

Re: [prometheus-users] component=tsdb msg="compaction failed" err="persist head block: mkdir /prometheus/aaaa.tmp: permission denied"

2021-04-16 Thread Stuart Clark
or suggests the user/group Prometheus is running as does not have the correct permissions for the /prometheus directory. -- Stuart Clark

Re: [prometheus-users] how to take Configmap changes effectvie?

2021-04-15 Thread Stuart Clark
://prometheus.io/docs/prometheus/latest/management_api/#reload -- Stuart Clark

Re: [prometheus-users] snmp_exporter: [snmp running only for localhost]

2021-03-30 Thread Stuart Clark
On 30/03/2021 13:11, Rizwan Khan wrote: Yes they are. We have convinced them of key based access for ssh. Rest they are not opening anything else. So it sounds like you just need to set up some SSH tunnels. Then you can use node exporter or SNMP exporter or whatever... -- Stuart Clark

Re: [prometheus-users] Send resolved without slack

2021-03-30 Thread Stuart Clark
" after 5 minutes when 1st alert "alert-emailer-cpu-1h" goes out. Is there any other way to solve that? Thanks for your help! Could you post the whole alertmanager config? -- Stuart Clark

Re: [prometheus-users] Grafana alerts

2021-03-30 Thread Stuart Clark
o a server going down (as well as other reasons such as network issues). -- Stuart Clark

Re: [prometheus-users] snmp_exporter: [snmp running only for localhost]

2021-03-30 Thread Stuart Clark
? If they are worried about that would they be equally concerned about node exporter or SSH tunnels? -- Stuart Clark

Re: [prometheus-users] snmp_exporter: [snmp running only for localhost]

2021-03-30 Thread Stuart Clark
a try and see how it goes. Can't you install the node exporter on those servers? Why do you "need" SNMP? -- Stuart Clark

Re: [prometheus-users] snmp_exporter: [snmp running only for localhost]

2021-03-29 Thread Stuart Clark
for the SNMP exporter directly - you are adding a failure mode that you probably want to monitor for as well - a tunnel failing would stop metrics totally, so you'd ideally set up the tunnel in a way that can detect issues (e.g. keepalives) and fix itself (auto restarts). -- Stuart Clark

Re: [prometheus-users] Grafana alerts

2021-03-29 Thread Stuart Clark
alerts (via Alertmanager)? -- Stuart Clark

Re: [prometheus-users] snmp_exporter: [snmp running only for localhost]

2021-03-29 Thread Stuart Clark
, but for a Linux or Windows machine the normal exporters are much quicker, more reliable and support more detailed information. -- Stuart Clark

Re: [prometheus-users] Send resolved without slack

2021-03-29 Thread Stuart Clark
On 29/03/2021 08:31, Sebastian Glock wrote: Hi, is there any way to configure send resolved without slack? I only found configuration with slack channel. I tried to add send resolved on true but it's not working. Is that even possible? -- Which method(s) are you trying to use? -- Stuart
