Paulo Roberto de Oliveira Castro created SPARK-33564:
--------------------------------------------------------

             Summary: Prometheus metrics for Master and Worker isn't working 
                 Key: SPARK-33564
                 URL: https://issues.apache.org/jira/browse/SPARK-33564
             Project: Spark
          Issue Type: Bug
          Components: Spark Core, Spark Shell
    Affects Versions: 3.0.1, 3.0.0
            Reporter: Paulo Roberto de Oliveira Castro


Following the [PR|https://github.com/apache/spark/pull/25769] that introduced 
the Prometheus sink, I downloaded the {{spark-3.0.1-bin-hadoop2.7.tgz}}  (also 
tested with 3.0.0), uncompressed the tgz and created a file called 
{{metrics.properties __ }}adding this content:
{{}} 

{{*.sink.prometheusServlet.class=org.apache.spark.metrics.sink.PrometheusServlet}}
{{*.sink.prometheusServlet.path=/metrics/prometheus
master.sink.prometheusServlet.path=/metrics/master/prometheus
applications.sink.prometheusServlet.path=/metrics/applications/prometheus}}

Then I ran:

 

{{$ sbin/start-master.sh}}
{{$ sbin/start-slave.sh spark://`hostname`:7077}}
{{$ bin/spark-shell --master spark://`hostname`:7077 
--files=./metrics.properties --conf spark.metrics.conf=./metrics.properties}}

{{The Spark shell opens without problems:}}

{{}}
{quote}20/11/25 17:36:07 WARN NativeCodeLoader: Unable to load native-hadoop 
library for your platform... using builtin-java classes where applicable

{{}}

Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties

{{}}

Setting default log level to "WARN".

{{}}

To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use 
setLogLevel(newLevel).

{{}}

Spark context Web UI available at http://192.168.0.6:4040

{{}}

Spark context available as 'sc' (master = 
spark://MacBook-Pro-de-Paulo-2.local:7077, app id = app-20201125173618-0002).

{{}}

Spark session available as 'spark'.

{{}}

Welcome to

{{}}

      ____              __

{{}}

     / __/__  ___ _____/ /__

{{}}

    _\ \/ _ \/ _ `/ __/  '_/

{{}}

   /___/ .__/\_,_/_/ /_/\_\   version 3.0.0

{{}}

      /_/

{{}}

         

{{}}

Using Scala version 2.12.10 (OpenJDK 64-Bit Server VM, Java 1.8.0_212)

{{}}

Type in expressions to have them evaluated.

{{}}

Type :help for more information.

{{}}

 

{{}}

scala>
{quote}
{{And when I try to fetch prometheus metrics for driver, everything works 
fine:}}
{quote}$ curl -s http://localhost:4040/metrics/prometheus/ | head -n 5

metrics_app_20201125173618_0002_driver_BlockManager_disk_diskSpaceUsed_MB_Number\{type="gauges"}
 0

metrics_app_20201125173618_0002_driver_BlockManager_disk_diskSpaceUsed_MB_Value\{type="gauges"}
 0

metrics_app_20201125173618_0002_driver_BlockManager_memory_maxMem_MB_Number\{type="gauges"}
 732

metrics_app_20201125173618_0002_driver_BlockManager_memory_maxMem_MB_Value\{type="gauges"}
 732

metrics_app_20201125173618_0002_driver_BlockManager_memory_maxOffHeapMem_MB_Number\{type="gauges"}
 0
{quote}

*The problem appears when I try accessing master metrics*, and I get the 
following problem:


{quote}$ curl -s http://localhost:8080/metrics/master/prometheus


<!DOCTYPE html><html>

      <head>

        <meta http-equiv="Content-type" content="text/html; 
charset=utf-8"/><link rel="stylesheet" href="/static/bootstrap.min.css" 
type="text/css"/><link rel="stylesheet" 
href="/static/vis-timeline-graph2d.min.css" type="text/css"/><link 
rel="stylesheet" href="/static/webui.css" type="text/css"/><link 
rel="stylesheet" href="/static/timeline-view.css" type="text/css"/><script 
src="/static/sorttable.js"></script><script 
src="/static/jquery-3.4.1.min.js"></script><script 
src="/static/vis-timeline-graph2d.min.js"></script><script 
src="/static/bootstrap-tooltip.js"></script><script 
src="/static/initialize-tooltips.js"></script><script 
src="/static/table.js"></script><script 
src="/static/timeline-view.js"></script><script 
src="/static/log-view.js"></script><script 
src="/static/webui.js"></script><script>setUIRoot('')</script>

        

        <link rel="shortcut icon" 
href="/static/spark-logo-77x50px-hd.png"></link>

        <title>Spark Master at spark://MacBook-Pro-de-Paulo-2.local:7077</title>

      </head>

      <body>

        <div class="container-fluid">

          <div class="row-fluid">

            <div class="span12">

              <h3 style="vertical-align: middle; display: inline-block;">

                <a style="text-decoration: none" href="/">

                  <img src="/static/spark-logo-77x50px-hd.png"/>

                  <span class="version" style="margin-right: 15px;">3.0.0</span>

                </a>

                Spark Master at spark://MacBook-Pro-de-Paulo-2.local:7077

              </h3>

            </div>

          </div>

          <div class="row-fluid">

          <div class="span12">

            <ul class="unstyled">

              <li><strong>URL:</strong> 
spark://MacBook-Pro-de-Paulo-2.local:7077</li>
...
{quote}
The same happens for all of those here:

{quote}{{$ curl -s [http://localhost:8080/metrics/applications/prometheus/]}}
{{$ curl -s [http://localhost:8081/metrics/prometheus/]}}
{quote}
Instead, *I expected metrics in prometheus metrics*. All related JSON endpoints 
seem to be working fine.

{{}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to