Hello Arvid,

Thanks I’ll check my config and use the correct reporter and test it out.

Thanks,
Hemant

On Fri, 14 Aug 2020 at 6:57 PM, Arvid Heise <ar...@ververica.com> wrote:

> Hi Hemant,
>
> according to the influx section of the 1.9 metric documentation [1], you
> should use the reporter without a factory. The factory was added later.
>
> metrics.reporter.influxdb.class: 
> org.apache.flink.metrics.influxdb.InfluxdbReportermetrics.reporter.influxdb.host:
>  localhostmetrics.reporter.influxdb.port: 8086metrics.reporter.influxdb.db: 
> flinkmetrics.reporter.influxdb.username: 
> flink-metricsmetrics.reporter.influxdb.password: 
> qwertymetrics.reporter.influxdb.retentionPolicy: one_hour
>
>
> [1]
> https://ci.apache.org/projects/flink/flink-docs-release-1.9/monitoring/metrics.html#influxdb-orgapacheflinkmetricsinfluxdbinfluxdbreporter
>
> On Thu, Aug 13, 2020 at 8:10 AM bat man <tintin0...@gmail.com> wrote:
>
>> Anyone who has made metrics integration to external systems for flink
>> running on AWS EMR, can you share if its a configuration issue or EMR
>> specific issue.
>>
>> Thanks,
>> Hemant
>>
>> On Wed, Aug 12, 2020 at 9:55 PM bat man <tintin0...@gmail.com> wrote:
>>
>>> An update in the yarn logs I could see the below -
>>>
>>> Classpath:
>>> *lib/flink-metrics-influxdb-1.9.0.jar:lib/flink-shaded-hadoop-2-uber-2.8.5-amzn-5-7.0.jar:lib/flink-table-blink_2.11-1.9.0.jar:lib/flink-table_2.11-1.9.0.jar:lib/log4j-1.2.17.jar:lib/slf4j-log4j12-1.7.15.jar:log4j.properties:plugins/influxdb/flink-metrics-influxdb-1.9.0.jar....*
>>> *..........*
>>> *......*
>>>
>>> This means the jar is getting loaded, in the logs I could also see -
>>> 2020-08-12 15:28:51,505 INFO
>>>  org.apache.flink.yarn.YarnTaskExecutorRunner                  - Registered
>>> UNIX signal handlers for [TERM, HUP, I
>>> NT]
>>> 2020-08-12 15:28:51,508 INFO
>>>  org.apache.flink.yarn.YarnTaskExecutorRunner                  - Current
>>> working Directory: /mnt/yarn/usercache/ha
>>>
>>> doop/appcache/application_1595767096609_0013/container_1595767096609_0013_01_000004
>>>
>>> *2020-08-12 15:28:51,512 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.influxdb.interval, 60 SECONDS*
>>>
>>> *2020-08-12 15:28:51,512 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: env.yarn.conf.dir, /etc/hadoop/conf*
>>> 2020-08-12 15:28:51,513 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.
>>> influxdb.host, xx.xxx.xxx.xx
>>> 2020-08-12 15:28:51,513 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: high-availability
>>> .cluster-id, application_1595767096609_0013
>>> 2020-08-12 15:28:51,513 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: jobmanager.rpc.ad
>>> dress, ip-xx-x-xx-xxx.ec2.internal
>>> 2020-08-12 15:28:51,513 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.
>>> influxdb.password, ******
>>>
>>> *2020-08-12 15:28:51,513 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: FLINK_PLUGINS_DIR, /usr/lib/flink/plugins*
>>> 2020-08-12 15:28:51,513 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.
>>> influxdb.db, xxxxxx
>>> 2020-08-12 15:28:51,520 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.
>>> influxdb.connectTimeout, 60000
>>> 2020-08-12 15:28:51,520 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: env.hadoop.conf.d
>>> ir, /etc/hadoop/conf
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: taskmanager.numbe
>>> rOfTaskSlots, 1
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: web.port, 0
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.influxdb.username, xxxx
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: taskmanager.memory.size, 264241152b
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: web.tmpdir,
>>> /tmp/flink-web-5562f065-6020-4c38-8260-3aea434bf285
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: jobmanager.rpc.port, 32777
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.influxdb.port, 8086
>>> 2020-08-12 15:28:51,521 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.influxdb.retentionPolicy, one_hour
>>> 2020-08-12 15:28:51,522 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: internal.cluster.execution-mode, NORMAL
>>> 2020-08-12 15:28:51,522 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.influxdb.writeTimeout, 60000
>>> 2020-08-12 15:28:51,522 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.influxdb.consistency, ONE
>>> 2020-08-12 15:28:51,522 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: rest.address, ip-xx-x-xx-xxx.ec2.internal
>>> *2020-08-12 15:28:51,522 INFO
>>>  org.apache.flink.configuration.GlobalConfiguration            - Loading
>>> configuration property: metrics.reporter.influxdb.factory.class,
>>> org.apache.flink.metrics.influxdb.InfluxdbReporterFactory*
>>> .......
>>> but then below I could see -
>>>
>>> *2020-08-12 15:28:51,523 WARN  org.apache.flink.core.plugin.PluginConfig
>>>                     - Environment variable [FLINK_PLUGINS_DIR] is set to
>>> [/usr/lib/flink/plugins] but the directory doesn't exist*
>>> 2020-08-12 15:28:51,561 INFO
>>>  org.apache.flink.yarn.YarnTaskExecutorRunner                  - Current
>>> working/local Directory:
>>> /mnt/yarn/usercache/hadoop/appcache/application_1595767096609_0013,/mnt1/yarn/usercache/hadoop/appcache/application_1595767096609_0013
>>> 2020-08-12 15:28:51,564 INFO
>>>  org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting
>>> directories for temporary files to:
>>> /mnt/yarn/usercache/hadoop/appcache/application_1595767096609_0013,/mnt1/yarn/usercache/hadoop/appcache/application_1595767096609_0013
>>> 2020-08-12 15:28:51,564 INFO
>>>  org.apache.flink.yarn.YarnTaskExecutorRunner                  - TM: remote
>>> keytab path obtained null
>>> 2020-08-12 15:28:51,564 INFO
>>>  org.apache.flink.yarn.YarnTaskExecutorRunner                  - TM: remote
>>> keytab principal obtained null
>>> 2020-08-12 15:28:51,566 INFO
>>>  org.apache.flink.yarn.YarnTaskExecutorRunner                  - YARN
>>> daemon is running as: hadoop Yarn client user obtainer: hadoop
>>> 2020-08-12 15:28:51,675 INFO
>>>  org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop
>>> user set to hadoop (auth:xxxxxx)
>>> 2020-08-12 15:28:51,984 WARN
>>>  org.apache.flink.configuration.Configuration                  - Config
>>> uses deprecated configuration key 'web.port' instead of proper key
>>> 'rest.port'
>>> 2020-08-12 15:28:51,987 INFO
>>>  org.apache.flink.runtime.taskexecutor.TaskManagerRunner       - Using
>>> configured hostname/address for TaskManager: ip-xx-x-xx-xxx.ec2.internal.
>>> 2020-08-12 15:28:51,996 INFO
>>>  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to
>>> start actor system at ip-xx-x-xx-xxx.ec2.internal:0
>>> 2020-08-12 15:28:52,823 INFO  akka.event.slf4j.Slf4jLogger
>>>                    - Slf4jLogger started
>>> 2020-08-12 15:28:52,854 INFO  akka.remote.Remoting
>>>                    - Starting remoting
>>> 2020-08-12 15:28:53,061 INFO  akka.remote.Remoting
>>>                    - Remoting started; listening on addresses
>>> :[akka.tcp://flink@ip-xx-x-xx-xxx.ec2.internal:37937]
>>> 2020-08-12 15:28:53,563 INFO
>>>  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor
>>> system started at akka.tcp://flink@iip-xx-x-xx-xxx.ec2.ec2.internal
>>> :37937
>>> *2020-08-12 15:28:53,593 WARN
>>>  org.apache.flink.runtime.metrics.ReporterSetup                - The
>>> reporter factory
>>> (org.apache.flink.metrics.influxdb.InfluxdbReporterFactory) could not be
>>> found for reporter influxdb. Available factories:*
>>>
>>> *2020-08-12 15:28:53,597 INFO
>>>  org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics
>>> reporter configured, no metrics will be exposed/reported.*2020-08-12
>>> 15:28:53,599 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils
>>>       - Trying to start actor system at ip-xx-x-xx-xxx.ec2.ec2.internal:0
>>>
>>> So at one place org.apache.flink.configuration.GlobalConfiguration
>>> refers to the properties and metrics reported but
>>> then org.apache.flink.runtime.metrics.ReporterSetup complains of not
>>> finding it.
>>>
>>> Can anyone guide what I am missing here.
>>>
>>> Thanks,
>>> Hemant
>>>
>>> On Wed, Aug 12, 2020 at 9:15 PM bat man <tintin0...@gmail.com> wrote:
>>>
>>>> Hello Experts,
>>>>
>>>> I am running Flink - 1.9.0 on AWS EMR(emr-5.28.1). I want to push
>>>> metrics to Influxdb. I followed the documentation[1]. I added the
>>>> configuration to /usr/lib/flink/conf/flink-conf.yaml and copied the jar to
>>>> /usr/lib/flink//lib folder on master node. However, I also
>>>> understand that the cluster might need a re-start as only with these steps
>>>> when I run the job I don't see any measurement(table) created in my influx
>>>> db. I am not able to find any documentation on how to restart the cluster
>>>> on EMR.
>>>> Anyone who has configured to push metrics to InfluxDB from AWS EMR
>>>> could you share the steps please.
>>>>
>>>> [1]
>>>> https://ci.apache.org/projects/flink/flink-docs-release-1.9/monitoring/metrics.html#influxdb-orgapacheflinkmetricsinfluxdbinfluxdbreporter
>>>>
>>>> Thanks,
>>>> Hemant
>>>>
>>>
>
> --
>
> Arvid Heise | Senior Java Developer
>
> <https://www.ververica.com/>
>
> Follow us @VervericaData
>
> --
>
> Join Flink Forward <https://flink-forward.org/> - The Apache Flink
> Conference
>
> Stream Processing | Event Driven | Real Time
>
> --
>
> Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany
> <https://www.google.com/maps/search/Invalidenstrasse+115,+10115+Berlin,+Germany?entry=gmail&source=g>
>
> --
> Ververica GmbH
> Registered at Amtsgericht Charlottenburg: HRB 158244 B
> Managing Directors: Timothy Alexander Steinert, Yip Park Tung Jason, Ji
> (Toni) Cheng
>

Reply via email to