delete all available flink timers on app start

2019-01-16 Thread vipul singh
Hello, I have a custom app, in which when due to some exception, the app restarts I want to cancel all registered flink timers in the initializeState method. Based on the documentation I feel like all timer state is saved in the state, so if the app restarts the timers are still active. Is there

Re: Issue with counter metrics for large number of keys

2019-01-16 Thread Zhenghua Gao
So what you want is the counts of every keys ? Why didn't you use count aggregation? -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Issue with counter metrics for large number of keys

2019-01-16 Thread Gaurav Luthra
Hi Ken, Thanks for your inputs again. I will wait for Flink guys to come back to me for the suggestion of implementation of 100 K unique counters. For time being, I will make the number of counter metric value a configurable parameter in my application. So, user will know what he is trying to do.

Re: Issue with counter metrics for large number of keys

2019-01-16 Thread Ken Krugler
Hi Gaurav, I’ve use a few hundred counters before without problems. My concern about > 100K unique counters is that you wind up generating load (and maybe memory issues) for the JobManager. E.g. with Hadoop’s metric system trying to go much beyond 1000 counters could cause significant

Re: Streaming Checkpoint - Could not materialize checkpoint Exception

2019-01-16 Thread sohimankotia
Hi Andrey, Yes .Setting setFailOnCheckpointingErrors(false) solved the problem. But in between I am getting this error : 2019-01-16 21:07:26,979 ERROR org.apache.flink.runtime.rest.handler.taskmanager.TaskManagerDetailsHandler - Implementation error: Unhandled exception.

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-01-16 Thread sohimankotia
Hi Andrey, Yes. CustomBucketingSink is custom class copied from Bucketing Sink itself . Few changes were added : 1. Add timestamp in part files 2. Few Logging statements Note: Looks like I copied it from version 1.4 ( Don't know if that could be the reason for failure) Did it override

RE: EXT :Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Martin, Nick
Does that mean that the full set of fs.s3a.<…> configs in core-site.xml will be picked up from flink-conf.yaml by flink? Or only certain configs involved with authentication? From: Till Rohrmann [mailto:trohrm...@apache.org] Sent: Wednesday, January 16, 2019 3:43 AM To: Vinay Patil Cc: Kostas

Getting RemoteTransportException

2019-01-16 Thread Avi Levi
Hi Guys, We done some load tests and we got the exception below, I saw that the JobManager was restarted, If I understood correctly, it will get new job id and the state will lost - is that correct? how the state is handled setting HA as described here

How to trigger a Global Window with a different Message from the window message

2019-01-16 Thread Oliver Buckley-Salmon
Hi, I have a Flink job where I receive a stream of AggregationKeys, stored in BroadcastState which I join in a Tuple2 with a stream of RiskMeasureMessages, which I then wish to aggregate in a Window. The RiskMeasureMessages are bounded by CalcStart and CalcEnd messages which come on separate

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-01-16 Thread Andrey Zagrebin
Hi Sohi, Something was originally interrupted in DFSOutputStream$DataStreamer.run. It was thrown in the timer callback which processed files in CustomBucketingSink. Task reported the failure to JM and JM triggered then job cancelation. I do not see this CustomBucketingSink in Flink code. Is it

Re: Setting flink-conf params in IDE

2019-01-16 Thread Till Rohrmann
Hi Alexandru, you can call `StreamExecutionEnvironment#createLocalEnvironment` which you can pass a Flink configuration object. Cheers, Till On Wed, Jan 16, 2019 at 1:05 PM Alexandru Gutan wrote: > Hi everyone! > > Is there a way to set flink-conf.yaml params but when running from the > IDE?

Re: ElasticSearch Connector

2019-01-16 Thread Dawid Wysakowicz
Hi Ramya, I think the problem is that you access the serializationSchema from the closure of ElasticsearchSinkFunction. Try creating ElasticsearchSinkFunction that will get the serializationSchema in ctor. If this is not the problem could you share the full stack of the error? Best, Dawid On

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-01-16 Thread sohimankotia
Hi Andrey , Pls find logs . Attaching dropbox link as logs as large . Job Manager . : https://www.dropbox.com/s/q0rd60coydupl6w/full.log.gz?dl=0 Application : https://www.dropbox.com/s/cn3yrd273wd99f2/jm-sohan.log.gz?dl=0 Thanks Sohi -- Sent from:

Setting flink-conf params in IDE

2019-01-16 Thread Alexandru Gutan
Hi everyone! Is there a way to set flink-conf.yaml params but when running from the IDE? What I'm trying to do is to setup JMX metrics: metrics.reporter.jmx.class: org.apache.flink.metrics.jmx.JMXReportermetrics.reporter.jmx.port: 8789 Thanks!

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-01-16 Thread Andrey Zagrebin
Hi Sohi, This still looks like Task Manager logs, could you post Job Master logs, please? Best, Andrey On Tue, Jan 15, 2019 at 7:49 AM sohimankotia wrote: > Hi , > > Any Update/help please ? > > > > -- > Sent from: > http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/ >

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Till Rohrmann
I haven't configured this myself but I would guess that you need to set the parameters defined here under S3A Authentication methods [1]. If the environment variables don't work, then I would try to set the authentication properties. [1]

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Vinay Patil
Hi Till, Can you please let us know the configurations that we need to set for Profile based credential provider in flink-conf.yaml Exporting AWS_PROFILE property on EMR did not work. Regards, Vinay Patil On Wed, Jan 16, 2019 at 3:05 PM Till Rohrmann wrote: > The old BucketingSink was using

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Till Rohrmann
The old BucketingSink was using Hadoop's S3 filesystem directly whereas the new StreamingFileSink uses Flink's own FileSystem which need to be configured via the flink-conf.yaml. Cheers, Till On Wed, Jan 16, 2019 at 10:31 AM Vinay Patil wrote: > Hi Till, > > We are not providing

Re: Streaming Checkpoint - Could not materialize checkpoint Exception

2019-01-16 Thread Andrey Zagrebin
Hi Sohi, Could it be that you configured your job tasks to fail if checkpoint fails (streamExecutionEnvironment.getCheckpointConfig().setFailOnCheckpointingErrors(true))? Could you send the complete job master log? If checkpoint 470 has been subsumed by 471, it could be that its directory is

Re: Unable to override metric format for Prometheus Reporter

2019-01-16 Thread Chesnay Schepler
The prometheus reporter ignores scope formats. In fact all reporters that work with tags (i.e., key-value-pairs) ignore them, the idea being that you would search specific metrics based on their tags. I'm not aware of any intermediate workarounds. There is a JIRA for this issue:

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Vinay Patil
Hi Till, We are not providing `fs.s3a.access.key: access_key`, `fs.s3a.secret.key: secret_key` in flink-conf.yaml as we are using Profile based credentials provider. The older BucketingSink code is able to get the credentials and write to S3. We are facing this issue only with StreamingFileSink.

Re: Streaming Checkpoint - Could not materialize checkpoint Exception

2019-01-16 Thread Till Rohrmann
Hi Sohimankotia, you can control Flink's failure behaviour in case of a checkpoint failure via the `ExecutionConfig#setFailTaskOnCheckpointError(boolean)`. Per default it is set to true which means that a Flink task will fail if a checkpoint error occurs. If you set it to false, then the job

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Kostas Kloudas
Actually Till is right. Sorry, my fault, I did not read your second email where Vinay mentions the core-site.xml. Cheers, Kostas On Wed, Jan 16, 2019 at 10:25 AM Till Rohrmann wrote: > Hi Vinay, > > Flink's file systems are self contained and won't respect the > core-site.xml if I'm not

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Till Rohrmann
Hi Vinay, Flink's file systems are self contained and won't respect the core-site.xml if I'm not mistaken. Instead you have to set the credentials in the flink configuration flink-conf.yaml via `fs.s3a.access.key: access_key`, `fs.s3a.secret.key: secret_key` and so on [1]. Have you tried this

NPE when using spring bean in custom input format

2019-01-16 Thread madan
Hi, Need help in the below scenario, I have CustomInputFormat which loads the records using a bean, public class CustomInputFormat extends GenericInputFormat { private Iterator> recordsIterator; @Override public void open(GenericInputSplit split) throws IOException {

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Kostas Kloudas
Hi Taher, So you are using the same configuration files and everything and the only thing you change is the "s3://" to "s3a://" and the sink cannot find the credentials? Could you please provide the logs of the Task Managers? Cheers, Kostas On Wed, Jan 16, 2019 at 9:13 AM Dawid Wysakowicz

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Dawid Wysakowicz
Forgot to cc ;) On 16/01/2019 08:51, Vinay Patil wrote: > Hi, > > Can someone please help on this issue. We have even tried to set > fs.s3a.impl in core-site.xml, still its not working. > > Regards, > Vinay Patil > > > On Fri, Jan 11, 2019 at 5:03 PM Taher Koitawala [via Apache Flink User >

Re: StreamingFileSink cannot get AWS S3 credentials

2019-01-16 Thread Dawid Wysakowicz
Hi, I cc Kostas who should be able to help you. Best, Dawid On 16/01/2019 08:51, Vinay Patil wrote: > Hi, > > Can someone please help on this issue. We have even tried to set > fs.s3a.impl in core-site.xml, still its not working. > > Regards, > Vinay Patil > > > On Fri, Jan 11, 2019 at 5:03 PM