Re: working with flink Session Windows

2018-07-20 Thread antonio saldivar
Hello Actually I evaluate my WindowFunction with a trigger alert, having something like below code (testing with 2 different windows), expecting 5K elements per second arriving SingleOutputStreamOperator windowedElem = element .keyBy("id") .timeWindow(Time.seconds(120)) //

Re: working with flink Session Windows

2018-07-20 Thread Hequn Cheng
Hi antonio, I think it worth a try to test the performance in your scenario, since job performance can be affected by a number of factors(say your WindowFunction). Best, Hequn On Sat, Jul 21, 2018 at 2:59 AM, antonio saldivar wrote: > Hello > > I am building an app but for this UC I want to

Re: Flink on Mesos: containers question

2018-07-20 Thread Renjie Liu
Hi, Alexei: What you paste is expected behavior. Jobmanager, two task managers each should run in a docker instance. 13276 is should be the process of job manager, and it's the same process as 789. They have different processes id because in show them in different namesapces(that's a concept in

Re: ProcessFunction example from the documentation giving me error

2018-07-20 Thread anna stax
It is not the code, but I don't know what the problem is. A simple word count with socketTextStream used to work but now gives the same error. Apps with kafka source which used to work is giving the same error. When I have a source generator within the app itself works good. So, with

Re: Query regarding rest.port property

2018-07-20 Thread Chesnay Schepler
Effectively you can't disable them selectively; reason being that they are actually one and the same. The ultimate solution is to build flink-dist yourself, and exclude "flink-runtime-web" from it, which removes the required files. Note that being able to selectively disable them _for security

Re: org.apache.flink.streaming.runtime.streamrecord.LatencyMarker cannot be cast to org.apache.flink.streaming.runtime.streamrecord.StreamRecord

2018-07-20 Thread Gregory Fee
This is on Flink 1.4.2. I filed it as Flink-9905. Thanks! On Fri, Jul 20, 2018 at 2:51 AM, Dawid Wysakowicz wrote: > Hi Gregory, > I think it is some flink bug. Could you file a JIRA for it? Also which > version of flink are you using? > Best, > Dawid > > On Fri, 20 Jul 2018 at 04:34, vino yang

working with flink Session Windows

2018-07-20 Thread antonio saldivar
Hello I am building an app but for this UC I want to test with session windows and I am not sure if this will be expensive for the compute resources because the gap will be 10 mins, 20 mins 60 mins because I want to trigger an alert if the element reaches some thresholds within those periods of

Re: Flink 1.4.2 -> 1.5.0 upgrade Queryable State debugging

2018-07-20 Thread Philip Doctor
Yeah I just went to reproduce this on a fresh environment, I blew away all the Zookeeper data and the error went away. I'm running HA JobManager (1 active, 2 standby) and 3 TMs. I'm not sure how to fully account for this behavior yet, it looks like I can make this run from a totally fresh

akka.stream.materializer exception

2018-07-20 Thread Rad Rad
Hi, I have a problem when running streaming Flink from jar file using CLI, the program works fine if it runs from IDE? The main exception is [1] When I search for this exception, I tried to solve it by adding akka dependencies to my pom.xml file as [2] and maven shaded plugin for jar execution

Re: ProcessFunction example from the documentation giving me error

2018-07-20 Thread anna stax
My object name is CreateUserNotificationRequests, thats why you see CreateUserNotificationRequests in the Error message. I edited the object name after pasting the code...Hope there is no confusion and I get some help. Thanks On Fri, Jul 20, 2018 at 10:10 AM, anna stax wrote: > Hello all, >

Re: ProcessFunction example from the documentation giving me error

2018-07-20 Thread anna stax
Hello all, This is my code, just trying to make the code example in https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/stream/ operators/process_function.html work object ProcessFunctionTest { def main(args: Array[String]) { val env =

Re: Flink job hangs using rocksDb as backend

2018-07-20 Thread shishal singh
Hi Richer, Actually for the testing , now I have reduced the number of timers to few thousands (5-6K) but my job still gets stuck randomly. And its not reproducible each time. next time when I restart the job it again starts working for few few hours/days then gets stuck again. I took thread

Re: Query regarding rest.port property

2018-07-20 Thread Vino yang
Hi Vinay, Did job manager run in node "myhost"? Did you check the port you specified open for remote access? Can you try to start web UI, but just forbid its port? Vino yang Thanks. On 2018-07-20 22:48 , Vinay Patil Wrote: Hi, We have disabled Flink Web UI for security reasons however we

Re: Re: Bootstrapping the state

2018-07-20 Thread Vino yang
Hi Henkka, If you want to customize the datastream text source for your purpose. You can use a read counter, if the value of counter would not change in a interval you can guess all the data has been read. Just a idea, you can choose other solution. About creating a savepoint automatically on

Query regarding rest.port property

2018-07-20 Thread Vinay Patil
Hi, We have disabled Flink Web UI for security reasons however we want to use REST Api for monitoring purpose. For that I have set jobmanager.web.port = -1 , rest.port=, rest.address=myhost But I am not able to access any REST api using https:// myhost:/ Is it mandatory to have Flink

Re: Cannot configure akka.ask.timeout

2018-07-20 Thread Lukas Kircher
Thanks for your answers. In my use case I am reading from a large number of individual files. Jobs are issued directly from the Java API, the results are collected (in memory) and re-used partially in follow-up jobs. I feared that using a MiniCluster or local environment I would not be able to

Re: Bootstrapping the state

2018-07-20 Thread Henri Heiskanen
Hi, Thanks. Just to clarify, where would you then invoke the savepoint creation? I basically need to know when all data is read, create a savepoint and then exit. I think I could just as well use the PROCESS_CONTINUOSLY, monitor the stream outside and then use rest api to cancel with savepoint.

Re: Joining streamed data to reference data

2018-07-20 Thread Dawid Wysakowicz
Hi James, 1) Unfortunately, Flink does not support DataSet with DataStream joins as of now. If the "batch" table is small enough you might try the solution suggested by Vino to load it in the UDTF. You can also try implementing the Stream version of this table yourself. You can use the

Re: Joining streamed data to reference data

2018-07-20 Thread vino yang
Hi Porritt, Flink does not support streaming and batch join, currently, streaming and batch job are both independent. I guess your use case is streaming and dimension table join? Unfortunately, it's not possible for the Flink SQL API to join a stream with a common dataset now. 1) As a

Re: org.apache.flink.streaming.runtime.streamrecord.LatencyMarker cannot be cast to org.apache.flink.streaming.runtime.streamrecord.StreamRecord

2018-07-20 Thread Dawid Wysakowicz
Hi Gregory, I think it is some flink bug. Could you file a JIRA for it? Also which version of flink are you using? Best, Dawid On Fri, 20 Jul 2018 at 04:34, vino yang wrote: > Hi Gregory, > > This exception seems a bug, you can create a issues in the JIRA. > > Thanks, vino. > > 2018-07-20 10:28

Joining streamed data to reference data

2018-07-20 Thread Porritt, James
I was hoping to join a StreamTableSource to a BatchTableSource, but I find it's not simple. A couple of questions: 1) Other than just pushing the DataSet to a Kafka topic (either internally or externally to the application) and reading it into a DataStream are there any means of doing

Re: Flink 1.4.2 -> 1.5.0 upgrade Queryable State debugging

2018-07-20 Thread Dawid Wysakowicz
Hi Philip, Could you attach the full stack trace? Are you querying the same job/cluster in both tests? I am also looping in Kostas, who might know more about changes in Queryable state between 1.4.2 and 1.5.0. Best, Dawid On Thu, 19 Jul 2018 at 22:33, Philip Doctor wrote: > Dear Flink Users, >

Re: ProcessFunction example from the documentation giving me error

2018-07-20 Thread vino yang
Hi anna, Can you share your program and the exception stack trace and more details about what's your source and state backend? >From the information you provided, it seems Flink started a network connect but timed out. Thanks, vino. 2018-07-20 14:14 GMT+08:00 anna stax : > Hi all, > > I am

Re: Is KeyedProcessFunction available in Flink 1.4?

2018-07-20 Thread anna stax
Thanks Bowen. On Thu, Jul 19, 2018 at 4:45 PM, Bowen Li wrote: > Hi Anna, > > KeyedProcessFunction is only available starting from Flink 1.5. The doc is > here > . > It

ProcessFunction example from the documentation giving me error

2018-07-20 Thread anna stax
Hi all, I am new to Flink. I am using the classes CountWithTimestamp and CountWithTimeoutFunction from the examples found in https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/stream/operators/process_function.html I am getting the error Exception in thread "main"