Re: Maintain heavy hitters in Flink application

2018-02-02 Thread m@xi
Hi Timo, Thanks a lot for the advice. I am working on it. Cheers, Max -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Maintain heavy hitters in Flink application

2018-02-01 Thread Timo Walther
Hi, I think it would be easier to implement a custom key selector and introduce some artifical key that spreads the load more evenly. This would also allow you to use keyed state. You could use a ProcessFunction and set timers to define the "every now and then". Keyed state would also ease th

Re: Maintain heavy hitters in Flink application

2018-02-01 Thread m@xi
Anyone, someone, somebody? -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Maintain heavy hitters in Flink application

2018-01-31 Thread m@xi
Hello everyone and Happy New Year! Regarding the Heavy Hitter tracking...I wanna do it in a distributed manner. Thus, 1 -- Round Robin the input stream to a number of parallel map instances (say p = env.parallelism) 2 -- Each one of the p mappers maintains approximately the HH of its correspondi

Re: Maintain heavy hitters in Flink application

2017-12-08 Thread m@xi
Kostas and Fabian, Thanks for the advice. I guess I will find a workaround to do the state redistribution. I also read about side outputs in this thread, which might be also an option that I will consider. http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Share-state-across-ope

Re: Maintain heavy hitters in Flink application

2017-12-07 Thread Kostas Kloudas
Hi Max, You are right that Queryable State is not designed to be used as a means for a job to query its own state. In fact, given that you do not know the jobId of your job from within the job itself, I do not think you can use queryable state in your scenario. What you can do is to have a fla

Re: Maintain heavy hitters in Flink application

2017-12-05 Thread Fabian Hueske
Hi, I haven't done that before either. The query API will change with the next version (Flink 1.4.0) which is currently being prepared for releasing. Kostas (in CC) might be able to help you. Best, Fabian 2017-12-05 9:52 GMT+01:00 m@xi : > Hi Fabian, > > Thanks for your answer. Initially, I hav

Re: Maintain heavy hitters in Flink application

2017-12-05 Thread m@xi
Hi Fabian, Thanks for your answer. Initially, I have excluded Queryable State as an option as it explicitly mentioned that it is used for querying state outside flink. Now that I am reading the documentation I am not sure how I may achieve that. I have to set ports and addresses which I am not su

Re: Maintain heavy hitters in Flink application

2017-12-04 Thread Fabian Hueske
Hi Max, state (keyed or operator state) is always local to the task. By default it is not accessible (read or write) from the outside or other tasks of the application. You can expose keyed state as queryable state [1] to perform key look ups. This feature was designed for external application to

Maintain heavy hitters in Flink application

2017-11-29 Thread m@xi
Hello everyone! I want to implement a streaming algorithm like Misa-Gries or Space Saving in Flink. The goal is to maintain the heavy hitters for my (possibly unbounded) input streams throughout all the time my app runs. More precisely, I want to have a non-stop running task that runs the Space Sa