Re: implementing a continuous time window

2016-04-21 Thread John Sherwood
You are looking for sliding windows: https://flink.apache.org/news/2015/12/04/Introducing-windows.html

Here you would do .timeWindow(Time.seconds(5), Time.seconds(1))

On Thu, Apr 21, 2016 at 12:06 PM, Jonathan Yom-Tov wrote:
> hi,
>
> Is it possible to implement a continuous time window with f
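To make the sliding-window suggestion concrete, here is a minimal plain-Python sketch of how a 5-second window sliding every 1 second assigns one timestamp to several overlapping windows. It only illustrates the semantics and is not Flink API: the function name `sliding_windows` and the millisecond units are assumptions made for this example, and windows are assumed to be aligned to the epoch with no offset.

```python
def sliding_windows(timestamp_ms, size_ms, slide_ms):
    """Return every [start, end) window that contains timestamp_ms.

    Hypothetical helper for illustration only; assumes windows are
    aligned to multiples of slide_ms (no offset).
    """
    # Start of the most recent window that begins at or before the timestamp.
    last_start = timestamp_ms - (timestamp_ms % slide_ms)
    windows = []
    start = last_start
    # Walk backwards one slide at a time while the window still covers
    # the timestamp, i.e. while start + size_ms > timestamp_ms.
    while start > timestamp_ms - size_ms:
        windows.append((start, start + size_ms))
        start -= slide_ms
    return windows


if __name__ == "__main__":
    # Analogue of .timeWindow(Time.seconds(5), Time.seconds(1))
    for window in sliding_windows(7300, 5000, 1000):
        print(window)
```

With a size of 5 seconds and a slide of 1 second, each element falls into size / slide = 5 windows, which is why results are emitted every second while each result covers the last 5 seconds.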

Re: design question

2016-04-23 Thread John Sherwood
This sounds like you have some per-key state to keep track of, so the 'correct' way to do it would be to keyBy the guid. I believe that if you run your environment using the RocksDB state backend you will not OOM regardless of the number of GUIDs that are eventually tracked. Whether Flink/stream p
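As a rough illustration of what keying by the guid means, the plain-Python sketch below keeps one independent piece of state per key. It is not Flink code: the function name `count_per_guid`, the `(guid, payload)` event shape, and the choice of a running count as the tracked state are all assumptions made for this example, since the original message does not say what is being tracked. The in-memory dict stands in for the state backend.

```python
from collections import defaultdict


def count_per_guid(events):
    """Emit (guid, running_count) for each (guid, payload) event.

    Hypothetical example: the running count is a stand-in for whatever
    per-key state the job actually needs.
    """
    state = defaultdict(int)  # one state slot per guid
    results = []
    for guid, _payload in events:
        # Only the state belonging to this event's key is read or updated.
        state[guid] += 1
        results.append((guid, state[guid]))
    return results


if __name__ == "__main__":
    events = [("a", "x"), ("b", "y"), ("a", "z")]
    for item in count_per_guid(events):
        print(item)
```

Here the dict grows with the number of distinct guids, which is the memory concern raised in the message; a disk-backed state backend addresses that by not holding all keys on the heap.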

S3 Checkpoint Storage

2016-05-02 Thread John Sherwood
Hello all, I'm attempting to set up a taskmanager cluster using S3 as the highly-available store. It looks like the main thing is just setting the `state.backend.fs.checkpointdir` to the appropriate s3:// URI, but as someone rather new to accessing S3 from Java, how should I provide Flink with th

Configuring taskmanager with HA jobmanager

2016-05-04 Thread John Sherwood
I'm attempting to move to an HA configuration with a trio of JobManagers on top of a ZK cluster. From the docs, it appears that I should have them in my 'masters' file (as I do), but when I attempt to start the TaskManagers, they die complaining there is no jobmanager.rpc.address config - which see

Managing Flink on Yarn programmatically

2016-05-16 Thread John Sherwood
I think this is primarily a shortcoming in my ability to grep through Scala efficiently, but are there any resources on how to programmatically spin up and administer Flink jobs on YARN? The CLI naturally works, but I'd like to build out a service handling the nuances of job management rather than