Re: How to correct use timeWindow() with DataStream?

2018-03-19 Thread Fabian Hueske
If you don't want to partition by key, i.e., have a single result for each time window, you should not use keyBy and an allWindow. However, this will only be executed with a parallelism of 1. 2018-03-19 13:54 GMT+01:00 Felipe Gutierrez : > thanks a lot Fabian, > >

Re: How to correct use timeWindow() with DataStream?

2018-03-19 Thread Felipe Gutierrez
thanks a lot Fabian, It clarified my way to developing. I am using keyBy, timeWindow, and apply monad operator at the EventTimeStreamExampleJava now. I am generating

Re: How to correct use timeWindow() with DataStream?

2018-03-19 Thread Fabian Hueske
Hi, The timestamps of the stream records should be increasing (strict monotonicity is not required, a bit out of orderness can be handled due to watermarks). So, the events should also be generated with increasing timestamps. It looks like your generator generates random dates. I'd also generate

How to correct use timeWindow() with DataStream?

2018-03-16 Thread Felipe Gutierrez
Hi all, I am building an example with DataStream using Flink that has a fake source generator of LogLine(Date d, String line). I want to work with Watermarks on it so I created a class that implements AssignerWithPeriodicWatermarks. If I don't use the monad ".timeWindow(Time.seconds(2))" on the