Have a look at this presentation I gave a few weeks ago. https://youtu.be/bQmz7JOmE_4
Niels Basjes On Wed, 22 Jul 2020, 08:51 bat man, <tintin0...@gmail.com> wrote: > Hi Team, > > Can someone share their experiences handling this. > > Thanks. > > On Tue, Jul 21, 2020 at 11:30 AM bat man <tintin0...@gmail.com> wrote: > >> Hello, >> >> I have a pipeline which consumes data from a Kafka source. Since, the >> partitions are partitioned by device_id in case a group of devices is down >> some partitions will not get normal flow of data. >> I understand from documentation here[1] in flink 1.11 one can declare the >> source idle - >> WatermarkStrategy.<Tuple2<Long, String>>forBoundedOutOfOrderness(Duration >> .ofSeconds(20)).withIdleness(Duration.ofMinutes(1)); >> >> How can I handle this in 1.9, since I am using aws emr and emr doesn't >> have any release with the latest flink version. >> >> One way I could think of is to trigger watermark generation every 10 >> minutes or so using Periodic watermarks. However, this will not be full >> proof, are there any better way to handle this more dynamically. >> >> [1] - >> https://ci.apache.org/projects/flink/flink-docs-release-1.11/dev/event_timestamps_watermarks.html#watermark-strategies-and-the-kafka-connector >> >> Thanks, >> Hemant >> >>