Re: Windows and data loss.

2021-11-24 Thread Caizhi Weng
Hi! Are you using the datastream API or the table / SQL API? I don't know if datastream API has this functionality, but in table / SQL API we have the following configurations [1]. - table.exec.emit.late-fire.enabled: Emit window results for late records; - table.exec.emit.late-fire.dela

RE: Input Selectable & Checkpointing

2021-11-24 Thread Shazia Kayani
Hi Piotrek,Thanks for you message!Ok that does sound interesting and is a approach I had not considered before, will take a look into and further investigateThank you!Best wishes,Shazia   - Original message -From: "Piotr Nowojski" To: "Shazia Kayani" Cc: mart...@ververica.com, "user" Subjec

Windows and data loss.

2021-11-24 Thread John Smith
Hi I understand that when using windows and having set the watermarks and lateness configs. That if an event comes late it is lost and we can output it to side output. But wondering is there a way to do it without the loss? I'm guessing an "all" window with a custom trigger that just fires X peri

Re: behavior change with idle partitions and the new KafkaSource?

2021-11-24 Thread Arvid Heise
Hi David, If there are a fewer splits than subtasks, then in streaming mode, the watermark will not advance automatically anymore. If the subtask later picks up a new split (topic discovery), then the watermark will be set to the respective data. If the user uses higher parallelism than there are

Re: Input Selectable & Checkpointing

2021-11-24 Thread Piotr Nowojski
Hi Shazia, FLIP-182 [1] might be a thing that will let you address issues like this in the future. With it, maybe you could do some magic with assigning watermarks to make sure that one stream doesn't run too much into the future which would effectively prioritise the other stream. But that's curr

Re: Flink CLI - pass command line arguments to a pyflink job

2021-11-24 Thread Dian Fu
Thanks for sharing! Great to hear! On Wed, Nov 24, 2021 at 3:59 AM Kamil ty wrote: > Thank you Matthias and Dian! > > I have verified this command: > bin/flink run -py examples/python/table/batch/word_count.py --test "Hello > World" > Where the "--test" argument is accessed from the python code

Re: Setting producerConfig properties for FlinkKafkaProducer

2021-11-24 Thread Arvid Heise
Hi Darius, I fear that your use case has not been directly supported. You can certainly use the workarounds for that and I don't know of a better way. Consumer/Producer configs both in legacy and unified implementations are only meant to be statically settable. For the new unified interfaces, the