Re: join two streams with pyflink

2024-04-02 Thread Biao Geng
Hi Thierry, Your case is not very complex and I believe all programming language(e.g. Java, Python, SQL) interfaces of flink can do that. When using pyflink, you can use pyflink datastream/table/SQL API. Here are some examples of using pyflink table api:

Re: Join two streams

2023-06-30 Thread Иван Борисов
Thank you for your answer! It's work right now, one more question: I've got few streams from few Kafka topics (if it possible to do other way and easier I could make one topic or any other modifications) with sensors measurements into JSON messages: topic1: {'data': {'temp':25.2,

RE: Join two streams

2023-06-29 Thread Schwalbe Matthias
Привет Иван, The source of your problem is quite easy: - If you do windowing by event time, all the sources need to emit watermarks. - watermarks are the logical clock used when event-time timing - you could use either processing time windows, or adjust watermark strategy of your sources

Re: Join two streams

2023-06-29 Thread Shrihari R
Hey Иван, Use *TumblingProcessingTimeWindows* instead of TumblingEventTimeWindows. TumblingEventTimeWindows requires a watermark strategy. *Ref* https://stackoverflow.com/questions/72291659/flink-tumbling-window-is-not-triggered-no-watermark-strategy *Regards* Shrihari On Fri, Jun 30, 2023 at

Re: Join two streams from Kafka

2021-02-11 Thread Arvid Heise
Hi Shamit, unless you have some temporal relationship between the records to be joined, you have to use a regular join over stream 1 and stream 2. Since you cannot define any window, all data will be held in Flink's state, which is not an issue for a few millions but probably means you have to

Re: Join two streams using a count-based window

2016-06-10 Thread Nikos R. Katsipoulakis
Thank you very much Matthias! Also, the link you provided is very helpful. Cheers, Nikos On Fri, Jun 10, 2016 at 3:16 AM, Matthias J. Sax wrote: > I just put an answer to SO. > > About the other questions: Flink processes tuple-by-tuple and does some > internal buffering. You

Re: Join two streams using a count-based window

2016-06-10 Thread Matthias J. Sax
I just put an answer to SO. About the other questions: Flink processes tuple-by-tuple and does some internal buffering. You might be interested in https://cwiki.apache.org/confluence/display/FLINK/Data+exchange+between+tasks -Matthias On 06/09/2016 08:13 PM, Nikos R. Katsipoulakis wrote: >