Re: [DISCUSS] FLIP-435: Introduce a New Dynamic Table for Simplifying Data Pipelines

Timo Walther Tue, 12 Mar 2024 10:24:51 -0700

Hi Lincoln & Ron,

thanks for proposing this FLIP. I think a design similar to what youpropose has been in the heads of many people, however, I'm wondering howthis will fit into the bigger picture.

I haven't deeply reviewed the FLIP yet, but would like to ask someinitial questions:

Flink has introduced the concept of Dynamic Tables many years ago. Howdoes the term "Dynamic Table" fit into Flink's regular tables and alsohow does it relate to Table API?

I fear that adding the DYNAMIC TABLE keyword could cause confusion forusers, because a term for regular CREATE TABLE (that can be "kind ofdynamic" as well and is backed by a changelog) is then missing. Alsogiven that we call our connectors for those tables, DynamicTableSourceand DynamicTableSink.

In general, I find it contradicting that a TABLE can be "paused" or"resumed". From an English language perspective, this does soundincorrect. In my opinion (without much research yet), a continuousupdating trigger should rather be modelled as a CREATE MATERIALIZED VIEW(which users are familiar with?) or a new concept such as a CREATE TASK(that can be paused and resumed?).

How do you envision re-adding the functionality of a statement set, thatfans out to multiple tables? This is a very important use case for datapipelines.

Since the early days of Flink SQL, we were discussing `SELECT STREAM *FROM T EMIT 5 MINUTES`. Your proposal seems to rephrase STREAM and EMIT,into other keywords DYNAMIC TABLE and FRESHNESS. But the corefunctionality is still there. I'm wondering if we should widen the scope(maybe not part of this FLIP but a new FLIP) to follow the standard moreclosely. Making `SELECT * FROM t` bounded by default and use new syntaxfor the dynamic behavior. Flink 2.0 would be the perfect time for this,however, it would require careful discussions. What do you think?


Regards,
Timo


On 11.03.24 08:23, Ron liu wrote:

Hi, Dev


Lincoln Lee and I would like to start a discussion about FLIP-435:
Introduce a  New Dynamic Table for Simplifying Data Pipelines.


This FLIP is designed to simplify the development of data processing
pipelines. With Dynamic Tables with uniform SQL statements and
freshness, users can define batch and streaming transformations to
data in the same way, accelerate ETL pipeline development, and manage
task scheduling automatically.


For more details, see FLIP-435 [1]. Looking forward to your feedback.


[1]


Best,

Lincoln & Ron

Re: [DISCUSS] FLIP-435: Introduce a New Dynamic Table for Simplifying Data Pipelines

Reply via email to