Hi devs,

As Flink CDC 3.6 was released recently, I’d like to propose a discussion on the 
next Flink CDC 3.7 release cycle.


In this version, we’d like to focus on the following development tasks:


1) Multimodal data support. More data types (OBJECTREF, BLOB, VECTOR, TENSOR) 
could be added for unstructured data ingestion sources. Built-in multimodal 
processing functions could be added, and sinks like Paimon and Fluss could 
write them in a standard way.


2) AI ecosystem integration. Align AI model and function APIs with Flink SQL 
edition. A Python SDK is planned for constructing YAML pipelines and 
interoperating UDF with the Python ecosystem.


3) Connector ecosystem enhancement. For MySQL CDC, conditional snapshot, 
source-side rate-limiting, and binlog deserialization optimization are on the 
roadmap. More connectors like HLS (capturing live-streaming media) and Fluss 
will be added, too.


For developers who are interested in participating and contributing new 
features in this release cycle, please feel free to create a task that targets 
cdc-3.7.0 to trace your planning features [1].


I'm happy to volunteer as a release manager and open to working together with 
someone on this.


To ensure that we can complete the above plan, we plan to complete the 
development of Flink CDC 3.7 on 2026-06-28, with a 2-month developing cycle.


Best Regards,

Xiqian


[1] 
https://issues.apache.org/jira/issues/?jql=project%20%3D%20FLINK%20AND%20component%20%3D%20%22Flink%20CDC%22

Reply via email to