[
https://issues.apache.org/jira/browse/FLINK-38414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yanquan Lv reassigned FLINK-38414:
----------------------------------
Assignee: Poorvank Bhatia
> Add Vitess Pipeline Connector with Parallel Shard Reading Support
> -----------------------------------------------------------------
>
> Key: FLINK-38414
> URL: https://issues.apache.org/jira/browse/FLINK-38414
> Project: Flink
> Issue Type: New Feature
> Components: Flink CDC
> Reporter: Poorvank Bhatia
> Assignee: Poorvank Bhatia
> Priority: Major
>
> The current Vitess CDC connector
> ([flink-connector-vitess-cdc|https://github.com/apache/flink-cdc/tree/master/flink-cdc-connect/flink-cdc-source-connectors/flink-connector-vitess-cdc])
> is based on
> [Debezium|https://github.com/apache/flink-cdc/blob/master/flink-cdc-connect/flink-cdc-source-connectors/flink-connector-vitess-cdc/src/main/java/org/apache/flink/cdc/connectors/vitess/VitessSource.java#L256]
> and has a critical limitation: it's hardcoded to use a single task
> (tasks.max=1), making it impossible to parallelize reading from sharded
> Vitess keyspaces. This is a major bottleneck for production deployments with
> hundreds of shards.
> This ticket proposes adding a new pipeline connector
> (flink-cdc-pipeline-connector-vitess) that leverages FLIP-27 Source API to
> enable parallel reading with one worker per shard.
>
> References{*}{*}
> - https://vitess.io/docs/concepts/vstream/
> - https://cwiki.apache.org/confluence/display/FLINK/FLIP-27
--
This message was sent by Atlassian Jira
(v8.20.10#820010)