Bowen Li created FLINK-39980:
--------------------------------

             Summary: Implement split rebalance/reassignment support in 
DynamicKafkaSource after metadata changes
                 Key: FLINK-39980
                 URL: https://issues.apache.org/jira/browse/FLINK-39980
             Project: Flink
          Issue Type: New Feature
          Components: Connectors / Kafka
    Affects Versions: 2.3.0
            Reporter: Bowen Li


DynamicKafkaSource can remove clusters/topics when stream metadata changes. 
After removal, some source subtasks may become empty while other subtasks still 
own active partitions. This can happen even when {{{}parallelism <= total 
active partitions{}}}, because the current assignment is inherited from prior 
split ownership and does not rebalance active splits across the remaining 
readers.

Flink has added a source contract for enumerators to regain a global split view 
and perform reassignment/rebalancing after recovery: {{apache/flink#27149}} / 
FLIP-537. DynamicKafkaSource should implement this contract once it is 
available in our Flink runtime.

*Goals*
Implement the new rebalance/reassignment interface in DynamicKafkaSource so 
that after metadata removal or recovery, active splits are redistributed across 
available readers.

*Expected Behavior*
When active Kafka partitions remain and {{{}parallelism <= total active 
partitions{}}}, DynamicKafkaSource should avoid empty source subtasks by 
rebalancing active splits across readers.

When {{{}total active partitions < parallelism{}}}, empty subtasks are 
expected, but they should be handled by idleness logic and autoscaler 
calibration.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to