Lijie Wang created FLINK-25034:
----------------------------------
Summary: Support flexible number of subpartitions in
IntermediateResultPartition
Key: FLINK-25034
URL: https://issues.apache.org/jira/browse/FLINK-25034
Project: Flink
Issue Type: Sub-task
Components: Runtime / Coordination
Reporter: Lijie Wang
Currently, when a task is deployed, it needs to know the parallelism of its
consumer job vertex. This is because the consumer vertex parallelism is needed
to decide the _numberOfSubpartitions_ of _PartitionDescriptor_ which is part of
the {_}ResultPartitionDeploymentDescriptor{_}. The reason behind that is, at
the moment, for one result partition, different subpartitions serve different
consumer execution vertices. More specifically, one consumer execution vertex
only consumes data from subpartition with the same index.
Considering a dynamic graph, the parallelism of a job vertex may not have been
decided when its upstream vertices are deployed. To enable Flink to work in
this case, we need a way to allow an execution vertex to run without knowing
the parallelism of its consumer job vertices. One basic idea is to enable
multiple subpartitions in one result partition to serve the same consumer
execution vertex.
To achieve this goal, we can set the number of subpartitions to be the *max
parallelism* of the consumer job vertex. When the consumer vertex is deployed,
it should be assigned with a subpartition range to consume.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)