Weihua Hu created FLINK-31498:
---------------------------------
Summary: DeclarativeSlotManager always requests redundant task
manager when resources are not enough
Key: FLINK-31498
URL: https://issues.apache.org/jira/browse/FLINK-31498
Project: Flink
Issue Type: Bug
Components: Runtime / Coordination
Reporter: Weihua Hu
Attachments: image-2023-03-17-18-05-43-088.png
Currently, the redundant task manager check in DeclarativeSlotManager only compares
free slots with the required redundant slots.
When there are not enough resources in YARN/Kubernetes, this mechanism will
keep trying to request a new task manager on every check.
There are two ways to address this:
1. Maintain the state of redundant workers to avoid requesting them twice.
2. Only try to request redundant workers when there is no pending worker.
The first way would make the redundant worker logic too complicated, so I would
like to choose the second way.
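
A minimal sketch of the difference between the current check and the second option, for illustration only. The class RedundantWorkerCheck, its method names, and its parameters are hypothetical and are not Flink's actual DeclarativeSlotManager API. The simulation assumes a cluster that never fulfils a worker request, and simplifies each check round to at most one new request.

```java
/**
 * Hypothetical sketch; names are illustrative and not part of Flink.
 */
public final class RedundantWorkerCheck {

    private RedundantWorkerCheck() {}

    /** Current behavior as described above: only free slots are compared. */
    public static boolean shouldRequestCurrent(int freeSlots, int requiredRedundantSlots) {
        return freeSlots < requiredRedundantSlots;
    }

    /** Second option: skip the request while any worker is still pending. */
    public static boolean shouldRequestProposed(
            int freeSlots, int requiredRedundantSlots, int pendingWorkers) {
        if (pendingWorkers > 0) {
            // A worker was already requested and has not arrived yet.
            return false;
        }
        return freeSlots < requiredRedundantSlots;
    }

    public static void main(String[] args) {
        // Assumption: YARN/Kubernetes has no resources, so requests stay
        // pending and the number of free slots never grows.
        int freeSlots = 0;
        int requiredRedundantSlots = 2;
        int pendingWithCurrentCheck = 0;
        int pendingWithProposedCheck = 0;

        for (int round = 0; round < 5; round++) {
            if (shouldRequestCurrent(freeSlots, requiredRedundantSlots)) {
                pendingWithCurrentCheck++;
            }
            if (shouldRequestProposed(
                    freeSlots, requiredRedundantSlots, pendingWithProposedCheck)) {
                pendingWithProposedCheck++;
            }
        }

        // Prints "current: 5, proposed: 1"
        System.out.println(
                "current: " + pendingWithCurrentCheck
                        + ", proposed: " + pendingWithProposedCheck);
    }
}
```

With the pending-worker guard, the number of outstanding requests stays bounded without tracking which workers are redundant, which is why it is simpler than the first option.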
Looking forward to any suggestions.
!image-2023-03-17-18-05-43-088.png!
--
This message was sent by Atlassian Jira
(v8.20.10#820010)