Zhu Zhu created FLINK-28258: ------------------------------- Summary: Introduce ExecutionHistory to host historical executions for each execution vertex Key: FLINK-28258 URL: https://issues.apache.org/jira/browse/FLINK-28258 Project: Flink Issue Type: Sub-task Components: Runtime / Coordination Reporter: Zhu Zhu Fix For: 1.16.0
With speculative execution, tracking prior executions in an {{EvictingBoundedList}} does not work. This is because when using {{EvictingBoundedList}} relies on the assumption that the historical executions are added in ascending order of attempt number successively. This is no longer true if speculative execution is enabled. e.g. 3 speculative execution attempts #1, #2, #3 are running concurrently, later #3 failed, and then #1 failed, and execution attempt #2 keeps running. The broken assumption may result in exceptions in REST, job archiving and so on. We propose to introduce an {{ExecutionHistory}} to replace {{EvictingBoundedList}}. It hosts the historical executions in a {{LinkedHashMap}} with a size bound. When the map grows beyond the size bound, elements are dropped from the head of the map (FIFO order). -- This message was sent by Atlassian Jira (v8.20.7#820007)