[
https://issues.apache.org/jira/browse/FLINK-21467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17301066#comment-17301066
]
Kezhu Wang commented on FLINK-21467:
------------------------------------
Hi [~pnowojski], I guess it depends on various subtleties:
# "MAX_WATERMARK" could come from the last unaligned checkpoint.
# The last unaligned checkpoint could be considered completed but still fail in
the "notifyCheckpointComplete" phase.
# A recovered subtask gets splits assigned from either the source enumerator or
redistributed operator list state.
The key open questions are:
# Will "MAX_WATERMARK" be persisted in an unaligned checkpoint?
# When is an operator considered finished?
# Could a recovered finishing subtask get new splits assigned?
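To make the re-commit concern concrete, here is a minimal, self-contained Java sketch. It does not use Flink: {{BoundedInputLike}}, {{ExternalStore}}, {{NaiveCommitter}} and {{IdempotentCommitter}} are hypothetical names, and the failover is only simulated by creating a fresh operator instance and calling {{endInput}} again. It also assumes the commit id is deterministic across restarts, which a real job would have to guarantee by its own means.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class EndInputSketch {

    /** Hypothetical stand-in for BoundedOneInput; NOT the real Flink interface. */
    interface BoundedInputLike {
        void endInput();
    }

    /** Hypothetical external system that remembers which commit ids it has applied. */
    static final class ExternalStore {
        private final Set<String> appliedIds = new HashSet<>();
        private final List<String> commits = new ArrayList<>();

        /** Applies the result once per commit id; returns false for a repeated id. */
        boolean commitOnce(String commitId, String result) {
            if (!appliedIds.add(commitId)) {
                return false; // already committed before the failover
            }
            commits.add(result);
            return true;
        }

        int commitCount() {
            return commits.size();
        }
    }

    /** Commits unconditionally, so every endInput call produces another commit. */
    static final class NaiveCommitter implements BoundedInputLike {
        private final List<String> sink;

        NaiveCommitter(List<String> sink) {
            this.sink = sink;
        }

        @Override
        public void endInput() {
            sink.add("final-result");
        }
    }

    /** Commits under a fixed id, so repeated endInput calls collapse into one commit. */
    static final class IdempotentCommitter implements BoundedInputLike {
        private final ExternalStore store;
        private final String commitId; // assumed to be stable across restarts

        IdempotentCommitter(ExternalStore store, String commitId) {
            this.store = store;
            this.commitId = commitId;
        }

        @Override
        public void endInput() {
            store.commitOnce(commitId, "final-result");
        }
    }

    /** Simulates `calls` executions, each ending with endInput on a fresh instance. */
    static int naiveCommits(int calls) {
        List<String> sink = new ArrayList<>();
        for (int i = 0; i < calls; i++) {
            new NaiveCommitter(sink).endInput();
        }
        return sink.size();
    }

    /** Same simulation, but the external store deduplicates by commit id. */
    static int idempotentCommits(int calls) {
        ExternalStore store = new ExternalStore();
        for (int i = 0; i < calls; i++) {
            new IdempotentCommitter(store, "job-1-final").endInput();
        }
        return store.commitCount();
    }

    public static void main(String[] args) {
        System.out.println("naive commits after one failover: " + naiveCommits(2));
        System.out.println("idempotent commits after one failover: " + idempotentCommits(2));
    }
}
```

The sketch only shows why a commit in {{endInput}} has to tolerate being repeated. It does not answer the questions above about when a real operator is considered finished.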
> Document possible recommended usage of Bounded{One/Multi}Input.endInput and
> emphasize that they could be called multiple times
> ------------------------------------------------------------------------------------------------------------------------------
>
> Key: FLINK-21467
> URL: https://issues.apache.org/jira/browse/FLINK-21467
> Project: Flink
> Issue Type: Improvement
> Components: API / DataStream
> Affects Versions: 1.13.0
> Reporter: Kezhu Wang
> Priority: Major
>
> It is too tempting to use these APIs, especially {{BoundedOneInput.endInput}},
> to commit final results before FLIP-147 is delivered. This will cause a
> re-commit after failover, as [~gaoyunhaii] has pointed out in FLINK-21132.
> I have
> [pointed|https://github.com/apache/iceberg/issues/2033#issuecomment-784153620]
> this out in
> [apache/iceberg#2033|https://github.com/apache/iceberg/issues/2033], please
> correct me if I am wrong.
> cc [~aljoscha] [~pnowojski] [~roman_khachatryan]