zhijiang created FLINK-6120:
-------------------------------
Summary: Implement heartbeat logic between JobMaster and
ResourceManager
Key: FLINK-6120
URL: https://issues.apache.org/jira/browse/FLINK-6120
Project: Flink
Issue Type: Improvement
Reporter: zhijiang
Assignee: zhijiang
It is part of work for Flip-6.
The HeartbeatManager is mainly used for monitoring heartbeat target and
reporting payloads.
For {{ResourceManager}} side, it would trigger monitoring the
{{HeartbeatTarget}} when receive registration from {{JobMaster}}, and schedule
a task to {{requestHeartbeat}} at interval time. If not receive heartbeat
response within duration time, the {{HeartbeatListener}} will notify heartbeat
timeout, then the {{ResourceManager}} should remove the internal registered
{{JobMaster}}.
For {{JobMaster}} side, it would trigger monitoring the {{HeartbeatTarget}}
when receive registration acknowledgement from {{ResourceManager}}. An it will
also be notified heartbeat timeout if not receive heartbeat request from
{{ResourceManager}} within duration time.
The current implementation will not interact payloads via heartbeat, and it can
be added if needed future.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)