Till Rohrmann created FLINK-16866:
-------------------------------------
Summary: Make job submission non-blocking
Key: FLINK-16866
URL: https://issues.apache.org/jira/browse/FLINK-16866
Project: Flink
Issue Type: Improvement
Components: Runtime / Coordination
Affects Versions: 1.10.0, 1.9.2, 1.11.0
Reporter: Till Rohrmann
Fix For: 1.11.0
Currently, Flink waits to acknowledge a job submission until the corresponding
{{JobManager}} has been created. Since its creation also involves the creation
of the {{ExecutionGraph}} and potential FS operations, it can take a bit of
time. If the user has configured a too low {{web.timeout}}, the submission can
time out only reporting a {{TimeoutException}} to the user.
I propose to change the notion of job submission slightly. Instead of waiting
until the {{JobManager}} has been created, a job submission is complete once
all job relevant files have been uploaded to the {{Dispatcher}} and the
{{Dispatcher}} has been told about it. Creating the {{JobManager}} will then
belong to the actual job execution. Consequently, if problems occur while
creating the {{JobManager}} it will result into a job failure.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)