[ 
https://issues.apache.org/jira/browse/FLINK-33944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vijay updated FLINK-33944:
--------------------------
    Description: 
 
We are running Flink 1.18. The job manager is deployed in "Application mode", 
and we are looking for a way to restore multiple jobs (each from its own 
savepoint directory) when the job manager starts. Currently, 
"standalone-job.sh" can restore only one job, using --fromSavepoint and 
--allowNonRestoredState. We need a way to trigger multiple job executions via 
a Java client.
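
One possible workaround, sketched here under assumptions rather than as a 
confirmed answer, is to start one application-mode job manager per job, each 
with its own --fromSavepoint. The helper below only builds the argument list 
for "standalone-job.sh"; the job class names and savepoint paths are 
hypothetical placeholders, and --job-classname is assumed to be how the entry 
class is selected.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class RestoreCommands {

    /** Builds the standalone-job.sh arguments for one job restored from one savepoint. */
    public static List<String> restoreArgs(String jobClass, String savepointPath,
                                           boolean allowNonRestoredState) {
        List<String> args = new ArrayList<>();
        args.add("standalone-job.sh");
        args.add("start");
        args.add("--job-classname");
        args.add(jobClass);
        args.add("--fromSavepoint");
        args.add(savepointPath);
        if (allowNonRestoredState) {
            args.add("--allowNonRestoredState");
        }
        return args;
    }

    public static void main(String[] unused) {
        // Hypothetical job classes and savepoint paths, one entry per job to restore.
        Map<String, String> savepointByJobClass = new LinkedHashMap<>();
        savepointByJobClass.put("com.example.OrdersJob", "/savepoints/orders/savepoint-1");
        savepointByJobClass.put("com.example.BillingJob", "/savepoints/billing/savepoint-2");

        // Print one command line per job; each would run in its own job manager.
        for (Map.Entry<String, String> e : savepointByJobClass.entrySet()) {
            System.out.println(String.join(" ", restoreArgs(e.getKey(), e.getValue(), true)));
        }
    }
}
```

With this layout each job gets its own job manager deployment, so the 
single-savepoint limit of "standalone-job.sh" applies per job rather than per 
cluster.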

Note: We are not using the native Kubernetes deployment; we run Flink in 
standalone mode on k8s.

Additional query: If there is a way to restore multiple jobs from their 
respective savepoints in "Application mode", is the same supported in Session 
mode?

*Expected process:*
 # Before starting the Flink/application image upgrade, trigger savepoints 
for all currently running jobs.
 # Once the savepoints have completed for all jobs, scale down the job manager 
and task manager instances.
 # Update the image version on the k8s deployment to the updated application 
image.
 # After the image version is updated, scale up the job manager and task 
manager.
 # Restore the previously running jobs from their savepoint directories and 
start all of them. This is the step we need a process for.
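
The first two steps above could be driven from a Java client through the job 
manager's REST API. The following is a minimal sketch, assuming the REST 
endpoint is reachable at a base URL such as http://jobmanager:8081 
(hypothetical) and using the JDK's java.net.http classes. It only builds the 
requests; sending them, parsing the JSON responses, and polling are left out.

```java
import java.net.URI;
import java.net.http.HttpRequest;

class SavepointRequests {

    /** GET /jobs returns the ids and statuses of the jobs known to the cluster. */
    public static HttpRequest listJobs(String restBase) {
        return HttpRequest.newBuilder(URI.create(restBase + "/jobs")).GET().build();
    }

    /**
     * JSON body for a savepoint trigger. Assumes targetDir contains no quotes
     * or backslashes, since no JSON escaping is done here.
     */
    public static String savepointBody(String targetDir) {
        return "{\"target-directory\":\"" + targetDir + "\",\"cancel-job\":false}";
    }

    /** POST /jobs/:jobid/savepoints; the response carries a "request-id" to poll. */
    public static HttpRequest triggerSavepoint(String restBase, String jobId, String targetDir) {
        return HttpRequest.newBuilder(URI.create(restBase + "/jobs/" + jobId + "/savepoints"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(savepointBody(targetDir)))
                .build();
    }

    /** GET /jobs/:jobid/savepoints/:triggerid reports the state of the trigger. */
    public static HttpRequest savepointStatus(String restBase, String jobId, String triggerId) {
        return HttpRequest.newBuilder(
                URI.create(restBase + "/jobs/" + jobId + "/savepoints/" + triggerId))
                .GET()
                .build();
    }
}
```

A client would send listJobs, call triggerSavepoint for every job reported as 
RUNNING, and poll savepointStatus until the status id is COMPLETED. The 
savepoint location in that response is the path to pass to --fromSavepoint 
when the job is started again after the upgrade.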


> Apache Flink: Process to restore more than one job on job manager startup 
> from the respective savepoints
> --------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-33944
>                 URL: https://issues.apache.org/jira/browse/FLINK-33944
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.18.0
>            Reporter: Vijay
>            Priority: Major
>



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
