[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-09-25 Thread Zhu Zhu (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhu Zhu updated FLINK-23826:

Description: 
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 --- all-to-all-connected -->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of task deployment (task deploying started -> all tasks in RUNNING)
3. time of making task failure recovery decision (JM notified about task 
failure -> tasks to restart decided)



  was:
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 --- all-to-all-connected -->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of task deployment (task deploying started -> all tasks in 
INITIALIZATION)
3. time of making task failure recovery decision (JM notified about task 
failure -> tasks to restart decided)




> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Assignee: Zhu Zhu
>Priority: Major
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of task deployment (task deploying started -> all tasks in RUNNING)
> 3. time of making task failure recovery decision (JM notified about task 
> failure -> tasks to restart decided)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-09-14 Thread Zhu Zhu (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhu Zhu updated FLINK-23826:

Priority: Major  (was: Critical)

> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Assignee: Zhu Zhu
>Priority: Major
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of task deployment (task deploying started -> all tasks in 
> INITIALIZATION)
> 3. time of making task failure recovery decision (JM notified about task 
> failure -> tasks to restart decided)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-09-14 Thread Zhu Zhu (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhu Zhu updated FLINK-23826:

Description: 
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 --- all-to-all-connected -->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of task deployment (task deploying started -> all tasks in 
INITIALIZATION)
3. time of making task failure recovery decision (JM notified about task 
failure -> tasks to restart decided)



  was:
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 --- all-to-all-connected -->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of job scheduling and deployment (scheduling started -> all tasks in 
INITIALIZATION)
3. time of job restarting on task failover (JM notified about task failure -> 
all tasks in INITIALIZATION again)




> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Assignee: Zhu Zhu
>Priority: Critical
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of task deployment (task deploying started -> all tasks in 
> INITIALIZATION)
> 3. time of making task failure recovery decision (JM notified about task 
> failure -> tasks to restart decided)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-09-14 Thread Zhu Zhu (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhu Zhu updated FLINK-23826:

Description: 
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 --- all-to-all-connected -->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of job scheduling and deployment (scheduling started -> all tasks in 
INITIALIZATION)
3. time of job restarting on task failover (JM notified about task failure -> 
all tasks in INITIALIZATION again)



  was:
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 --- all-to-all-connected -->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of job scheduling and deployment (scheduling started -> all tasks in 
INITIALIZATION)
3. time of job restarting on task failover (JM notified about task failure -> 
all tasks in INITIALIZATION again)
4. master heap memory required




> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Assignee: Zhu Zhu
>Priority: Critical
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of job scheduling and deployment (scheduling started -> all tasks in 
> INITIALIZATION)
> 3. time of job restarting on task failover (JM notified about task failure -> 
> all tasks in INITIALIZATION again)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-09-13 Thread Zhu Zhu (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhu Zhu updated FLINK-23826:

Priority: Critical  (was: Blocker)

> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Assignee: Zhilong Hong
>Priority: Critical
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of job scheduling and deployment (scheduling started -> all tasks in 
> INITIALIZATION)
> 3. time of job restarting on task failover (JM notified about task failure -> 
> all tasks in INITIALIZATION again)
> 4. master heap memory required



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-08-25 Thread Xintong Song (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xintong Song updated FLINK-23826:
-
Labels:   (was: release-testing)

> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Priority: Blocker
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of job scheduling and deployment (scheduling started -> all tasks in 
> INITIALIZATION)
> 3. time of job restarting on task failover (JM notified about task failure -> 
> all tasks in INITIALIZATION again)
> 4. master heap memory required



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-08-17 Thread Xintong Song (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xintong Song updated FLINK-23826:
-
Labels: release-testing  (was: )

> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Priority: Blocker
>  Labels: release-testing
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of job scheduling and deployment (scheduling started -> all tasks in 
> INITIALIZATION)
> 3. time of job restarting on task failover (JM notified about task failure -> 
> all tasks in INITIALIZATION again)
> 4. master heap memory required



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-08-16 Thread Zhu Zhu (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhu Zhu updated FLINK-23826:

Description: 
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 --- all-to-all-connected -->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of job scheduling and deployment (scheduling started -> all tasks in 
INITIALIZATION)
3. time of job restarting on task failover (JM notified about task failure -> 
all tasks in INITIALIZATION again)
4. master heap memory required



  was:
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 ---all-to-all-connected-->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of job scheduling and deployment (scheduling started -> all tasks in 
INITIALIZATION)
3. time of job restarting on task failover (JM notified about task failure -> 
all tasks in INITIALIZATION again)
4. master heap memory required




> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Priority: Blocker
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 --- all-to-all-connected 
> -->1* job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of job scheduling and deployment (scheduling started -> all tasks in 
> INITIALIZATION)
> 3. time of job restarting on task failover (JM notified about task failure -> 
> all tasks in INITIALIZATION again)
> 4. master heap memory required



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-23826) Verify optimized scheduler performance for large-scale jobs

2021-08-16 Thread Zhu Zhu (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-23826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhu Zhu updated FLINK-23826:

Description: 
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include, for a *1 ---all-to-all-connected-->1* 
job:
1. time of job initialization on master (job received -> scheduling started)
2. time of job scheduling and deployment (scheduling started -> all tasks in 
INITIALIZATION)
3. time of job restarting on task failover (JM notified about task failure -> 
all tasks in INITIALIZATION again)
4. master heap memory required



  was:
This ticket is used to verify the result of FLINK-21110.
It should check if large scale jobs' scheduling are working well and the 
scheduling performance, with a real job running on cluster. 

The conclusion should include:
1. time of job initialization on master (job received -> scheduling started)
2. time of job scheduling and deployment (scheduling started -> all tasks in 
INITIALIZATION)
3. time of job restarting on task failover (JM notified about task failure -> 
all tasks in INITIALIZATION again)
4. master heap memory required for large scale jobs




> Verify optimized scheduler performance for large-scale jobs
> ---
>
> Key: FLINK-23826
> URL: https://issues.apache.org/jira/browse/FLINK-23826
> Project: Flink
>  Issue Type: Sub-task
>  Components: Runtime / Coordination
>Affects Versions: 1.14.0
>Reporter: Zhu Zhu
>Priority: Blocker
> Fix For: 1.14.0
>
>
> This ticket is used to verify the result of FLINK-21110.
> It should check if large scale jobs' scheduling are working well and the 
> scheduling performance, with a real job running on cluster. 
> The conclusion should include, for a *1 ---all-to-all-connected-->1* 
> job:
> 1. time of job initialization on master (job received -> scheduling started)
> 2. time of job scheduling and deployment (scheduling started -> all tasks in 
> INITIALIZATION)
> 3. time of job restarting on task failover (JM notified about task failure -> 
> all tasks in INITIALIZATION again)
> 4. master heap memory required



--
This message was sent by Atlassian Jira
(v8.3.4#803005)