[ 
https://issues.apache.org/jira/browse/FLINK-39566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Purushottam Sinha updated FLINK-39566:
--------------------------------------
    Description: 
Problem
Today the Checkpoints tab is a wall of numbers. To find why a checkpoint was 
slow, you scroll through subtask rows and eyeball columns. It's hard to spot 
stragglers or compare checkpoints over time.         
                              
Proposal                                                                        
                                                                                
                                               
Add a Gantt-style view to the Checkpoints tab:
- Recent strip — last 60(or max possible if less) checkpoints as colored bars 
(width = duration, color = status: completed / savepoint / in-progress / 
failed). Auto-refreshes.                                                        
 
- Per-checkpoint Gantt — one row per subtask, grouped by operator, with stacked 
segments for the four checkpoint phases. Sorted by total duration so stragglers 
float to the top. Outliers and aborts highlighted.                              
                                                                                
                                                                                
     
                              
Interactions                                                                    
                                                                                
                                               
- Click a bar in the strip → pins the Gantt to that checkpoint
- "Follow newest" → resume live tracking
- "Export PNG" → snapshot for incident reports
                                                                                
                                                                                
                                                 
Scope                       
Frontend-only. Uses existing REST endpoints, renders via @antv/g2. No backend 
changes.

  was:
Problem
Today the Checkpoints tab is a wall of numbers. To find why a checkpoint was 
slow, you scroll through subtask rows and eyeball columns. It's hard to spot 
stragglers or compare checkpoints over time.         
                              
Proposal                                                                        
                                                                                
                                               
Add a Gantt-style view to the Checkpoints tab:
- Recent strip — last 60 checkpoints as colored bars (width = duration, color = 
status: completed / savepoint / in-progress / failed). Auto-refreshes.          
                                               
- Per-checkpoint Gantt — one row per subtask, grouped by operator, with stacked 
segments for the four checkpoint phases. Sorted by total duration so stragglers 
float to the top. Outliers and aborts          
  highlighted.                                                                  
                                                                                
                                                 
                              
Interactions                                                                    
                                                                                
                                               
- Click a bar in the strip → pins the Gantt to that checkpoint
- "Follow newest" → resume live tracking
- "Export PNG" → snapshot for incident reports
                                                                                
                                                                                
                                                 
Scope                       
Frontend-only. Uses existing REST endpoints, renders via @antv/g2. No backend 
changes.


> [runtime-web] Add checkpoint duration Gantt view
> ------------------------------------------------
>
>                 Key: FLINK-39566
>                 URL: https://issues.apache.org/jira/browse/FLINK-39566
>             Project: Flink
>          Issue Type: New Feature
>          Components: Runtime / Web Frontend
>            Reporter: Purushottam Sinha
>            Priority: Minor
>              Labels: pull-request-available
>
> Problem
> Today the Checkpoints tab is a wall of numbers. To find why a checkpoint was 
> slow, you scroll through subtask rows and eyeball columns. It's hard to spot 
> stragglers or compare checkpoints over time.         
>                               
> Proposal                                                                      
>                                                                               
>                                                    
> Add a Gantt-style view to the Checkpoints tab:
> - Recent strip — last 60(or max possible if less) checkpoints as colored bars 
> (width = duration, color = status: completed / savepoint / in-progress / 
> failed). Auto-refreshes.                                                      
>    
> - Per-checkpoint Gantt — one row per subtask, grouped by operator, with 
> stacked segments for the four checkpoint phases. Sorted by total duration so 
> stragglers float to the top. Outliers and aborts highlighted.                 
>                                                                               
>                                                                               
>                       
>                               
> Interactions                                                                  
>                                                                               
>                                                    
> - Click a bar in the strip → pins the Gantt to that checkpoint
> - "Follow newest" → resume live tracking
> - "Export PNG" → snapshot for incident reports
>                                                                               
>                                                                               
>                                                      
> Scope                       
> Frontend-only. Uses existing REST endpoints, renders via @antv/g2. No backend 
> changes.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to