[jira] [Commented] (FLINK-37769) Include cause in event when Restarting unhealthy job

david radley (Jira) Wed, 07 May 2025 03:06:04 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-37769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17949951#comment-17949951
 ]


david radley commented on FLINK-37769:
--------------------------------------

I wonder whether there is additional information we can get from the job 
before we restart it that would be helpful in the message, to add further 
clarity as to the cause of the unhealthy job.

> Include cause in event when Restarting unhealthy job
> ----------------------------------------------------
>
>                 Key: FLINK-37769
>                 URL: https://issues.apache.org/jira/browse/FLINK-37769
>             Project: Flink
>          Issue Type: Improvement
>          Components: Kubernetes Operator
>            Reporter: Gyula Fora
>            Assignee: Daren Wong
>            Priority: Major
>
> When the Kubernetes operator restarts an unhealthy job for any reason, an 
> event is triggered with the following message:
> "Restarting unhealthy job"
> We should extend this message to include the reason the job was determined 
> to be unhealthy:
> "Restarting unhealthy job: X restarts within Y minutes"
> "Restarting unhealthy job: No checkpoint taken within X minutes"
> etc.
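
As a rough illustration of the message format proposed above, here is a minimal
sketch in Java. It is not taken from the operator code base: the class
UnhealthyRestartMessage and its methods withCause, tooManyRestarts and
noCheckpoint are hypothetical names chosen for this example, and how the
operator actually detects and reports an unhealthy job may differ.

```java
import java.time.Duration;
import java.util.Optional;

/**
 * Hypothetical sketch only: none of these names come from the Flink
 * Kubernetes Operator. It shows one way to append a cause to the event text.
 */
final class UnhealthyRestartMessage {
    // The base text is the message quoted in the issue description.
    static final String BASE = "Restarting unhealthy job";

    private UnhealthyRestartMessage() {}

    /** Returns the base message, followed by ": <cause>" when a non-blank cause is known. */
    static String withCause(Optional<String> cause) {
        return cause.filter(c -> !c.isBlank())
                .map(c -> BASE + ": " + c)
                .orElse(BASE);
    }

    /** Cause text for the "X restarts within Y minutes" case. */
    static String tooManyRestarts(int restarts, Duration window) {
        return restarts + " restarts within " + window.toMinutes() + " minutes";
    }

    /** Cause text for the "No checkpoint taken within X minutes" case. */
    static String noCheckpoint(Duration window) {
        return "No checkpoint taken within " + window.toMinutes() + " minutes";
    }
}
```

Falling back to the bare message when no cause is available keeps the existing
event text unchanged for any caller that cannot supply one.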



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
