[ 
https://issues.apache.org/jira/browse/YUNIKORN-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17853586#comment-17853586
 ] 

Manikandan R commented on YUNIKORN-2665:
----------------------------------------

Backported to branch 1.5

> Gang app originator pod changes after restart
> ---------------------------------------------
>
>                 Key: YUNIKORN-2665
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-2665
>             Project: Apache YuniKorn
>          Issue Type: Bug
>          Components: shim - kubernetes
>    Affects Versions: 1.3.0, 1.4.0, 1.5.0, 1.5.1
>            Reporter: Manikandan R
>            Assignee: Manikandan R
>            Priority: Critical
>              Labels: pull-request-available
>
> Gang app choose the first pod (who created the app) as originator pod which 
> becomes the real driver pod later. While processing gang app specifically 
> after the placeholder creation and in the process of replacement, restart can 
> lead to the below described incorrect behaviour:
> During restore, there is no guarantee on the ordering of pods coming from K8s 
> lister especially when all the pods created with the same second timestamp. 
> k8s use the seconds based timestamp, which means all pods created with in 
> same second has same timestamp. During this situation, whichever pod comes 
> first from lister, YK designate it as originator pod. So, any placeholder 
> could become the originator pod and actual originator pod has been lost. This 
> change could cause rippling effects leading to weird behaviour and needs to 
> be fixed.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@yunikorn.apache.org
For additional commands, e-mail: issues-h...@yunikorn.apache.org

Reply via email to