Maxence Cramet created AIRFLOW-6810: ---------------------------------------
Summary: KubernetesPodOperator pod is completed but xcom side car is stuck Key: AIRFLOW-6810 URL: https://issues.apache.org/jira/browse/AIRFLOW-6810 Project: Apache Airflow Issue Type: Bug Components: executor-kubernetes Affects Versions: 1.10.6 Reporter: Maxence Cramet Assignee: Daniel Imberman We're using KubernetesPodOperator with param xcom_push=true in order to push information from our task. >From time to time the main pod completes but the side car pod is stuck. Here's the output of the pods details: {noformat} kubectl describe pod my_pod Name: my_pod Namespace: default Priority: 0 PriorityClassName: <none> Node: xxx Start Time: Wed, 05 Feb 2020 11:12:33 +0000 Labels: xxx Annotations: xxx Status: Running IP: xxx Containers: base: Container ID: xxx Image: xxx Image ID: xxx Port: <none> Host Port: <none> Args: xxx State: Terminated Reason: Completed Exit Code: 0 Started: Wed, 05 Feb 2020 11:12:38 +0000 Finished: Wed, 05 Feb 2020 11:12:47 +0000 Ready: False Restart Count: 0 Limits: memory: 512Mi Requests: memory: 512Mi Environment: xxx Mounts: /airflow/xcom from xcom (rw) airflow-xcom-sidecar: Container ID: docker://83053d7d292cda9156454ac13064d72ace1e4f72738ba9b62b04ff57cb7966cc Image: alpine Image ID: docker-pullable://alpine@sha256:ab00606a42621fb68f2ed6ad3c88be54397f981a7b70a79db3d1172b11c4367d Port: <none> Host Port: <none> Command: sh -c trap "exit 0" INT; while true; do sleep 30; done; State: Running Started: Wed, 05 Feb 2020 11:12:40 +0000 Ready: True Restart Count: 0 Limits: memory: 4Gi Requests: cpu: 1m memory: 2Gi Environment: <none> Mounts: /airflow/xcom from xcom (rw) xxx Conditions: Type Status Initialized True Ready False ContainersReady False PodScheduled True Volumes: xcom: Type: EmptyDir (a temporary directory that shares a pod's lifetime) Medium: SizeLimit: <unset> xxx QoS Class: Burstable Node-Selectors: <none> Tolerations: node.kubernetes.io/not-ready:NoExecute for 300s node.kubernetes.io/unreachable:NoExecute for 300s Events: <none>{noformat} I don't have more information of the possible causes of that. -- This message was sent by Atlassian Jira (v8.3.4#803005)