Meng Zhu created MESOS-9359:
-------------------------------

             Summary: Test `DefaultExecutorTest.MaxCompletionTime` is flaky.
                 Key: MESOS-9359
                 URL: https://issues.apache.org/jira/browse/MESOS-9359
             Project: Mesos
          Issue Type: Bug
            Reporter: Meng Zhu
         Attachments: DefaultExecutorTest.MaxCompletionTime

The test is flaky because its task, which runs the simple command `exit 0`, took more than 2 seconds to complete even though its container had successfully transitioned to the RUNNING state. The test does not expect this. Log snippets:

{noformat}
I1026 21:48:04.353044 30084 containerizer.cpp:1939] Launching 'mesos-containerizer' with flags '--help="false" --launch_info="{"command":{"shell":true,"value":"exit 0"},"environment":{"variables":[{"name":"MESOS_SANDBOX","type":"VALUE","value":"/tmp/MesosContainerizer_DefaultExecutorTest_MaxCompletionTime_0_Owa0zJ/slaves/7f85af3e-a8b9-4252-b8f4-8d1a9bd0f1eb-S0/frameworks/7f85af3e-a8b9-4252-b8f4-8d1a9bd0f1eb-0000/executors/default/runs/72c8a5d0-1d90-4eff-8fc0-c690139c1664/containers/8e6b6380-ea9e-44f6-8abf-dd5bb6878739"},{"name":"MESOS_CONTAINER_IP","type":"VALUE","value":"172.16.10.21"}]},"task_environment":{},"user":"root","working_directory":"/tmp/MesosContainerizer_DefaultExecutorTest_MaxCompletionTime_0_Owa0zJ/slaves/7f85af3e-a8b9-4252-b8f4-8d1a9bd0f1eb-S0/frameworks/7f85af3e-a8b9-4252-b8f4-8d1a9bd0f1eb-0000/executors/default/runs/72c8a5d0-1d90-4eff-8fc0-c690139c1664/containers/8e6b6380-ea9e-44f6-8abf-dd5bb6878739"}" --pipe_read="99" --pipe_write="100" --runtime_directory="/tmp/MesosContainerizer_DefaultExecutorTest_MaxCompletionTime_0_g87RiO/containers/72c8a5d0-1d90-4eff-8fc0-c690139c1664/containers/8e6b6380-ea9e-44f6-8abf-dd5bb6878739" --unshare_namespace_mnt="false"'
...
I1026 21:48:04.495669 30084 containerizer.cpp:3122] Transitioning the state of container 72c8a5d0-1d90-4eff-8fc0-c690139c1664.8e6b6380-ea9e-44f6-8abf-dd5bb6878739 from FETCHING to RUNNING
...
I1026 21:48:06.506870 16743 default_executor.cpp:1293] Killing task 8a5fa021-1b4a-4bfc-a3df-848eb610a9f5 which exceeded its maximum completion time of 2secs
I1026 21:48:06.529467 16743 default_executor.cpp:1126] Killing task 8a5fa021-1b4a-4bfc-a3df-848eb610a9f5 running in child container 72c8a5d0-1d90-4eff-8fc0-c690139c1664.8e6b6380-ea9e-44f6-8abf-dd5bb6878739 with SIGTERM signal
{noformat}

Full log attached.
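
As a rough check of the timing in the snippet, the gap between the RUNNING transition and the first kill message can be computed from the log timestamps. This is only a sketch: the helper name `glog_time_of_day` is hypothetical, the two log lines are shortened copies of the ones quoted in this report, and it assumes both lines were logged on the same day. It measures the gap between the two log lines, not the exact interval tracked by the executor.

```python
from datetime import datetime, timedelta


def glog_time_of_day(line: str) -> datetime:
    """Parse the HH:MM:SS.uuuuuu field (second token) of a glog-style line.

    Hypothetical helper; assumes the 'I<MMDD> <time> <thread> <file>:<line>]' prefix.
    """
    return datetime.strptime(line.split()[1], "%H:%M:%S.%f")


# Shortened copies of the two log lines from the snippet in this report.
running = "I1026 21:48:04.495669 30084 containerizer.cpp:3122] Transitioning the state of container"
killed = "I1026 21:48:06.506870 16743 default_executor.cpp:1293] Killing task"

# Time between the container reaching RUNNING and the executor's kill message.
elapsed = glog_time_of_day(killed) - glog_time_of_day(running)

# The maximum completion time reported in the kill message ("2secs").
limit = timedelta(seconds=2)

print(f"elapsed: {elapsed.total_seconds()}s, over limit: {elapsed > limit}")
```

With these timestamps the gap is slightly over 2 seconds, which matches the report that `exit 0` had not completed by the time the maximum completion time expired.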