James Peach created MESOS-6576: ---------------------------------- Summary: DefaultExecutorTest.KillTaskGroupOnTaskFailure sometimes fails in CI Key: MESOS-6576 URL: https://issues.apache.org/jira/browse/MESOS-6576 Project: Mesos Issue Type: Bug Components: tests Reporter: James Peach
{{DefaultExecutorTest.KillTaskGroupOnTaskFailure}} sometimes fails in the ASF CI. Interesting pieces of the failing test run: {noformat} ... I1110 20:38:54.775871 29740 status_update_manager.cpp:323] Received status update TASK_KILLED (UUID: a4746389-8155-44e0-ada4-00b8d3e997c1) for task df99cc50-9b0f-4692-afc9-d587c3515a67 of framework 2df0125f-4865-4aba-b13d-02f338815729-0000 I1110 20:38:54.776181 29730 slave.cpp:4075] Status update manager successfully handled status update TASK_KILLED (UUID: a4746389-8155-44e0-ada4-00b8d3e997c1) for task df99cc50-9b0f-4692-afc9-d587c3515a67 of framework 2df0125f-4865-4aba-b13d-02f338815729-0000 I1110 20:38:55.456354 29738 hierarchical.cpp:1880] Filtered offer with cpus(*):1.7; mem(*):928; disk(*):928; ports(*):[31000-32000] on agent 2df0125f-4865-4aba-b13d-02f338815729-S0 for framework 2df0125f-4865-4aba-b13d-02f338815729-0000 I1110 20:38:55.456434 29738 hierarchical.cpp:1694] No allocations performed I1110 20:38:55.456468 29738 hierarchical.cpp:1789] No inverse offers to send out! I1110 20:38:55.456545 29738 hierarchical.cpp:1286] Performed allocation for 1 agents in 745185ns I1110 20:38:55.875964 29731 containerizer.cpp:2336] Container a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 has exited I1110 20:38:55.876022 29731 containerizer.cpp:1973] Destroying container a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 in RUNNING state I1110 20:38:55.876387 29731 launcher.cpp:143] Asked to destroy container a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 I1110 20:38:55.881464 29728 provisioner.cpp:324] Ignoring destroy request for unknown container a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 I1110 20:38:55.882894 29730 slave.cpp:4672] Executor 'default' of framework 2df0125f-4865-4aba-b13d-02f338815729-0000 exited with status 0 I1110 20:38:55.883446 29741 master.cpp:5884] Executor 'default' of framework 2df0125f-4865-4aba-b13d-02f338815729-0000 on agent 2df0125f-4865-4aba-b13d-02f338815729-S0 at slave(18)@172.17.0.2:36164 (ade222407ffe): exited with status 0 I1110 20:38:55.883545 29741 master.cpp:7840] Removing executor 'default' with resources cpus(*):0.1; mem(*):32; disk(*):32 of framework 2df0125f-4865-4aba-b13d-02f338815729-0000 on agent 2df0125f-4865-4aba-b13d-02f338815729-S0 at slave(18)@172.17.0.2:36164 (ade222407ffe) I1110 20:38:55.884820 29729 hierarchical.cpp:1018] Recovered cpus(*):0.1; mem(*):32; disk(*):32 (total: cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000], allocated: cpus(*):0.2; mem(*):64; disk(*):64) on agent 2df0125f-4865-4aba-b13d-02f338815729-S0 from framework 2df0125f-4865-4aba-b13d-02f338815729-0000 I1110 20:38:55.885892 29737 scheduler.cpp:675] Enqueuing event FAILURE received from <a href='http://172.17.0.2:36164/master/api/v1/scheduler'>http://172.17.0.2:36164/master/api/v1/scheduler</a> GMOCK WARNING: Uninteresting mock function call - returning directly. Function call: failure(0x7ffdc4df11f0, @0x2b639800b6b0 48-byte object <90-82 AC-51 63-2B 00-00 00-00 00-00 00-00 00-00 07-00 00-00 00-00 00-00 70-0A 01-98 63-2B 00-00 20-C7 00-98 63-2B 00-00 00-00 00-00 63-2B 00-00>) ... I1110 20:39:04.566794 29732 master.cpp:7715] Updating the state of task e72d5139-0a11-48af-9d43-d4163c1404ee of framework 2df0125f-4865-4aba-b13d-02f338815729-0000 (latest state: TASK_FAILED, status update state: TASK_RUNNING) ... I1110 20:39:04.569413 29736 scheduler.cpp:675] Enqueuing event UPDATE received from <a href='http://172.17.0.2:36164/master/api/v1/scheduler'>http://172.17.0.2:36164/master/api/v1/scheduler</a> ../../src/tests/default_executor_tests.cpp:583: Failure Value of: taskStates Actual: { (df99cc50-9b0f-4692-afc9-d587c3515a67, TASK_KILLED), (e72d5139-0a11-48af-9d43-d4163c1404ee, TASK_FAILED) } Expected: expectedTaskStates Which is: { (df99cc50-9b0f-4692-afc9-d587c3515a67, TASK_RUNNING), (e72d5139-0a11-48af-9d43-d4163c1404ee, TASK_RUNNING) } ... {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)