James Peach created MESOS-6576:
----------------------------------

             Summary: DefaultExecutorTest.KillTaskGroupOnTaskFailure sometimes 
fails in CI
                 Key: MESOS-6576
                 URL: https://issues.apache.org/jira/browse/MESOS-6576
             Project: Mesos
          Issue Type: Bug
          Components: tests
            Reporter: James Peach


{{DefaultExecutorTest.KillTaskGroupOnTaskFailure}} sometimes fails in the ASF 
CI.

Interesting  pieces of the failing test run:
{noformat}
...
I1110 20:38:54.775871 29740 status_update_manager.cpp:323] Received status 
update TASK_KILLED (UUID: a4746389-8155-44e0-ada4-00b8d3e997c1) for task 
df99cc50-9b0f-4692-afc9-d587c3515a67 of framework 
2df0125f-4865-4aba-b13d-02f338815729-0000
I1110 20:38:54.776181 29730 slave.cpp:4075] Status update manager successfully 
handled status update TASK_KILLED (UUID: a4746389-8155-44e0-ada4-00b8d3e997c1) 
for task df99cc50-9b0f-4692-afc9-d587c3515a67 of framework 
2df0125f-4865-4aba-b13d-02f338815729-0000
I1110 20:38:55.456354 29738 hierarchical.cpp:1880] Filtered offer with 
cpus(*):1.7; mem(*):928; disk(*):928; ports(*):[31000-32000] on agent 
2df0125f-4865-4aba-b13d-02f338815729-S0 for framework 
2df0125f-4865-4aba-b13d-02f338815729-0000
I1110 20:38:55.456434 29738 hierarchical.cpp:1694] No allocations performed
I1110 20:38:55.456468 29738 hierarchical.cpp:1789] No inverse offers to send 
out!
I1110 20:38:55.456545 29738 hierarchical.cpp:1286] Performed allocation for 1 
agents in 745185ns
I1110 20:38:55.875964 29731 containerizer.cpp:2336] Container 
a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 has exited
I1110 20:38:55.876022 29731 containerizer.cpp:1973] Destroying container 
a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98 in RUNNING state
I1110 20:38:55.876387 29731 launcher.cpp:143] Asked to destroy container 
a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98
I1110 20:38:55.881464 29728 provisioner.cpp:324] Ignoring destroy request for 
unknown container a56ac08b-8f97-4ae4-a2e8-5ef5d55fbe98
I1110 20:38:55.882894 29730 slave.cpp:4672] Executor 'default' of framework 
2df0125f-4865-4aba-b13d-02f338815729-0000 exited with status 0
I1110 20:38:55.883446 29741 master.cpp:5884] Executor 'default' of framework 
2df0125f-4865-4aba-b13d-02f338815729-0000 on agent 
2df0125f-4865-4aba-b13d-02f338815729-S0 at slave(18)@172.17.0.2:36164 
(ade222407ffe): exited with status 0
I1110 20:38:55.883545 29741 master.cpp:7840] Removing executor 'default' with 
resources cpus(*):0.1; mem(*):32; disk(*):32 of framework 
2df0125f-4865-4aba-b13d-02f338815729-0000 on agent 
2df0125f-4865-4aba-b13d-02f338815729-S0 at slave(18)@172.17.0.2:36164 
(ade222407ffe)
I1110 20:38:55.884820 29729 hierarchical.cpp:1018] Recovered cpus(*):0.1; 
mem(*):32; disk(*):32 (total: cpus(*):2; mem(*):1024; disk(*):1024; 
ports(*):[31000-32000], allocated: cpus(*):0.2; mem(*):64; disk(*):64) on agent 
2df0125f-4865-4aba-b13d-02f338815729-S0 from framework 
2df0125f-4865-4aba-b13d-02f338815729-0000
I1110 20:38:55.885892 29737 scheduler.cpp:675] Enqueuing event FAILURE received 
from <a 
href='http://172.17.0.2:36164/master/api/v1/scheduler'>http://172.17.0.2:36164/master/api/v1/scheduler</a>

GMOCK WARNING:
Uninteresting mock function call - returning directly.
    Function call: failure(0x7ffdc4df11f0, @0x2b639800b6b0 48-byte object 
&lt;90-82 AC-51 63-2B 00-00 00-00 00-00 00-00 00-00 07-00 00-00 00-00 00-00 
70-0A 01-98 63-2B 00-00 20-C7 00-98 63-2B 00-00 00-00 00-00 63-2B 00-00&gt;)
...
I1110 20:39:04.566794 29732 master.cpp:7715] Updating the state of task 
e72d5139-0a11-48af-9d43-d4163c1404ee of framework 
2df0125f-4865-4aba-b13d-02f338815729-0000 (latest state: TASK_FAILED, status 
update state: TASK_RUNNING)
...
I1110 20:39:04.569413 29736 scheduler.cpp:675] Enqueuing event UPDATE received 
from <a 
href='http://172.17.0.2:36164/master/api/v1/scheduler'>http://172.17.0.2:36164/master/api/v1/scheduler</a>
../../src/tests/default_executor_tests.cpp:583: Failure
Value of: taskStates
  Actual: { (df99cc50-9b0f-4692-afc9-d587c3515a67, TASK_KILLED), 
(e72d5139-0a11-48af-9d43-d4163c1404ee, TASK_FAILED) }
Expected: expectedTaskStates
Which is: { (df99cc50-9b0f-4692-afc9-d587c3515a67, TASK_RUNNING), 
(e72d5139-0a11-48af-9d43-d4163c1404ee, TASK_RUNNING) }
...
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to