Tomas Barton created MESOS-9969:
-----------------------------------

             Summary: Agent crashes when trying to clean up volume
                 Key: MESOS-9969
                 URL: https://issues.apache.org/jira/browse/MESOS-9969
             Project: Mesos
          Issue Type: Bug
          Components: agent
    Affects Versions: 1.8.2
            Reporter: Tomas Barton

{code}
Sep 17 13:49:26 w03 mesos-agent[21803]: I0917 13:49:26.081748 21828 linux_launcher.cpp:650] Destroying cgroup '/sys/fs/cgroup/systemd/mesos/370ed262-4041-4180-a7e1-9ea78070e3a6'
Sep 17 13:49:26 w03 mesos-agent[21803]: I0917 13:49:26.081876 21832 containerizer.cpp:2907] Checkpointing termination state to nested container's runtime directory '/var/run/mesos/containers/8e3997e7-c53a-4043-9a7e-26a2e436a041/containers/ae0bdc6d-c738-4352-b5d4-7572182671d5/termination'
Sep 17 13:49:26 w03 mesos-agent[21803]: mesos-agent: /pkg/src/mesos/3rdparty/stout/include/stout/option.hpp:120: T& Option<T>::get() & [with T = std::basic_string<char>]: Assertion `isSome()' failed.
Sep 17 13:49:26 w03 mesos-agent[21803]: *** Aborted at 1568728166 (unix time) try "date -d @1568728166" if you are using GNU date ***
Sep 17 13:49:26 w03 mesos-agent[21803]: W0917 13:49:26.082281 21835 disk.cpp:453] Ignoring cleanup for unknown container a9ba6959-ea02-4543-b7d5-92a639aaaa40
Sep 17 13:49:26 w03 mesos-agent[21803]: PC: @ 0x7f16a3867fff (unknown)
Sep 17 13:49:26 w03 mesos-agent[21803]: *** SIGABRT (@0x552b) received by PID 21803 (TID 0x7f169e47d700) from PID 21803; stack trace: ***
Sep 17 13:49:26 w03 mesos-agent[21803]: E0917 13:49:26.082608 21835 memory.cpp:501] Listening on OOM events failed for container a9ba6959-ea02-4543-b7d5-92a639aaaa40: Event listener is terminating
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a3be50e0 (unknown)
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a3867fff (unknown)
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a386942a (unknown)
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a3860e67 (unknown)
Sep 17 13:49:26 w03 mesos-agent[21803]: I0917 13:49:26.083741 21835 linux.cpp:1074] Unmounting volume '/var/lib/mesos/slave/slaves/04e596b7-f03d-4cba-bbbc-fa9e0aebb5d2-S17/frameworks/04e596b7-f03d-4cba-bbbc-fa9e0aebb5d2-0003/executors/es01__coordinator__8591ac8e-3d9d-45ac-bb68-bee379c8c4a4/runs/a9ba6959-ea02-4543-b7d5-92a639aaaa40/container-path' for con
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a3860f12 (unknown)
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a7654f13 _ZNR6OptionISsE3getEv.part.152
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a7666b2f mesos::internal::slave::MesosContainerizerProcess::______destroy()
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a861cb41 process::ProcessBase::consume()
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a8633c9c process::ProcessManager::resume()
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a86398a6 _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUlvE_vEEE6_M_runEv
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a43c6200 (unknown)
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a3bdb4a4 start_thread
Sep 17 13:49:26 w03 mesos-agent[21803]: @ 0x7f16a391dd0f (unknown)
Sep 17 13:49:26 w03 systemd[1]: dcos-mesos-slave.service: Main process exited, code=killed, status=6/ABRT
{code}

--
This message was sent by Atlassian Jira
(v8.3.2#803003)