[ https://issues.apache.org/jira/browse/MESOS-6784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15755184#comment-15755184 ]
Anand Mazumdar commented on MESOS-6784: --------------------------------------- Committed a fix for the second log snippet that Jie posted around the test bug. {noformat} commit 28eaa8df7c95130b0c244f7613ad506be899cafd Author: Anand Mazumdar <an...@apache.org> Date: Wed Dec 14 17:40:47 2016 -0800 Fixed the 'IOSwitchboardTest.KillSwitchboardContainerDestroyed' test. The container was launched with TTY enabled. This meant that killing the switchboard would trigger the task to terminate on its own owing to the "master" end of the TTY dying. This would make it not go through the code path of the isolator failing due to resource limit issue. Review: https://reviews.apache.org/r/54770 {noformat} The original log in the issue description is a separate issue in the switchboard code itself and I am working on that. This should make the CI green for now. > IOSwitchboardTest.KillSwitchboardContainerDestroyed is flaky > ------------------------------------------------------------ > > Key: MESOS-6784 > URL: https://issues.apache.org/jira/browse/MESOS-6784 > Project: Mesos > Issue Type: Bug > Components: agent > Reporter: Neil Conway > Assignee: Anand Mazumdar > Priority: Blocker > Labels: mesosphere > > {noformat} > [ RUN ] IOSwitchboardTest.KillSwitchboardContainerDestroyed > I1212 13:57:02.641043 2211 containerizer.cpp:220] Using isolation: > posix/cpu,filesystem/posix,network/cni > W1212 13:57:02.641438 2211 backend.cpp:76] Failed to create 'overlay' > backend: OverlayBackend requires root privileges, but is running as user nrc > W1212 13:57:02.641559 2211 backend.cpp:76] Failed to create 'bind' backend: > BindBackend requires root privileges > I1212 13:57:02.642822 2268 containerizer.cpp:594] Recovering containerizer > I1212 13:57:02.643975 2253 provisioner.cpp:253] Provisioner recovery complete > I1212 13:57:02.644953 2255 containerizer.cpp:986] Starting container > 09e87380-00ab-4987-83c9-fa1c5d86717f for executor 'executor' of framework > I1212 13:57:02.647004 2245 switchboard.cpp:430] Allocated pseudo terminal > '/dev/pts/54' for container 09e87380-00ab-4987-83c9-fa1c5d86717f > I1212 13:57:02.652305 2245 switchboard.cpp:596] Created I/O switchboard > server (pid: 2705) listening on socket file > '/tmp/mesos-io-switchboard-b4af1c92-6633-44f3-9d35-e0e36edaf70a' for > container 09e87380-00ab-4987-83c9-fa1c5d86717f > I1212 13:57:02.655513 2267 launcher.cpp:133] Forked child with pid '2706' > for container '09e87380-00ab-4987-83c9-fa1c5d86717f' > I1212 13:57:02.655732 2267 containerizer.cpp:1621] Checkpointing container's > forked pid 2706 to > '/tmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_Me5CRx/meta/slaves/frameworks/executors/executor/runs/09e87380-00ab-4987-83c9-fa1c5d86717f/pids/forked.pid' > I1212 13:57:02.726306 2265 containerizer.cpp:2463] Container > 09e87380-00ab-4987-83c9-fa1c5d86717f has exited > I1212 13:57:02.726352 2265 containerizer.cpp:2100] Destroying container > 09e87380-00ab-4987-83c9-fa1c5d86717f in RUNNING state > E1212 13:57:02.726495 2243 switchboard.cpp:861] Unexpected termination of > I/O switchboard server: 'IOSwitchboard' exited with signal: Killed for > container 09e87380-00ab-4987-83c9-fa1c5d86717f > I1212 13:57:02.726563 2265 launcher.cpp:149] Asked to destroy container > 09e87380-00ab-4987-83c9-fa1c5d86717f > E1212 13:57:02.783607 2228 switchboard.cpp:799] Failed to remove unix domain > socket file '/tmp/mesos-io-switchboard-b4af1c92-6633-44f3-9d35-e0e36edaf70a' > for container '09e87380-00ab-4987-83c9-fa1c5d86717f': No such file or > directory > ../../mesos/src/tests/containerizer/io_switchboard_tests.cpp:661: Failure > Value of: wait.get()->reasons().size() == 1 > Actual: false > Expected: true > *** Aborted at 1481579822 (unix time) try "date -d @1481579822" if you are > using GNU date *** > PC: @ 0x1bf16d0 testing::UnitTest::AddTestPartResult() > *** SIGSEGV (@0x0) received by PID 2211 (TID 0x7faed7d078c0) from PID 0; > stack trace: *** > @ 0x7faecf855100 (unknown) > @ 0x1bf16d0 testing::UnitTest::AddTestPartResult() > @ 0x1be6247 testing::internal::AssertHelper::operator=() > @ 0x19ed751 > mesos::internal::tests::IOSwitchboardTest_KillSwitchboardContainerDestroyed_Test::TestBody() > @ 0x1c0ed8c > testing::internal::HandleSehExceptionsInMethodIfSupported<>() > @ 0x1c09e74 > testing::internal::HandleExceptionsInMethodIfSupported<>() > @ 0x1beb505 testing::Test::Run() > @ 0x1bebc88 testing::TestInfo::Run() > @ 0x1bec2ce testing::TestCase::Run() > @ 0x1bf2ba8 testing::internal::UnitTestImpl::RunAllTests() > @ 0x1c0f9b1 > testing::internal::HandleSehExceptionsInMethodIfSupported<>() > @ 0x1c0a9f2 > testing::internal::HandleExceptionsInMethodIfSupported<>() > @ 0x1bf18ee testing::UnitTest::Run() > @ 0x11bc9e3 RUN_ALL_TESTS() > @ 0x11bc599 main > @ 0x7faece663b15 __libc_start_main > @ 0xa9c219 (unknown) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)