[ 
https://issues.apache.org/jira/browse/MESOS-6784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15755184#comment-15755184
 ] 

Anand Mazumdar commented on MESOS-6784:
---------------------------------------

Committed a fix for the second log snippet that Jie posted around the test bug. 

{noformat}
commit 28eaa8df7c95130b0c244f7613ad506be899cafd
Author: Anand Mazumdar <an...@apache.org>
Date:   Wed Dec 14 17:40:47 2016 -0800

    Fixed the 'IOSwitchboardTest.KillSwitchboardContainerDestroyed' test.

    The container was launched with TTY enabled. This meant that
    killing the switchboard would trigger the task to terminate
    on its own owing to the "master" end of the TTY dying. This
    would make it not go through the code path of the isolator
    failing due to resource limit issue.

    Review: https://reviews.apache.org/r/54770
{noformat}

The original log in the issue description is a separate issue in the 
switchboard code itself and I am working on that. This should make the CI green 
for now.

> IOSwitchboardTest.KillSwitchboardContainerDestroyed is flaky
> ------------------------------------------------------------
>
>                 Key: MESOS-6784
>                 URL: https://issues.apache.org/jira/browse/MESOS-6784
>             Project: Mesos
>          Issue Type: Bug
>          Components: agent
>            Reporter: Neil Conway
>            Assignee: Anand Mazumdar
>            Priority: Blocker
>              Labels: mesosphere
>
> {noformat}
> [ RUN      ] IOSwitchboardTest.KillSwitchboardContainerDestroyed
> I1212 13:57:02.641043  2211 containerizer.cpp:220] Using isolation: 
> posix/cpu,filesystem/posix,network/cni
> W1212 13:57:02.641438  2211 backend.cpp:76] Failed to create 'overlay' 
> backend: OverlayBackend requires root privileges, but is running as user nrc
> W1212 13:57:02.641559  2211 backend.cpp:76] Failed to create 'bind' backend: 
> BindBackend requires root privileges
> I1212 13:57:02.642822  2268 containerizer.cpp:594] Recovering containerizer
> I1212 13:57:02.643975  2253 provisioner.cpp:253] Provisioner recovery complete
> I1212 13:57:02.644953  2255 containerizer.cpp:986] Starting container 
> 09e87380-00ab-4987-83c9-fa1c5d86717f for executor 'executor' of framework
> I1212 13:57:02.647004  2245 switchboard.cpp:430] Allocated pseudo terminal 
> '/dev/pts/54' for container 09e87380-00ab-4987-83c9-fa1c5d86717f
> I1212 13:57:02.652305  2245 switchboard.cpp:596] Created I/O switchboard 
> server (pid: 2705) listening on socket file 
> '/tmp/mesos-io-switchboard-b4af1c92-6633-44f3-9d35-e0e36edaf70a' for 
> container 09e87380-00ab-4987-83c9-fa1c5d86717f
> I1212 13:57:02.655513  2267 launcher.cpp:133] Forked child with pid '2706' 
> for container '09e87380-00ab-4987-83c9-fa1c5d86717f'
> I1212 13:57:02.655732  2267 containerizer.cpp:1621] Checkpointing container's 
> forked pid 2706 to 
> '/tmp/IOSwitchboardTest_KillSwitchboardContainerDestroyed_Me5CRx/meta/slaves/frameworks/executors/executor/runs/09e87380-00ab-4987-83c9-fa1c5d86717f/pids/forked.pid'
> I1212 13:57:02.726306  2265 containerizer.cpp:2463] Container 
> 09e87380-00ab-4987-83c9-fa1c5d86717f has exited
> I1212 13:57:02.726352  2265 containerizer.cpp:2100] Destroying container 
> 09e87380-00ab-4987-83c9-fa1c5d86717f in RUNNING state
> E1212 13:57:02.726495  2243 switchboard.cpp:861] Unexpected termination of 
> I/O switchboard server: 'IOSwitchboard' exited with signal: Killed for 
> container 09e87380-00ab-4987-83c9-fa1c5d86717f
> I1212 13:57:02.726563  2265 launcher.cpp:149] Asked to destroy container 
> 09e87380-00ab-4987-83c9-fa1c5d86717f
> E1212 13:57:02.783607  2228 switchboard.cpp:799] Failed to remove unix domain 
> socket file '/tmp/mesos-io-switchboard-b4af1c92-6633-44f3-9d35-e0e36edaf70a' 
> for container '09e87380-00ab-4987-83c9-fa1c5d86717f': No such file or 
> directory
> ../../mesos/src/tests/containerizer/io_switchboard_tests.cpp:661: Failure
> Value of: wait.get()->reasons().size() == 1
>   Actual: false
> Expected: true
> *** Aborted at 1481579822 (unix time) try "date -d @1481579822" if you are 
> using GNU date ***
> PC: @          0x1bf16d0 testing::UnitTest::AddTestPartResult()
> *** SIGSEGV (@0x0) received by PID 2211 (TID 0x7faed7d078c0) from PID 0; 
> stack trace: ***
>     @     0x7faecf855100 (unknown)
>     @          0x1bf16d0 testing::UnitTest::AddTestPartResult()
>     @          0x1be6247 testing::internal::AssertHelper::operator=()
>     @          0x19ed751 
> mesos::internal::tests::IOSwitchboardTest_KillSwitchboardContainerDestroyed_Test::TestBody()
>     @          0x1c0ed8c 
> testing::internal::HandleSehExceptionsInMethodIfSupported<>()
>     @          0x1c09e74 
> testing::internal::HandleExceptionsInMethodIfSupported<>()
>     @          0x1beb505 testing::Test::Run()
>     @          0x1bebc88 testing::TestInfo::Run()
>     @          0x1bec2ce testing::TestCase::Run()
>     @          0x1bf2ba8 testing::internal::UnitTestImpl::RunAllTests()
>     @          0x1c0f9b1 
> testing::internal::HandleSehExceptionsInMethodIfSupported<>()
>     @          0x1c0a9f2 
> testing::internal::HandleExceptionsInMethodIfSupported<>()
>     @          0x1bf18ee testing::UnitTest::Run()
>     @          0x11bc9e3 RUN_ALL_TESTS()
>     @          0x11bc599 main
>     @     0x7faece663b15 __libc_start_main
>     @           0xa9c219 (unknown)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to