[ https://issues.apache.org/jira/browse/MESOS-7742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16329006#comment-16329006 ]
Andrei Budnik commented on MESOS-7742: -------------------------------------- These patches ^^ are fixing the first cause described in the [first patch|https://reviews.apache.org/r/65122/]. There is a second cause when an attempt to connect to IO-Switchboard fails with: {code:java} I1109 23:47:25.016929 27803 process.cpp:3982] Failed to process request for '/slave(812)/api/v1': Failed to connect to /tmp/mesos-io-switchboard-56bcba4b-6e81-4aeb-a0e9-41309ec991b5: Connection refused W1109 23:47:25.017009 27803 http.cpp:2944] Failed to attach to nested container 7ab572dd-78b5-4186-93af-7ac011990f80.b77944da-f1d5-4694-a51b-8fde150c5f7a: Failed to connect to /tmp/mesos-io-switchboard-56bcba4b-6e81-4aeb-a0e9-41309ec991b5: Connection refused I1109 23:47:25.017063 27803 process.cpp:1590] Returning '500 Internal Server Error' for '/slave(812)/api/v1' (Failed to connect to /tmp/mesos-io-switchboard-56bcba4b-6e81-4aeb-a0e9-41309ec991b5: Connection refused) {code} The reason for this failure needs to be investigated. > ContentType/AgentAPIStreamingTest.AttachInputToNestedContainerSession is flaky > ------------------------------------------------------------------------------ > > Key: MESOS-7742 > URL: https://issues.apache.org/jira/browse/MESOS-7742 > Project: Mesos > Issue Type: Bug > Components: agent > Affects Versions: 1.5.0 > Reporter: Vinod Kone > Assignee: Andrei Budnik > Priority: Major > Labels: flaky-test, mesosphere-oncall > Fix For: 1.6.0 > > Attachments: AgentAPITest.LaunchNestedContainerSession-badrun.txt, > LaunchNestedContainerSessionDisconnected-badrun.txt > > > Observed this on ASF CI and internal Mesosphere CI. Affected tests: > {noformat} > AgentAPIStreamingTest.AttachInputToNestedContainerSession > AgentAPITest.LaunchNestedContainerSession > AgentAPITest.AttachContainerInputAuthorization/0 > AgentAPITest.LaunchNestedContainerSessionWithTTY/0 > AgentAPITest.LaunchNestedContainerSessionDisconnected/1 > {noformat} > This issue comes at least in three different flavours. Take > {{AgentAPIStreamingTest.AttachInputToNestedContainerSession}} as an example. > h5. Flavour 1 > {noformat} > ../../src/tests/api_tests.cpp:6473 > Value of: (response).get().status > Actual: "503 Service Unavailable" > Expected: http::OK().status > Which is: "200 OK" > Body: "" > {noformat} > h5. Flavour 2 > {noformat} > ../../src/tests/api_tests.cpp:6473 > Value of: (response).get().status > Actual: "500 Internal Server Error" > Expected: http::OK().status > Which is: "200 OK" > Body: "Disconnected" > {noformat} > h5. Flavour 3 > {noformat} > /home/ubuntu/workspace/mesos/Mesos_CI-build/FLAG/CMake/label/mesos-ec2-ubuntu-16.04/mesos/src/tests/api_tests.cpp:6367 > Value of: (sessionResponse).get().status > Actual: "500 Internal Server Error" > Expected: http::OK().status > Which is: "200 OK" > Body: "" > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)