[ 
https://issues.apache.org/jira/browse/MESOS-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13978387#comment-13978387
 ] 

Ian Downes commented on MESOS-1222:
-----------------------------------

The root cause of all test failures described here was because libev was 
waiting on all children (patch was not applied, see MESOS-1229). 
MesosContainerizer was unable to get the exit status from mesos-fetcher and 
failed to launch containers leading to tests failing in different ways.

> Turning on GTEST_SHUFFLE on the master branch triggers many failures in 
> SlaveRecoveryTest/0.*
> ---------------------------------------------------------------------------------------------
>
>                 Key: MESOS-1222
>                 URL: https://issues.apache.org/jira/browse/MESOS-1222
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Chi Zhang
>            Assignee: Ian Downes
>         Attachments: gmock_warning.tar.gz, 
> mock_function_called_more_times.tar.gz, 
> registerExecutor_or_status_or_shutdownFramework_msg_lost.tar.gz, 
> task_lost.tar.gz
>
>
> I pulled from the master branch a moment ago and used the following command 
> to run recovery tests.
> MESOS_VERBOSE=1 GLOG_v=1 ./bin/mesos-tests.sh 
> --gtest_filter='SlaveRecoveryTest/0.*' --gtest_shuffle --gtest_repeat=50 
> --gtest_break_on_failure 2>&1 | tee /tmp/recover_apache.log.$$
> Ran it 12 times. The errors seemed to show up pretty randomly. I roughly put 
> them into 4 categories and attached separately.
> I think the 3 patches from Ian are likely very useful: 20428 20429 20430.
> Also, would including GTEST_SHUFFLE as default increase our confidence for 
> make check?



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to