[ https://issues.apache.org/jira/browse/MESOS-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16744904#comment-16744904 ]
Benjamin Bannier commented on MESOS-9130: ----------------------------------------- Reopening as above fix introduced another flakiness. Review: https://reviews.apache.org/r/69781/ > Test `StorageLocalResourceProviderTest.ROOT_ContainerTerminationMetric` is > flaky. > --------------------------------------------------------------------------------- > > Key: MESOS-9130 > URL: https://issues.apache.org/jira/browse/MESOS-9130 > Project: Mesos > Issue Type: Bug > Components: resource provider, storage > Affects Versions: 1.6.0, 1.7.0 > Reporter: Chun-Hung Hsiao > Assignee: Benjamin Bannier > Priority: Major > Labels: mesosphere, storage > Fix For: 1.8.0 > > Attachments: test.log > > > This test is flaky and can fail with the following error: > {noformat} > ../../src/tests/storage_local_resource_provider_tests.cpp:3167 > Failed to wait 15secs for pluginRestarted{noformat} > The actual error is the following: > {noformat} > E0802 22:13:37.265038 8216 provider.cpp:1496] Failed to reconcile resource > provider b9379982-d990-4f63-8a5b-10edd4f5a1bb: Collect failed: OS > Error{noformat} > The root cause is that the SLRP calls {{ListVolumes}} and {{GetCapacity}} > when starting up, and if the plugin container is killed when these calls are > ongoing, gRPC will return an {{OS Error}} which will lead the SLRP to fail. > This flakiness will be fixed once we finish > https://issues.apache.org/jira/browse/MESOS-8400. -- This message was sent by Atlassian JIRA (v7.6.3#76005)