[ https://issues.apache.org/jira/browse/MESOS-8871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16462355#comment-16462355 ]
Andrei Budnik commented on MESOS-8871: -------------------------------------- This issue has been reproduced once in our internal testing cluster. Fixed by removing '/var/lib/mesos/slave/store/docker/storedImages' file. > Agent may fail to recover if the agent dies before image store cache > checkpointed. > ---------------------------------------------------------------------------------- > > Key: MESOS-8871 > URL: https://issues.apache.org/jira/browse/MESOS-8871 > Project: Mesos > Issue Type: Improvement > Components: agent > Reporter: Gilbert Song > Priority: Major > Labels: mesosphere, slave > > {noformat} > E0502 13:51:45.398555 10100 slave.cpp:7305] EXIT with status 1: Failed to > perform recovery: Collect failed: Collect failed: Collect failed: Unexpected > empty images file '/var/lib/mesos/slave/store/docker/storedImages' > {noformat} > This may happen if the agent dies after the file is created but before the > contents are persisted on disk. -- This message was sent by Atlassian JIRA (v7.6.3#76005)