Unfortunately I don't have the log right now. IIRC the executor received
the `KILL` event because the log I saw contained this line:
https://github.com/apache/mesos/blob/7e11a2d39cc642944897d2480105dbfd860fa601/src/launcher/default_executor.cpp#L1236
But it didn't contain this line:

On Wed, May 16, 2018 at 6:18 PM, Vinod Kone <vi...@mesosphere.io> wrote:

> Can you paste some logs here too if you have?
>
> On Wed, May 16, 2018 at 5:53 PM, Chun-Hung Hsiao (JIRA) <j...@apache.org>
> wrote:
>
> >
> >     [ https://issues.apache.org/jira/browse/MESOS-8927?page=
> > com.atlassian.jira.plugin.system.issuetabpanels:comment-
> > tabpanel&focusedCommentId=16478318#comment-16478318 ]
> >
> > Chun-Hung Hsiao commented on MESOS-8927:
> > ----------------------------------------
> >
> > I'd like to add some notes here. This problem is actually nontrivial,
> > because AFAIK we don't have a reliable way to kill a container at any
> state.
> >
> > > Default executor cannot kill tasks if `LAUNCH_NESTED_CONTAINER` is
> stuck.
> > > ------------------------------------------------------------
> > -------------
> > >
> > >                 Key: MESOS-8927
> > >                 URL: https://issues.apache.org/jira/browse/MESOS-8927
> > >             Project: Mesos
> > >          Issue Type: Bug
> > >          Components: executor
> > >    Affects Versions: 1.5.1, 1.6.0
> > >            Reporter: Chun-Hung Hsiao
> > >            Priority: Critical
> > >              Labels: default-executor, mesosphere
> > >
> > > In the default executor, if the {{LAUNCH_NESTED_CONTAINER}} call never
> > returns, {{container->launched}} won't be set, so a follow-up {{KILL}}
> > event will be ignored:
> > >  [https://github.com/apache/mesos/blob/40b40d9b73221388e583fc140280f1
> > eb2b48b832/src/launcher/default_executor.cpp#L1091]
> > > This could lead to tasks stuck in {{TASK_STARTING}}.
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v7.6.3#76005)
> >
>

Reply via email to