[ 
https://issues.apache.org/jira/browse/YARN-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14004166#comment-14004166
 ] 

Ming Ma commented on YARN-1897:
-------------------------------

Chatted with Gera offline. The definition of SignalContainer* APIs is needed 
for other subtasks including YARN-1515. So we will resolve SignalContainer* 
APIs issues in this jira. After it is done, other subtasks can continue. Here 
are couple open issues.

1.  Support for a list of containers. The latest patch in this jira just 
supports a flat list of signalContainerRequest, regardless if they are from the 
same containers or not. Gera's patch in YARN-1515 groups all commands under the 
same container together via signalContainerRequest.getSignals(). Either 
approach works. I don't have strong preference either way given the most common 
use case is for single container; although signalContainers is more consistent 
with startContainers.

2. Support for "SIGTERM + delay + SIGKILL" used in stopContainers. Latest 
YARN-1515 introduces "Pause" method so that containers can pause in between 
signals. We need something like that to support YARN-1515 scenario. Or we can 
provide some new SignalContainerCommand like "sleep".

Really appreciate any comments on this.

> Define SignalContainerRequest and SignalContainerResponse
> ---------------------------------------------------------
>
>                 Key: YARN-1897
>                 URL: https://issues.apache.org/jira/browse/YARN-1897
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: api
>            Reporter: Ming Ma
>            Assignee: Ming Ma
>         Attachments: YARN-1897-2.patch, YARN-1897-3.patch, YARN-1897-4.patch, 
> YARN-1897.1.patch
>
>
> We need to define SignalContainerRequest and SignalContainerResponse first as 
> they are needed by other sub tasks. SignalContainerRequest should use 
> OS-independent commands and provide a way to application to specify "reason" 
> for diagnosis. SignalContainerResponse might be empty.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to