[
https://issues.apache.org/jira/browse/STORM-532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14172053#comment-14172053
]
ASF GitHub Bot commented on STORM-532:
--------------------------------------
GitHub user caofangkun opened a pull request:
https://github.com/apache/storm/pull/293
STORM-532,Supervisor should restart worker immediately, if the worker pr...
https://issues.apache.org/jira/browse/STORM-532
For now
if the worker process does not exist any more
Supervisor will have to wait a few seconds for worker heartbeart timeout
and restart worker .
If supervisor knows the worker processid and check if the process exists in
the sync-processes thread ,may need less time to restart worker.
1: record worker process id in the worker local heartbeart
2: in supervisor sync-processes ,get process id from worker local heartbeat
and check if the process exits
3: if not restart it immediately
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/caofangkun/incubator-storm master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/storm/pull/293.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #293
----
commit ab9ae75fe3f3bbf070d05b5b165af04152a5933d
Author: caokun <caokun@caokun-virtualbox.(none)>
Date: 2014-10-15T01:12:11Z
STORM-532,Supervisor should restart worker immediately, if the worker
process does not exist any more
----
> Supervisor should restart worker immediately, if the worker process does not
> exist any more
> --------------------------------------------------------------------------------------------
>
> Key: STORM-532
> URL: https://issues.apache.org/jira/browse/STORM-532
> Project: Apache Storm
> Issue Type: Improvement
> Affects Versions: 0.10.0
> Reporter: caofangkun
> Priority: Minor
>
> For now
> if the worker process does not exist any more
> Supervisor will have to wait a few seconds for worker heartbeart timeout and
> restart worker .
> If supervisor knows the worker processid and check if the process exists in
> the sync-processes thread ,may need less time to restart worker.
> 1: record worker process id in the worker local heartbeart
> 2: in supervisor sync-processes ,get process id from worker local heartbeat
> and check if the process exits
> 3: if not restart it immediately
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)