Satish Duggana updated STORM-1670:
----------------------------------
Description:
LocalState#get(String) can throw FileNotFoundException, which may cause an
exception in supervisor.clj#sync-processes. The supervisor then stops assigning
new workers/assignments and goes down later.
VersionedStore#mostRecentVersionPath() can return the path of a version for
which only the .version file exists. The data file for that version may be
missing because an earlier cleanup deleted the data file but left the
.version file behind.
was:
LocalState#get(String) can throw FileNotFoundException which results in not
removing worker heartbeats and it throws Exception in
supervisor.clj#sync-processes which stops assigning new workers/assignments.
VersionedStore#mostRecentVersionPath() can return a file only with suffix of
.version but the original file for a specific version may not have been there
because .version suffix was not deleted but respective data file may have been
deleted in earlier cleanups.
> LocalState#get(String) can throw FileNotFoundException which results in not
> removing worker heartbeats and supervisor is kind of stuck.
> ---------------------------------------------------------------------------------------------------------------------------------------
>
> Key: STORM-1670
> URL: https://issues.apache.org/jira/browse/STORM-1670
> Project: Apache Storm
> Issue Type: Bug
> Components: storm-core
> Affects Versions: 1.0.0, 2.0.0
> Reporter: Satish Duggana
> Assignee: Satish Duggana
> Priority: Critical
>
> LocalState#get(String) can throw FileNotFoundException, which may cause an
> exception in supervisor.clj#sync-processes. The supervisor then stops
> assigning new workers/assignments and goes down later.
> VersionedStore#mostRecentVersionPath() can return the path of a version for
> which only the .version file exists. The data file for that version may be
> missing because an earlier cleanup deleted the data file but left the
> .version file behind.
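
One possible way to guard against the stale .version case described above is
to skip any version whose data file is missing when picking the most recent
one. The sketch below is illustrative only and is not the actual Storm code or
the fix applied for this issue. The class name VersionLookupSketch and the
method mostRecentUsableVersionPath are hypothetical, and the on-disk layout (a
data file named <version> next to a marker file named <version>.version, where
<version> is a number) is an assumption.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not part of Storm. Assumes each version is stored as
// a data file "<root>/<n>" plus a marker file "<root>/<n>.version".
class VersionLookupSketch {
    static final String MARKER_SUFFIX = ".version";

    /**
     * Returns the absolute path of the newest version whose data file still
     * exists, or null if no such version is found.
     */
    static String mostRecentUsableVersionPath(File root) {
        File[] entries = root.listFiles();
        if (entries == null) {
            return null; // root is missing or is not a directory
        }
        List<Long> versions = new ArrayList<>();
        for (File f : entries) {
            String name = f.getName();
            if (name.endsWith(MARKER_SUFFIX)) {
                String prefix = name.substring(0, name.length() - MARKER_SUFFIX.length());
                try {
                    versions.add(Long.parseLong(prefix));
                } catch (NumberFormatException ignored) {
                    // Not a numeric version marker; skip it.
                }
            }
        }
        // Newest version first.
        Collections.sort(versions, Collections.reverseOrder());
        for (Long v : versions) {
            File data = new File(root, Long.toString(v));
            if (data.exists()) {
                return data.getAbsolutePath();
            }
            // Marker exists but the data file is gone: stale entry, try older.
        }
        return null;
    }
}
```

With this approach a caller would get null (or an older, intact version)
instead of a path that later triggers FileNotFoundException on read.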