Henry Saputra created FLINK-2430: ------------------------------------ Summary: Potential race condition when restart all is called for a Twill runnable Key: FLINK-2430 URL: https://issues.apache.org/jira/browse/FLINK-2430 Project: Flink Issue Type: Bug Affects Versions: 0.6-incubating Reporter: Henry Saputra
When sending restart instance to all for a particular TwillRunnable, it could have race condition where the heartbeat thread run right after all containers have been released which make the check: // Looks for containers requests. if (provisioning.isEmpty() && runnableContainerRequests.isEmpty() && runningContainers.isEmpty()) { LOG.info("All containers completed. Shutting down application master."); break; } This could happen when all running containers are empty and new runnableContainerRequests has not been added. -- This message was sent by Atlassian JIRA (v6.3.4#6332)