Github user serranom commented on the issue: https://github.com/apache/twill/pull/23 Also, I noticed the test failure and I'm confused about it. It seems to be something intermittent because it succeeds in some cases. When it fails, the AM doesn't restart the containers: 2017-01-17T23:43:19,872Z WARN o.a.t.i.a.RunningContainers [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] RunningContainers:handleCompleted(RunningContainers.java:476) - Container container_1484695869902_0027_01_000006 exited abnormally with state COMPLETE, exit code 1. 2017-01-17T23:43:19,872Z INFO o.a.t.i.a.RunningContainers [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] RunningContainers:shouldRetry(RunningContainers.java:518) - 5 of 6 retries for runnable FailingServer. 2017-01-17T23:43:19,873Z INFO o.a.t.i.a.RunningContainers [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] RunningContainers:handleCompleted(RunningContainers.java:479) - Re-request the container container_1484695869902_0027_01_000006 for exit code 1. 2017-01-17T23:43:19,873Z INFO o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:handleCompleted(ApplicationMasterService.java:502) - Re-request container for FailingServer with 1 instances. 2017-01-17T23:43:19,873Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:19,873Z INFO o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:444) - Relaxing provisioning constraints for request f30d39fe-2d65-493d-b670-8b42192bf322 2017-01-17T23:43:20,875Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:21,878Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:22,880Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:23,882Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:24,884Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:24,885Z INFO o.a.t.i.LogOnlyEventHandler [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] LogOnlyEventHandler:launchTimeout(LogOnlyEventHandler.java:36) - Requested 2 containers for runnable FailingServer, only got 0 after 5012 ms. 2017-01-17T23:43:25,888Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:26,890Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 2017-01-17T23:43:27,892Z DEBUG o.a.t.i.a.ApplicationMasterService [testing-docker-7dddd7e0-951d-4746-842a-bd7d0e7c25ce] [ApplicationMasterService] ApplicationMasterService:doRun(ApplicationMasterService.java:406) - Runnable container requests: 3 Any advice on debugging this?
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---