Hi all,
In the document, I got that slider will try to recover on container
failure. But in my test application it doesn't
I'm using the 0.40 release. build from source
Here is what I get.
*If I kill the child process *
The agent check_process_status method will raise ComponentIsNotRunning and
*If I kill the child process *
So are you observing that a replacement container instance is never
started? Can you check the AM log? What is expected to happen here is as
follows:
- process goes down, and then agent goes down, AM notices, asks for a new
container from Yarn.
if you want the agent