Dmytro Sen created AMBARI-12548:
-----------------------------------

             Summary: Ambari Agent does not register / heartbeat if "df" 
command hangs
                 Key: AMBARI-12548
                 URL: https://issues.apache.org/jira/browse/AMBARI-12548
             Project: Ambari
          Issue Type: Bug
          Components: ambari-agent
    Affects Versions: 2.1.0
            Reporter: Dmytro Sen
            Assignee: Dmytro Sen
            Priority: Critical
             Fix For: 2.1.1


The agent executes the "df" command when it starts up.
In rare scenarios, this command can hang (can even hanging for several days 
without any timeout). When this happens the agent won't move forward to the 
process of registration, etc.
There's an ambari-server property called agent.check.mounts.timeout. This can 
be defined to force a timeout.
However, we should always have a default timeout on the agent script side, even 
if this property is not defined to avoid a situation where the agent is 
completely hung.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to