JaySenSharma created AMBARI-18064:
-------------------------------------

             Summary: Decrease the number of retry count for 
check_ranger_login_urllib2
                 Key: AMBARI-18064
                 URL: https://issues.apache.org/jira/browse/AMBARI-18064
             Project: Ambari
          Issue Type: Bug
          Components: ambari-agent
    Affects Versions: trunk
         Environment: All
            Reporter: JaySenSharma


If the Ranger Admin is down then while starting any service from Ambari it 
keeps retrying  75 times in the interval of 8 seconds (total 600 Seconds , 
Means 10 minutes) and then it finally starts the service like Kafka Broker 
service.

Following kind of logging we can see in the ambari console when the Ranger 
Admin is Down and when the kafka broker start request is triggered (Attaching 
the "/var/lib/ambari-agent/data/output-297.txt" log):

Snippet of the retry attempts:
{code}
2016-08-08 13:45:27,802 - HdfsResource[None] {'security_enabled': False, 
'hadoop_bin_dir': '/usr/hdp/current/hadoop-client/bin', 'keytab': [EMPTY], 
'default_fs': 'hdfs://jss1.example.com:8020', 'hdfs_resource_ignore_file': 
'/var/lib/ambari-agent/data/.hdfs_resource_ignore', 'hdfs_site': ..., 
'kinit_path_local': 'kinit', 'principal_name': [EMPTY], 'user': 'hdfs', 
'action': ['execute'], 'hadoop_conf_dir': 
'/usr/hdp/current/hadoop-client/conf', 'immutable_paths': 
[u'/apps/hive/warehouse', u'/mr-history/done', u'/app-logs', u'/tmp']}
2016-08-08 13:45:27,853 - RangeradminV2: Skip ranger admin if it's down !
2016-08-08 13:45:27,858 - Will retry 74 time(s), caught exception: Connection 
failed to Ranger Admin. Reason - [Errno 111] Connection refused.. Sleeping for 
8 sec(s)
2016-08-08 13:45:35,869 - Will retry 73 time(s), caught exception: Connection 
failed to Ranger Admin. Reason - [Errno 111] Connection refused.. Sleeping for 
8 sec(s)
.
.
.
2016-08-08 13:55:04,653 - Will retry 2 time(s), caught exception: Connection 
failed to Ranger Admin. Reason - [Errno 111] Connection refused.. Sleeping for 
8 sec(s)
2016-08-08 13:55:12,665 - Will retry 1 time(s), caught exception: Connection 
failed to Ranger Admin. Reason - [Errno 111] Connection refused.. Sleeping for 
8 sec(s)
2016-08-08 13:55:20,676 - Connection failed to Ranger Admin. Reason - [Errno 
111] Connection refused.
2016-08-08 13:55:20,683 - 
File['/usr/hdp/current/kafka-broker/config/ranger-security.xml'] {'content': 
InlineTemplate(...), 'owner': 'kafka', 'group': 'hadoop', 'mode': 0644}
{code}


*What is Needed?*
Here we see that it is not worth to wait for 600 Seconds (10 Minutes) to retry 
and then start the service (kafka broker Or any other component).  Instead it 
can be reduced retry attempts to 15 times instead of trying 75 times.

*What was previous behavior?*
Before the [AMBARI-14710|https://issues.apache.org/jira/browse/AMBARI-14710] 
the retry attempt was set to 15 times which was more accurate.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to