[ 
https://issues.apache.org/jira/browse/AMBARI-20754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15996743#comment-15996743
 ] 

Yuanbo Liu commented on AMBARI-20754:
-------------------------------------

[~dili] Can we backport this JIRA into 2.5. We have a such kind of defect to 
fix in next version.

> get_value_from_jmx constantly prints exception message in retry mechanism, 
> which brings bad user experience
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: AMBARI-20754
>                 URL: https://issues.apache.org/jira/browse/AMBARI-20754
>             Project: Ambari
>          Issue Type: Bug
>            Reporter: Yuanbo Liu
>            Assignee: Yuanbo Liu
>             Fix For: trunk
>
>         Attachments: AMBARI-20754.001.patch
>
>
> {{get_value_from_jmx}} of {{jmx.py}} is used in getting NameNode HA state. As 
> we know, if the cluster is large, it takes a long time for Namenode to leave 
> safe mode when restarting Namenode, thus we use retry mechanism to invoke 
> {{get_value_from_jmx}} in case of getting wrong state. The problem is that, 
> {{get_value_from_jmx}} will print several exception message into std_error 
> during retrying, it confuses users because there're error messages in 
> std_error, while all the services restart successfully. Here are the error 
> messages:
> {quote}
> 2017-04-12 15:12:56,633 - Getting jmx metrics from NN failed. URL: 
> http://xxxx:50070/jmx?qry=Hadoop:service=NameNode,name=FSNamesystem
> Traceback (most recent call last):
> File 
> "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/jmx.py",
>  line 38, in get_value_from_jmx
>    _, data, _ = get_user_call_output(cmd, user=run_user, quiet=False)
> File 
> "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/get_user_call_output.py",
>  line 61, in get_user_call_output
>    raise ExecutionFailed(err_msg, code, files_output[0], files_output[1])
> ExecutionFailed: Execution of 'curl --negotiate -u : -s 
> 'http://xxxx:50070/jmx?qry=Hadoop:service=NameNode,name=FSNamesystem' 
> 1>/tmp/tmpWp05DF 2>/tmp/tmphm2dny' returned 7.
> 2017-04-12 15:12:58,562 - Getting jmx metrics from NN failed. URL: 
> http://xxxx:50070/jmx?qry=Hadoop:service=NameNode,name=FSNamesystem
> Traceback (most recent call last):
> File 
> "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/jmx.py",
>  line 42, in get_value_from_jmx
>    return data_dict["beans"][0][property]
> IndexError: list index out of range
> {quote}
> We should improve it.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to