[ https://issues.apache.org/jira/browse/HADOOP-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14707467#comment-14707467 ]
Xiaoyu Yao commented on HADOOP-12325: ------------------------------------- [~anu], thanks for updating the patch. Some further comments: bq. I could not find any place where we were using WritableRpcEngine for real, hence I did not make that change. I've attached a caller hierarchy from IntelliJ for WritableRpcEngine#call. I think we should logSlowRpcCalls for both ProtobufRpcEngine#call and WritableRpcEngine#call. The easy way to do this is to refactor logSlowRpcCalls from ProtobufRpcEngine.java to Server.java. This way, both ProtocoBufRpcEngine#call and WritableRpcEngine#call can simply call server#logSlowRpcCalls instead. There is an early JIRA HDFS-5180 with patches available sharing the similar idea. We can resolve that one once HADOOP-12325 is fixed. > RPC Metrics : Add the ability track and log slow RPCs > ----------------------------------------------------- > > Key: HADOOP-12325 > URL: https://issues.apache.org/jira/browse/HADOOP-12325 > Project: Hadoop Common > Issue Type: Improvement > Components: ipc, metrics > Affects Versions: 2.7.1 > Reporter: Anu Engineer > Assignee: Anu Engineer > Attachments: HADOOP-12325.001.patch, HADOOP-12325.002.patch, > HADOOP-12325.003.patch > > > This JIRA proposes to add a counter called RpcSlowCalls and also a > configuration setting that allows users to log really slow RPCs. Slow RPCs > are RPCs that fall at 99th percentile. This is useful to troubleshoot why > certain services like name node freezes under heavy load. -- This message was sent by Atlassian JIRA (v6.3.4#6332)