[ 
https://issues.apache.org/jira/browse/HADOOP-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14707467#comment-14707467
 ] 

Xiaoyu Yao commented on HADOOP-12325:
-------------------------------------

[~anu], thanks for updating the patch. Some further comments: 

bq. I could not find any place where we were using WritableRpcEngine for real, 
hence I did not make that change.

I've attached a caller hierarchy from IntelliJ for WritableRpcEngine#call. I 
think we should logSlowRpcCalls for both ProtobufRpcEngine#call and 
WritableRpcEngine#call. The easy way to do this is to refactor logSlowRpcCalls 
from ProtobufRpcEngine.java to Server.java. This way, both 
ProtocoBufRpcEngine#call and WritableRpcEngine#call can simply call 
server#logSlowRpcCalls instead.

There is an early JIRA HDFS-5180 with patches available sharing the similar 
idea. We can resolve that one once HADOOP-12325 is fixed.

> RPC Metrics : Add the ability track and log slow RPCs
> -----------------------------------------------------
>
>                 Key: HADOOP-12325
>                 URL: https://issues.apache.org/jira/browse/HADOOP-12325
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: ipc, metrics
>    Affects Versions: 2.7.1
>            Reporter: Anu Engineer
>            Assignee: Anu Engineer
>         Attachments: HADOOP-12325.001.patch, HADOOP-12325.002.patch, 
> HADOOP-12325.003.patch
>
>
> This JIRA proposes to add a counter called RpcSlowCalls and also a 
> configuration setting that allows users to log really slow RPCs.  Slow RPCs 
> are RPCs that fall at 99th percentile. This is useful to troubleshoot why 
> certain services like name node freezes under heavy load.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to