[ 
https://issues.apache.org/jira/browse/HDFS-14403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16815511#comment-16815511
 ] 

Daryn Sharp commented on HDFS-14403:
------------------------------------

Sorry for delay, been a little bit distracted.  We should consider 
incorporating elements from both patches.  My splicing wasn't quite as abstract 
as this one.  Notables and cheats:
* {{CallQueueManager}}: Cheated by using {{scheduler instanceof 
DecayRpcScheduler}} to avoid disrupting the interface.
* {{DecayRpcScheduler}}: Hijacked the "*CallCount" variables to be either count 
or cost.
* {{Protobuf/WritableRpcEngine}}: Hoisted redundant metrics update into server 
to allowed finer-grain breakdown.
** Dropped bizarre metric update that recorded failed calls as their exception. 
 Always complicated historical trend analysis of call load or which op was 
being flooded.
* {{Server}}: metrics tweaks
** Skipped calls from disconnected client are no longer billed as near-zero 
processing time – it worse the backlog, the better processing used to look.
** Add lock wait metric.
** Normal metric for processing time still includes lock wait.
** Detailed per op processing time does not include lock wait.  Can finally do 
production load analysis of call expense!
* {{RpcMetrics}}: changed timeunits to microseconds because milliseconds 
accuracy is near worthless.  probably/maybe should be configurable.
* {{FSNamesystemLock}}: callback feels a little clunky but most performant I 
could quickly devise.

> Cost-Based RPC FairCallQueue
> ----------------------------
>
>                 Key: HDFS-14403
>                 URL: https://issues.apache.org/jira/browse/HDFS-14403
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: ipc, namenode
>            Reporter: Erik Krogen
>            Assignee: Christopher Gregorian
>            Priority: Major
>              Labels: qos, rpc
>         Attachments: CostBasedFairCallQueueDesign_v0.pdf, 
> HDFS-14403.001.patch, HDFS-14403.002.patch, HDFS-14403.003.patch
>
>
> HADOOP-15016 initially described extensions to the Hadoop FairCallQueue 
> encompassing both cost-based analysis of incoming RPCs, as well as support 
> for reservations of RPC capacity for system/platform users. This JIRA intends 
> to track the former, as HADOOP-15016 was repurposed to more specifically 
> focus on the reservation portion of the work.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to