[ 
https://issues.apache.org/jira/browse/HDFS-9198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14987330#comment-14987330
 ] 

Daryn Sharp commented on HDFS-9198:
-----------------------------------

# I named it more generically since it can be used for more, but I'll rename if 
you feel strongly.
# I originally had it outside the loop but something failed.  Probably a test.  
Since it's a class static it should be cheap.
# The NN is terminated when the queue stops accepting offers and the thread 
isn't running, but I'll add a try/catch around the run() loop too.
# Will look at the style again.
# Yes, I compensated for async IBR failures by making the next heartbeat 
trigger a re-registration.  The only exception we've seen from an IBR is 
dead/unregistered node so the new code is a no-op but I added it as a safety 
net.
# Ok.

> Coalesce IBR processing in the NN
> ---------------------------------
>
>                 Key: HDFS-9198
>                 URL: https://issues.apache.org/jira/browse/HDFS-9198
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: namenode
>    Affects Versions: 2.0.0-alpha
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>         Attachments: HDFS-9198-branch2.patch, HDFS-9198-trunk.patch, 
> HDFS-9198-trunk.patch, HDFS-9198-trunk.patch
>
>
> IBRs from thousands of DNs under load will degrade NN performance due to 
> excessive write-lock contention from multiple IPC handler threads.  The IBR 
> processing is quick, so the lock contention may be reduced by coalescing 
> multiple IBRs into a single write-lock transaction.  The handlers will also 
> be freed up faster for other operations.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to