[ 
https://issues.apache.org/jira/browse/HDFS-7597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15183351#comment-15183351
 ] 

Chris Nauroth commented on HDFS-7597:
-------------------------------------

bq. I'm still not sure why HDFS-8855 is/was necessary because this internal 
patch solved the problem for us long ago.

Yes, agreed.  That's why it was a forehead-smacking moment when I realized that 
essentially the same issue had mistakenly been fixed twice.

I agree that this patch is a more general solution.  We might consider pulling 
out HDFS-8855 later as a clean-up.  As for the scope of this patch, do you want 
to address the {{TestDataNodeUGIProvider}} failure here and defer any further 
clean-up to a separate issue?

> DNs should not open new NN connections when webhdfs clients seek
> ----------------------------------------------------------------
>
>                 Key: HDFS-7597
>                 URL: https://issues.apache.org/jira/browse/HDFS-7597
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: webhdfs
>    Affects Versions: 2.0.0-alpha
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>              Labels: BB2015-05-TBR
>         Attachments: HDFS-7597.patch, HDFS-7597.patch, HDFS-7597.patch
>
>
> Webhdfs seeks involve closing the current connection and issuing a new open 
> request with the new offset.  The RPC layer caches connections, so the DN 
> keeps a lingering connection open to the NN.  Connection caching is in part 
> based on UGI.  Although the client uses the same token for the new offset 
> request, the UGI is different, which forces the DN to open another 
> unnecessary connection to the NN.
> A job that performs many seeks will easily crash the NN due to fd exhaustion.
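
For readers skimming the description above, here is a minimal sketch of the 
failure mode it describes.  It does not use any Hadoop classes: {{User}} and 
{{ConnectionCache}} are hypothetical stand-ins for the UGI and the RPC 
connection cache, and the assumption that the cache key compares user objects 
by identity is mine.  The second mode (reusing one user object per token) is 
only one possible remedy and is not necessarily what the attached patch does.

```java
import java.util.HashMap;
import java.util.Map;

public class UgiCacheSketch {
    /** Hypothetical stand-in for a UGI: no equals/hashCode override, so compared by identity. */
    static final class User {
        final String token;
        User(String token) { this.token = token; }
    }

    /** Hypothetical stand-in for the RPC connection cache, keyed on the user object. */
    static final class ConnectionCache {
        private final Map<User, Object> connections = new HashMap<>();

        void getConnection(User user) {
            // A new "connection" is created only when the key is not already cached.
            connections.computeIfAbsent(user, u -> new Object());
        }

        int size() { return connections.size(); }
    }

    /** Simulates a number of seeks with the same token; returns how many connections end up cached. */
    static int connectionsOpened(boolean reuseUser, int seeks) {
        ConnectionCache rpcCache = new ConnectionCache();
        Map<String, User> usersByToken = new HashMap<>();
        String token = "same-token";
        for (int i = 0; i < seeks; i++) {
            User user = reuseUser
                ? usersByToken.computeIfAbsent(token, User::new) // one user object per token
                : new User(token);                               // fresh user object per request
            rpcCache.getConnection(user);
        }
        return rpcCache.size();
    }

    public static void main(String[] args) {
        System.out.println("fresh user per seek:   " + connectionsOpened(false, 100));
        System.out.println("user reused per token: " + connectionsOpened(true, 100));
    }
}
```

With a fresh user object per seek, every request misses the cache even though 
the token is identical, so the number of cached connections grows with the 
number of seeks.  Reusing the user object keeps it at one.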



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
