[ 
https://issues.apache.org/jira/browse/PIG-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14345522#comment-14345522
 ] 

Chris Nauroth commented on PIG-4442:
------------------------------------

Thanks for the reviews and commit, Daniel and Prashant!  NameNodes everywhere 
thank you for lightening their load a bit.  :-)

> Eliminate redundant RPC call to get file information in HPath.
> --------------------------------------------------------------
>
>                 Key: PIG-4442
>                 URL: https://issues.apache.org/jira/browse/PIG-4442
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.13.0
>            Reporter: Chris Nauroth
>            Assignee: Chris Nauroth
>            Priority: Minor
>             Fix For: 0.15.0
>
>         Attachments: PIG-4442.001.patch
>
>
> The {{HPath}} class makes 2 separate calls to {{FileSystem#getFileStatus}} to 
> get the block size and replication.  In the case of HDFS, this results in 2 
> separate but identical RPC transactions with the NameNode.  The situation is 
> the same for many other alternative {{FileSystem}} implementations too.  We 
> can get a minor latency improvement and lighten some RPC load on the remote 
> services by using a single call and getting the block size and replication 
> from the same response.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to