[ https://issues.apache.org/jira/browse/PIG-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14346251#comment-14346251 ]
Prashant Kommireddi commented on PIG-4442: ------------------------------------------ Schumi, I shall get you one day! > Eliminate redundant RPC call to get file information in HPath. > -------------------------------------------------------------- > > Key: PIG-4442 > URL: https://issues.apache.org/jira/browse/PIG-4442 > Project: Pig > Issue Type: Improvement > Affects Versions: 0.13.0 > Reporter: Chris Nauroth > Assignee: Chris Nauroth > Priority: Minor > Fix For: 0.15.0 > > Attachments: PIG-4442.001.patch > > > The {{HPath}} class makes 2 separate calls to {{FileSystem#getFileStatus}} to > get the block size and replication. In the case of HDFS, this results in 2 > separate but identical RPC transactions with the NameNode. The situation is > the same for many other alternative {{FileSystem}} implementations too. We > can get a minor latency improvement and lighten some RPC load on the remote > services by using a single call and getting the block size and replication > from the same response. -- This message was sent by Atlassian JIRA (v6.3.4#6332)