[
https://issues.apache.org/jira/browse/PIG-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris Nauroth updated PIG-4442:
-------------------------------
Attachment: PIG-4442.001.patch
The attached patch consolidates to a single {{FileSystem#getFileStatus}} call
and then pulls block size and replication from the same instance.
> Eliminate redundant RPC call to get file information in HPath.
> --------------------------------------------------------------
>
> Key: PIG-4442
> URL: https://issues.apache.org/jira/browse/PIG-4442
> Project: Pig
> Issue Type: Improvement
> Affects Versions: 0.13.0
> Reporter: Chris Nauroth
> Assignee: Chris Nauroth
> Priority: Minor
> Attachments: PIG-4442.001.patch
>
>
> The {{HPath}} class makes 2 separate calls to {{FileSystem#getFileStatus}} to
> get the block size and replication. In the case of HDFS, this results in 2
> separate but identical RPC transactions with the NameNode. The situation is
> the same for many other alternative {{FileSystem}} implementations too. We
> can get a minor latency improvement and lighten some RPC load on the remote
> services by using a single call and getting the block size and replication
> from the same response.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)