Chris Nauroth created PIG-4442:
----------------------------------
Summary: Eliminate redundant RPC call to get file information in
HPath.
Key: PIG-4442
URL: https://issues.apache.org/jira/browse/PIG-4442
Project: Pig
Issue Type: Improvement
Affects Versions: 0.13.0
Reporter: Chris Nauroth
Assignee: Chris Nauroth
Priority: Minor
The {{HPath}} class makes 2 separate calls to {{FileSystem#getFileStatus}} to
get the block size and replication. In the case of HDFS, this results in 2
separate but identical RPC transactions with the NameNode. The situation is
the same for many other alternative {{FileSystem}} implementations too. We can
get a minor latency improvement and lighten some RPC load on the remote
services by using a single call and getting the block size and replication from
the same response.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)