Mostafa Mokhtar created IMPALA-6403: ---------------------------------------
Summary: Enable file handle re-use for multiple scan ranges within the same file for an HDFS Scan node Key: IMPALA-6403 URL: https://issues.apache.org/jira/browse/IMPALA-6403 Project: IMPALA Issue Type: Improvement Reporter: Mostafa Mokhtar Assignee: Joe McDonnell Impala creates a file handle per scan range, for queries that read multiple columns per scan range un-necessarily large load is added to the HDFS NameNode which limits scalability on large clusters. For a given set of scan ranges against a file within a Scan Node a single file handle should be created an reused to avoid excessive RPCs. -- This message was sent by Atlassian JIRA (v7.6.3#76005)