[ https://issues.apache.org/jira/browse/HIVE-13840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Prasanth Jayachandran updated HIVE-13840:
-----------------------------------------
    Attachment: HIVE-13840.2.patch

In the updated patch:
1) Another file system call during split generation is avoided by specifying the max length in the reader. If the max length is not specified, the ORC reader issues fs.getFileStatus(path) to find the length of the file.
2) Added file system stats to MockFS, which is used in the newly added test case.

FYI: [~rajesh.balamohan], [~ashutoshc], [~owen.omalley]. Can you please review the patch?

> Orc split generation is reading file footers twice
> --------------------------------------------------
>
>                 Key: HIVE-13840
>                 URL: https://issues.apache.org/jira/browse/HIVE-13840
>             Project: Hive
>          Issue Type: Bug
>          Components: ORC
>    Affects Versions: 2.1.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Critical
>         Attachments: HIVE-13840.1.patch, HIVE-13840.2.patch
>
>
> Recent refactorings to move orc out introduced a regression in split
> generation. This leads to reading the orc file footers twice during split
> generation.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)