[ 
https://issues.apache.org/jira/browse/HIVE-13840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth Jayachandran updated HIVE-13840:
-----------------------------------------
    Attachment: HIVE-13840.2.patch

In the updated patch
1) Another file system call in split generation is avoided by specifying max 
length in reader. If max length is not specified ORC reader will issue 
fs.getFileStatus(path) to find the length of the file.
2) Added file system stats to MockFS which is used in the newly added test case

fyi.. [~rajesh.balamohan],[~ashutoshc]

[~owen.omalley] Can you please review the patch?

> Orc split generation is reading file footers twice
> --------------------------------------------------
>
>                 Key: HIVE-13840
>                 URL: https://issues.apache.org/jira/browse/HIVE-13840
>             Project: Hive
>          Issue Type: Bug
>          Components: ORC
>    Affects Versions: 2.1.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Critical
>         Attachments: HIVE-13840.1.patch, HIVE-13840.2.patch
>
>
> Recent refactorings to move orc out introduced a regression in split 
> generation. This leads to reading the orc file footers twice during split 
> generation.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to