Timothy Chen created HIVE-6143:
----------------------------------
Summary: Refactor Orc file format parsing logic to be shared
Key: HIVE-6143
URL: https://issues.apache.org/jira/browse/HIVE-6143
Project: Hive
Issue Type: Bug
Reporter: Timothy Chen
Currently the Orc file format parsing logic (for example footer parsing,
stream loading, etc.) is hidden in private methods in the reader and record
reader classes.
For the Orc file format to be more reusable outside of Hive, I suggest
refactoring this generic logic into a shared class.
The current interface, which reads rows deserialized as objects, is not
sufficient: for columnar execution engines such as Drill/Impala, it is much
more efficient to load columnar data directly into their own columnar
in-memory formats.
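
To illustrate the difference, here is a minimal sketch of the two access
styles. Every name in it (ColumnarSketch, ColumnBatch, readAsRows,
loadColumns) is hypothetical and made up for this example; none of them are
existing Hive or Orc classes, and the real shared class would work from
decoded Orc streams rather than from plain arrays.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch only: these are NOT Hive or Orc APIs.
public class ColumnarSketch {

    /** Stand-in for already-decoded column data: one primitive array per column. */
    public static final class ColumnBatch {
        public final long[] ids;
        public final double[] prices;

        public ColumnBatch(long[] ids, double[] prices) {
            if (ids.length != prices.length) {
                throw new IllegalArgumentException("columns must have equal length");
            }
            this.ids = ids;
            this.prices = prices;
        }

        public int size() {
            return ids.length;
        }
    }

    /** Row-oriented access: allocates one Object[] per row and boxes every value. */
    public static List<Object[]> readAsRows(ColumnBatch batch) {
        List<Object[]> rows = new ArrayList<>(batch.size());
        for (int i = 0; i < batch.size(); i++) {
            rows.add(new Object[] { batch.ids[i], batch.prices[i] });
        }
        return rows;
    }

    /** Column-oriented access: bulk-copies each column into the caller's own vectors. */
    public static void loadColumns(ColumnBatch batch, long[] idVector, double[] priceVector) {
        System.arraycopy(batch.ids, 0, idVector, 0, batch.size());
        System.arraycopy(batch.prices, 0, priceVector, 0, batch.size());
    }

    public static void main(String[] args) {
        ColumnBatch batch = new ColumnBatch(new long[] { 1L, 2L, 3L },
                                            new double[] { 9.5, 3.25, 7.0 });

        // Row path: three Object[] allocations plus six boxed values.
        List<Object[]> rows = readAsRows(batch);
        System.out.println("rows read: " + rows.size());

        // Columnar path: two bulk copies, no per-row objects.
        long[] idVector = new long[batch.size()];
        double[] priceVector = new double[batch.size()];
        loadColumns(batch, idVector, priceVector);
        System.out.println("values per column: " + idVector.length);
    }
}
```

With the parsing logic in a shared class, an engine could take the second
path and fill its own vectors without going through per-row objects.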
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)