Tongjie Chen created PARQUET-100:
------------------------------------
Summary: Provide an option in parquet-pig to avoid reading footers
on the client side
Key: PARQUET-100
URL: https://issues.apache.org/jira/browse/PARQUET-100
Project: Parquet
Issue Type: Improvement
Components: parquet-mr
Affects Versions: parquet-mr_1.6.0
Reporter: Tongjie Chen
Parquet-Pig reads footers on the client side to calculate splits, retrieve the
schema, etc.
In an HCatalog environment, if Hive has generated a large number of files,
Parquet-Pig spends a significant amount of time processing those footers on the
client side (before the job is submitted to the cluster).