Julien Le Dem created PARQUET-86:
------------------------------------
Summary: parquet-hive (and therefore Hive) depends on
ParquetInputSplit constructor
Key: PARQUET-86
URL: https://issues.apache.org/jira/browse/PARQUET-86
Project: Parquet
Issue Type: Bug
Components: parquet-mr
Reporter: Julien Le Dem
The issue is not really parquet-hive which we can modify in sync but rather
Hive itself. As we want to be able to change the split implementation without
breaking Hive. (Users might want to use the latest Parquet with their version
of Hive)
[ParquetRecordReaderWrapper in
parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
and in
[Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221]
It should use
https://github.com/apache/incubator-parquet-mr/blob/master/parquet-hadoop/src/main/java/parquet/hadoop/InternalParquetRecordReader.java
directly instead.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)