Julien Le Dem created PARQUET-86:
------------------------------------

             Summary: parquet-hive (and therefore Hive) depends on 
ParquetInputSplit constructor
                 Key: PARQUET-86
                 URL: https://issues.apache.org/jira/browse/PARQUET-86
             Project: Parquet
          Issue Type: Bug
          Components: parquet-mr
            Reporter: Julien Le Dem


The issue is not really parquet-hive which we can modify in sync but rather 
Hive itself. As we want to be able to change the split implementation without 
breaking Hive. (Users might want to use the latest Parquet with their version 
of Hive)
[ParquetRecordReaderWrapper in 
parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
and in 
[Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221]

It should use 
https://github.com/apache/incubator-parquet-mr/blob/master/parquet-hadoop/src/main/java/parquet/hadoop/InternalParquetRecordReader.java
 directly instead.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to