H. Vetinari created HIVE-22005:
----------------------------------

             Summary: Handle complete parquet specification
                 Key: HIVE-22005
                 URL: https://issues.apache.org/jira/browse/HIVE-22005
             Project: Hive
          Issue Type: Improvement
    Affects Versions: All Versions
            Reporter: H. Vetinari


Hive cannot read Parquet files written by Spark (with default settings) after 
version 1.4. Spark uses a different internal representation there, but one that 
stays faithful to the Parquet specification (see SPARK-20297).

Hive should be able to read such Spark-written data and, ideally, Parquet files 
from other writers (Arrow, etc.) that follow the spec.

Quote from SPARK-20297:
> The standard doesn't say that smaller decimals *have* to be stored in 
> int32/int64, it just is an option for subset of decimal types. int32 and 
> int64 are valid representations for a subset of decimal types. 
> fixed_len_byte_array and binary are a valid representation of any decimal 
> type.
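
To illustrate the point of the quote, here is a minimal Python sketch of what a 
spec-tolerant reader has to accept. It is not Hive or Spark code; the helper 
names valid_physical_types and decode_decimal are hypothetical. It assumes the 
Parquet format rules that INT32 may carry decimals up to precision 9, INT64 up 
to precision 18, and that FIXED_LEN_BYTE_ARRAY and BINARY carry the unscaled 
value as a big-endian two's-complement integer.

```python
from decimal import Decimal


def valid_physical_types(precision: int) -> list[str]:
    """List the physical types that may carry a decimal of this precision."""
    if precision < 1:
        raise ValueError("decimal precision must be at least 1")
    types = []
    if precision <= 9:
        types.append("INT32")
    if precision <= 18:
        types.append("INT64")
    # Byte arrays are valid for any precision (a fixed-length array must
    # simply be wide enough to hold the unscaled value).
    types += ["FIXED_LEN_BYTE_ARRAY", "BINARY"]
    return types


def decode_decimal(raw, scale: int) -> Decimal:
    """Decode a decimal from an int (INT32/INT64) or bytes (byte-array types)."""
    if isinstance(raw, int):
        unscaled = raw
    else:
        # Byte arrays hold the unscaled value, big-endian two's complement.
        unscaled = int.from_bytes(raw, byteorder="big", signed=True)
    # Build the Decimal from its parts so no context rounding is applied.
    sign = 1 if unscaled < 0 else 0
    digits = tuple(int(ch) for ch in str(abs(unscaled)))
    return Decimal((sign, digits, -scale))
```

With this sketch, decimal(5,2) value 123.45 decodes identically whether it 
arrives as the integer 12345 or as the corresponding byte array, which is the 
behaviour a reader needs in order to accept every representation the spec allows.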

Arguably, this is a subtask of HIVE-12398.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
