[jira] [Updated] (HIVE-5783) Native Parquet Support in Hive

Brock Noland (JIRA) Fri, 17 Jan 2014 11:11:24 -0800

     [ 
https://issues.apache.org/jira/browse/HIVE-5783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Brock Noland updated HIVE-5783:
-------------------------------

    Attachment: HIVE-5783.patch

Hey guys,

I rebased your patch on top of trunk. The bit items I changed are:

* Moved DeprecatedParquet*Format classed back to original package since that is 
what users have stored in their metastore. We should be able to remove those 
classes after 2 releases
* Removed \@author tags since they aren't used in Apache
* Fixed some license headers which were missing



> Native Parquet Support in Hive
> ------------------------------
>
>                 Key: HIVE-5783
>                 URL: https://issues.apache.org/jira/browse/HIVE-5783
>             Project: Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>            Reporter: Justin Coffey
>            Assignee: Justin Coffey
>            Priority: Minor
>         Attachments: HIVE-5783.patch, HIVE-5783.patch, 
> hive-0.11-parquet.patch, parquet-hive.patch
>
>
> Problem Statement:
> Hive would be easier to use if it had native Parquet support. Our 
> organization, Criteo, uses Hive extensively. Therefore we built the Parquet 
> Hive integration and would like to now contribute that integration to Hive.
> About Parquet:
> Parquet is a columnar storage format for Hadoop and integrates with many 
> Hadoop ecosystem tools such as Thrift, Avro, Hadoop MapReduce, Cascading, 
> Pig, Drill, Crunch, and Hive. Pig, Crunch, and Drill all contain native 
> Parquet integration.
> Changes Details:
> Parquet was built with dependency management in mind and therefore only a 
> single Parquet jar will be added as a dependency.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Updated] (HIVE-5783) Native Parquet Support in Hive

Reply via email to