-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/24377/
-----------------------------------------------------------
(Updated Aug. 13, 2014, 8:13 a.m.)
Review request for hive.
Bugs: HIVE-7142
https://issues.apache.org/jira/browse/HIVE-7142
Repository: hive-git
Description
-------
Currently Hive only support serialize data into UTF-8 charset bytes or
deserialize from UTF-8 bytes, real world users may want to load different kinds
of encoded data into hive directly. This jira is dedicated to support
serialize/deserialize all kinds of encoded data in SerDe layer.
For user, only need to configure serialization encoding on table level by set
serialization encoding through serde parameter, for example:
CREATE TABLE person(id INT, name STRING, desc STRING)ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH
SERDEPROPERTIES("serialization.encoding"='GBK');
or
ALTER TABLE person SET SERDEPROPERTIES ('serialization.encoding'='GBK');
LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property in
this patch.
Diffs (updated)
-----
serde/if/serde.thrift 31c87ee
serde/src/gen/thrift/gen-cpp/serde_constants.h d56c917
serde/src/gen/thrift/gen-cpp/serde_constants.cpp 54503e3
serde/src/gen/thrift/gen-javabean/org/apache/hadoop/hive/serde/serdeConstants.java
515cf25
serde/src/gen/thrift/gen-php/org/apache/hadoop/hive/serde/Types.php 837dd11
serde/src/gen/thrift/gen-py/org_apache_hadoop_hive_serde/constants.py 8eac87d
serde/src/gen/thrift/gen-rb/serde_constants.rb ed86522
serde/src/java/org/apache/hadoop/hive/serde2/AbstractEncodingAwareSerDe.java
PRE-CREATION
serde/src/java/org/apache/hadoop/hive/serde2/DelimitedJSONSerDe.java 179f9b5
serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java b7fb048
serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazySimpleSerDe.java
fb55c70
Diff: https://reviews.apache.org/r/24377/diff/
Testing
-------
Thanks,
chengxiang li