----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/24377/ -----------------------------------------------------------
(Updated Aug. 6, 2014, 9:11 a.m.) Review request for hive. Bugs: HIVE-7142 https://issues.apache.org/jira/browse/HIVE-7142 Repository: hive-git Description ------- Currently Hive only support serialize data into UTF-8 charset bytes or deserialize from UTF-8 bytes, real world users may want to load different kinds of encoded data into hive directly. This jira is dedicated to support serialize/deserialize all kinds of encoded data in SerDe layer. For user, only need to configure serialization encoding on table level by set serialization encoding through serde parameter, for example: CREATE TABLE person(id INT, name STRING, desc STRING)ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH SERDEPROPERTIES("serialization.encoding"='GBK'); or ALTER TABLE person SET SERDEPROPERTIES ('serialization.encoding'='GBK'); LIMITATIONS: Only LazySimpleSerDe support "serialization.encoding" property in this patch. Diffs (updated) ----- serde/src/java/org/apache/hadoop/hive/serde2/AbstractEncodingAwareSerDe.java PRE-CREATION serde/src/java/org/apache/hadoop/hive/serde2/DelimitedJSONSerDe.java 179f9b5 serde/src/java/org/apache/hadoop/hive/serde2/SerDeUtils.java b7fb048 serde/src/java/org/apache/hadoop/hive/serde2/lazy/LazySimpleSerDe.java fb55c70 Diff: https://reviews.apache.org/r/24377/diff/ Testing ------- Thanks, chengxiang li