[
https://issues.apache.org/jira/browse/HIVE-4618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Phabricator updated HIVE-4618:
------------------------------
Attachment: HIVE-4618.D11007.1.patch
navis requested code review of "HIVE-4618 [jira] show create table creating
unusable DDL when field delimiter is \001".
Reviewers: JIRA
HIVE-4618 show create table creating unusable DDL when field delimiter is \001
When including a "fields terminated by" in the create statement. If the
delimiter is preceded by a \001, hive turns this into \u0001 which is correct.
However it then gives you a ddl that does not work because the parser changes
the \u0001 into u0001.
Example:
hive> create table j1 (a string) row format delimited fields terminated by
'\001';
hive> show create table j1;
CREATE TABLE j1(
a string)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY '\u0001'
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'hdfs://forza-1.cloud.rtp.cloudera.com:8020/user/hive/warehouse/j1'
TBLPROPERTIES (
'transient_lastDdlTime'='1369664999')
hive> desc formatted j1;
…shortened to save space
Storage Desc Params:
field.delim \u0001
serialization.format \u0001
hive> drop table j1;
hive> CREATE TABLE j1(
> a string)
> ROW FORMAT DELIMITED
> FIELDS TERMINATED BY '\u0001'
> STORED AS INPUTFORMAT
> 'org.apache.hadoop.mapred.TextInputFormat'
> OUTPUTFORMAT
> 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
> LOCATION
> 'hdfs://forza-1.cloud.rtp.cloudera.com:8020/user/hive/warehouse/j1'
> TBLPROPERTIES (
> 'transient_lastDdlTime'='1369664999');
hive> desc formatted j1;
…shortened to save space
Storage Desc Params:
field.delim u0001
serialization.format u0001
TEST PLAN
EMPTY
REVISION DETAIL
https://reviews.facebook.net/D11007
AFFECTED FILES
ql/src/java/org/apache/hadoop/hive/ql/parse/BaseSemanticAnalyzer.java
ql/src/test/queries/clientpositive/unicode_notation.q
ql/src/test/results/clientpositive/unicode_notation.q.out
MANAGE HERALD RULES
https://reviews.facebook.net/herald/view/differential/
WHY DID I GET THIS EMAIL?
https://reviews.facebook.net/herald/transcript/26277/
To: JIRA, navis
> show create table creating unusable DDL when field delimiter is \001
> --------------------------------------------------------------------
>
> Key: HIVE-4618
> URL: https://issues.apache.org/jira/browse/HIVE-4618
> Project: Hive
> Issue Type: Bug
> Components: CLI
> Affects Versions: 0.10.0
> Environment: CDH4.2
> Hive 0.10
> Reporter: Johndee Burks
> Assignee: Navis
> Priority: Minor
> Attachments: HIVE-4618.D11007.1.patch
>
>
> When including a "fields terminated by" in the create statement. If the
> delimiter is preceded by a \001, hive turns this into \u0001 which is
> correct. However it then gives you a ddl that does not work because the
> parser changes the \u0001 into u0001.
> Example:
> hive> create table j1 (a string) row format delimited fields terminated by
> '\001';
> hive> show create table j1;
> CREATE TABLE j1(
> a string)
> ROW FORMAT DELIMITED
> FIELDS TERMINATED BY '\u0001'
> STORED AS INPUTFORMAT
> 'org.apache.hadoop.mapred.TextInputFormat'
> OUTPUTFORMAT
> 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
> LOCATION
> 'hdfs://forza-1.cloud.rtp.cloudera.com:8020/user/hive/warehouse/j1'
> TBLPROPERTIES (
> 'transient_lastDdlTime'='1369664999')
> hive> desc formatted j1;
> …shortened to save space
> Storage Desc Params:
> field.delim \u0001
> serialization.format \u0001
> hive> drop table j1;
> hive> CREATE TABLE j1(
> > a string)
> > ROW FORMAT DELIMITED
> > FIELDS TERMINATED BY '\u0001'
> > STORED AS INPUTFORMAT
> > 'org.apache.hadoop.mapred.TextInputFormat'
> > OUTPUTFORMAT
> > 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
> > LOCATION
> > 'hdfs://forza-1.cloud.rtp.cloudera.com:8020/user/hive/warehouse/j1'
> > TBLPROPERTIES (
> > 'transient_lastDdlTime'='1369664999');
> hive> desc formatted j1;
> …shortened to save space
> Storage Desc Params:
> field.delim u0001
> serialization.format u0001
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira