[ 
https://issues.apache.org/jira/browse/HIVE-16351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Doubrovkine updated HIVE-16351:
--------------------------------------
    Description: 
From https://github.com/rcongiu/Hive-JSON-Serde/issues/65

This happens with both JSON and MongoDB connector Serde, so I don't believe 
this is a Serde bug.

Using 
http://www.congiu.net/hive-json-serde/1.3.6/cdh4/json-serde-1.3.6-jar-with-dependencies.jar
 placed into /usr/local/Cellar/apache-hive-1.2.1/lib

A dummy test.json whose text value contains an escaped newline (\n):

{code}
$ cat /tmp/test.json
{"text":"foo\nbar","number":123}

$ hadoop fs -mkdir /user/data

$ hive
hive> CREATE DATABASE test;

hive> CREATE EXTERNAL TABLE test ( text string )
    > ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'
    > LOCATION '/user/data';

hive> SELECT * FROM test;

foo
bar     123
NULL    NULL
{code}

This is clearly wrong: the file contains only one row of data.
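
The one-row claim can be checked outside Hive. The following is a minimal sketch, not part of the original report: it assumes the file holds exactly the bytes shown by `cat` above (one JSON record followed by a blank line, which is itself an assumption), and the names `RAW` and `parse_records` are made up for illustration.

```python
import json

# Assumed file contents, matching the `cat` output above: one JSON record
# whose "text" value contains an escaped newline (backslash + n), followed
# by a blank line. The trailing blank line is an assumption.
RAW = '{"text":"foo\\nbar","number":123}\n\n'


def parse_records(raw):
    """Parse one JSON object per non-empty physical line."""
    return [json.loads(line) for line in raw.splitlines() if line.strip()]


records = parse_records(RAW)
print(len(records))               # number of JSON records in the file
print(repr(records[0]["text"]))   # decoded value; the escape is now a real newline
```

Under these assumptions a plain JSON parser sees a single record, and the line break only exists inside the decoded string value, not between records.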

  was:
From https://github.com/rcongiu/Hive-JSON-Serde/issues/65

This happens with both JSON and MongoDB connector Serde, so I don't believe 
this is a Serde bug.

Using 
http://www.congiu.net/hive-json-serde/1.3.6/cdh4/json-serde-1.3.6-jar-with-dependencies.jar
 placed into /usr/local/Cellar/apache-hive-1.2.1/lib

A dummy test.json whose text value contains an escaped newline (\n):

```
$ cat /tmp/test.json
{"text":"foo\nbar","number":123}

$ hadoop fs -mkdir /user/data

$ hive
hive> CREATE DATABASE test;

hive> CREATE EXTERNAL TABLE test ( text string )
    > ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'
    > LOCATION '/user/data';

hive> SELECT * FROM test;

foo
bar     123
NULL    NULL
```

This is clearly wrong: the file contains only one row of data.


> Hive confused by CR/LFs
> -----------------------
>
>                 Key: HIVE-16351
>                 URL: https://issues.apache.org/jira/browse/HIVE-16351
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive, Serializers/Deserializers
>    Affects Versions: 1.2.1
>         Environment: Hadoop 2.7.3
>            Reporter: Daniel Doubrovkine
>
> From https://github.com/rcongiu/Hive-JSON-Serde/issues/65
> This happens with both JSON and MongoDB connector Serde, so I don't believe 
> this is a Serde bug.
> Using 
> http://www.congiu.net/hive-json-serde/1.3.6/cdh4/json-serde-1.3.6-jar-with-dependencies.jar
>  placed into /usr/local/Cellar/apache-hive-1.2.1/lib
> A dummy test.json whose text value contains an escaped newline (\n):
> {code}
> $ cat /tmp/test.json
> {"text":"foo\nbar","number":123}
> $ hadoop fs -mkdir /user/data
> $ hive
> hive> CREATE DATABASE test;
> hive> CREATE EXTERNAL TABLE test ( text string )
>     > ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'
>     > LOCATION '/user/data';
> hive> SELECT * FROM test;
> foo
> bar   123
> NULL  NULL
> {code}
> This is clearly wrong: the file contains only one row of data.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
