[jira] [Commented] (HIVE-5221) Issue in column type with data type as BINARY

2013-11-09 Thread Ashutosh Chauhan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-5221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13818176#comment-13818176
 ] 

Ashutosh Chauhan commented on HIVE-5221:


[~brocknoland] I think you have a valid point. We should do the same on the 
write side as we now do on the read side, i.e. if a user wants to store binary 
data in a text file (which is unusual, but possible), they need to use a UDF to 
encode it before passing it to the serde. Then we are consistent on both the 
read and write sides: the user applies the appropriate UDFs for encoding and 
decoding, and the serde is free of any assumptions about the encoding.
[~kamrul] Can you change LazyUtils.writePrimitiveUTF8 so that it 
doesn't do any encoding?
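
A minimal Python sketch of the symmetric arrangement proposed in the comment above, assuming a '^'-delimited text row like the one in the issue. The names serde_write, serde_read, user_encode and user_decode are hypothetical stand-ins, not Hive APIs, and the row values are made up. The serde stand-ins pass field bytes through untouched; Base64 is applied only by the caller, which is the role the comment assigns to a UDF.

```python
import base64

DELIM = b"^"  # field delimiter taken from the table definition in the issue


def serde_write(fields):
    """Hypothetical serde write path: joins field bytes, applies no encoding."""
    return DELIM.join(fields)


def serde_read(line):
    """Hypothetical serde read path: splits on the delimiter, applies no decoding."""
    return line.split(DELIM)


def user_encode(raw):
    """Stand-in for an encoding UDF the user applies before writing."""
    return base64.b64encode(raw)


def user_decode(text):
    """Stand-in for a decoding UDF the user applies after reading."""
    return base64.b64decode(text)


payload = bytes([0x00, 0x10, 0xFF])  # arbitrary binary value (illustrative)
line = serde_write([b"true", b"user@example.com", user_encode(payload)])
fields = serde_read(line)
restored = user_decode(fields[2])  # the user, not the serde, undoes the encoding
```

With this split of responsibilities the serde never needs to guess whether a BINARY field holds Base64 text or raw bytes.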

 Issue in column type with data type as BINARY
 ---------------------------------------------

 Key: HIVE-5221
 URL: https://issues.apache.org/jira/browse/HIVE-5221
 Project: Hive
 Issue Type: Bug
 Reporter: Arun Vasu
 Assignee: Mohammad Kamrul Islam
 Priority: Critical
 Attachments: HIVE-5221.1.patch, HIVE-5221.2.patch


 Hi,
 I am using Hive 10. When I create an external table with column type as 
 Binary, the query result on the table is showing some junk values for the 
 column with binary datatype.
 Please find below the query I have used to create the table:
 CREATE EXTERNAL TABLE BOOL1(NB BOOLEAN, email STRING, bitfld BINARY)
 ROW FORMAT DELIMITED
   FIELDS TERMINATED BY '^'
   LINES TERMINATED BY '\n'
 STORED AS TEXTFILE
 LOCATION '/user/hivetables/testbinary';
 The query I have used is: select * from bool1
 The sample data in the hdfs file is:
 0^a...@abc.com^001
 1^a...@abc.com^010
  ^a...@abc.com^011
  ^a...@abc.com^100
 t^a...@abc.com^101
 f^a...@abc.com^110
 true^a...@abc.com^111
 false^a...@abc.com^001
 123^^01100010
 12344^^0111
 Please share your inputs if it is possible.
 Thanks,
 Arun
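
One plausible reading of the junk output, sketched in Python. It rests on an assumption that other comments in this thread suggest but the report does not confirm: that the read path Base64-decodes the BINARY field. The digit strings in the sample data consist of valid Base64 characters, so decoding turns them into unrelated bytes. Python's base64.b64decode stands in for Hive's decoder here, and only the two sample values whose length is a multiple of four are used, because Python rejects unpadded input.

```python
import base64

# Two BINARY field values copied from the sample data in the report.
samples = [b"01100010", b"0111"]

# Decode each value as if it were Base64 text (assumed read-side behaviour).
decoded = {text: base64.b64decode(text) for text in samples}

for text, value in decoded.items():
    # The decoded bytes bear no resemblance to the digits stored in the file.
    print(text, "->", value)
```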



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HIVE-5221) Issue in column type with data type as BINARY

2013-11-04 Thread Brock Noland (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-5221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13813334#comment-13813334
 ] 

Brock Noland commented on HIVE-5221:


We can wait for HiveQA, but it looks like a test in TestLazySimpleSerDe might 
fail, since it encodes data as Base64 and expects the serde to decode it.

One more question, which perhaps isn't related: it looks like 
LazyUtils.writePrimitiveUTF8 also encodes data as Base64. Thinking about 
it... doesn't Lazy Serde have to use base64 to store BINARY data?



[jira] [Commented] (HIVE-5221) Issue in column type with data type as BINARY

2013-11-04 Thread Brock Noland (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-5221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13813335#comment-13813335
 ] 

Brock Noland commented on HIVE-5221:


bq. doesn't Lazy Serde have to use base64 to store BINARY data?

Meaning: when dealing with text files, isn't base64 required for BINARY?
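
The concern behind this question can be sketched in Python, assuming the '^' field delimiter and '\n' line delimiter from the table definition in the issue. The helpers write_row and parse are hypothetical and only mimic a delimited text format; they are not Hive code. A raw binary value that happens to contain either delimiter byte corrupts the row layout, while its Base64 form cannot, because the Base64 alphabet contains neither byte.

```python
import base64

FIELD_DELIM = b"^"
LINE_DELIM = b"\n"


def write_row(fields):
    """Hypothetical writer: one delimited text line per row."""
    return FIELD_DELIM.join(fields) + LINE_DELIM


def parse(data):
    """Hypothetical reader: split into lines, then into fields."""
    lines = data.split(LINE_DELIM)[:-1]  # drop the empty piece after the final newline
    return [line.split(FIELD_DELIM) for line in lines]


# Illustrative binary value containing both delimiter bytes (0x5E is '^', 0x0A is '\n').
raw = bytes([0x01, 0x5E, 0x0A, 0x02])

raw_rows = parse(write_row([b"1", raw]))                    # row layout is corrupted
b64_rows = parse(write_row([b"1", base64.b64encode(raw)]))  # row layout is preserved
```

Written raw, the single two-field row comes back as two rows with the wrong field counts; written as Base64, it comes back as one row with two fields.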



[jira] [Commented] (HIVE-5221) Issue in column type with data type as BINARY

2013-11-04 Thread Hive QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-5221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13813537#comment-13813537
 ] 

Hive QA commented on HIVE-5221:



{color:red}Overall{color}: -1 at least one test failed

Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12611731/HIVE-5221.2.patch

{color:red}ERROR:{color} -1 due to 16 failed/errored test(s), 4551 tests 
executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ba_table1
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ba_table2
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ba_table3
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ba_table_udfs
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ba_table_union
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_binary_table_bincolserde
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_binary_table_colserde
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_compute_stats_binary
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_load_binary_data
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_printf
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_reflect2
org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_udf_substr
org.apache.hadoop.hive.serde2.lazy.TestLazyPrimitive.testLazyBinary
org.apache.hcatalog.pig.TestHCatStorer.testStoreFuncAllSimpleTypes
org.apache.hive.hcatalog.pig.TestHCatStorer.testStoreFuncAllSimpleTypes
org.apache.hive.jdbc.TestJdbcDriver2.testDataTypes
{noformat}

Test results: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/134/testReport
Console output: 
http://bigtop01.cloudera.org:8080/job/PreCommit-HIVE-Build/134/console

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests failed with: TestsFailedException: 16 tests failed
{noformat}

This message is automatically generated.

ATTACHMENT ID: 12611731
