files with control-A,B are not delimited correctly.
---------------------------------------------------
Key: HIVE-2303
URL: https://issues.apache.org/jira/browse/HIVE-2303
Project: Hive
Issue Type: Bug
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
The following is from one of our users:
create external table impressions (imp string, msg string)
row format delimited
fields terminated by '\t'
lines terminated by '\n'
stored as textfile
location '/xxx';
Some strings in my data contains Control-A, Control-B etc as internal
delimiters. If I do a
Select * from impressions limit 10;
All fields were able to print correctly. However if I do a
Select * from impressions where msg regexp '.*' limit 10;
The fields were broken by the control characters. The difference between the 2
commands is that the latter requires a map-reduce job.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira