[ 
https://issues.apache.org/jira/browse/HIVE-27662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raghav Aggarwal updated HIVE-27662:
-----------------------------------
    Description: 
When reading a text table with vectorization on and hive.fetch.task.conversion 
as none, wrong parsing of delimiter is happening in nested complex types 
containing map. For example, if a columns schema is like: 
map<string,structid:string,name:string> then \u0004 char is coming in the 
output. Here is a example:

 

  was:When reading the data from text file format (with vectorizaton on) which 
contains multiple delimiter like ^A ^B ^C ^D etc i.e (\u0001, \u0002, \u0003, 
\u0004), incorrect parsing of data is happening which leads to incorrect 
result. 


> Incorrect parsing of nested complex types containing map during vectorized 
> text processing
> ------------------------------------------------------------------------------------------
>
>                 Key: HIVE-27662
>                 URL: https://issues.apache.org/jira/browse/HIVE-27662
>             Project: Hive
>          Issue Type: Bug
>          Components: Vectorization
>            Reporter: Raghav Aggarwal
>            Assignee: Raghav Aggarwal
>            Priority: Major
>
> When reading a text table with vectorization on and 
> hive.fetch.task.conversion as none, wrong parsing of delimiter is happening 
> in nested complex types containing map. For example, if a columns schema is 
> like: map<string,structid:string,name:string> then \u0004 char is coming in 
> the output. Here is a example:
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to