[
https://issues.apache.org/jira/browse/HIVE-9168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14254236#comment-14254236
]
Matt McCline commented on HIVE-9168:
------------------------------------
[~vikram.dixit] Can you please approve for 0.14.1 -- Thank You (Data
Corruption Issue).
> Vectorized Coalesce for strings is broken
> -----------------------------------------
>
> Key: HIVE-9168
> URL: https://issues.apache.org/jira/browse/HIVE-9168
> Project: Hive
> Issue Type: Bug
> Components: Vectorization
> Affects Versions: 0.13.0, 0.14.0
> Reporter: Matt McCline
> Assignee: Matt McCline
> Priority: Critical
> Fix For: 0.15.0, 0.14.1
>
> Attachments: HIVE-9168.01.patch
>
>
> Vectorized Coalesce uses BytesColumnVector.setElement which does not set the
> output string length correctly.
> {noformat}
> create table str_str_orc (str1 string, str2 string) stored as orc;
> insert into table str_str_orc values (null, "X"), ("0", "X"), ("1", "X"),
> (null, "y");
> EXPLAIN
> SELECT
> str2, ROUND(sum(cast(COALESCE(str1, 0) as int))/60, 2) as result
> from str_str_orc
> GROUP BY str2;
> SELECT
> str2, ROUND(sum(cast(COALESCE(str1, 0) as int))/60, 2) as result
> from str_str_orc
> GROUP BY str2;
> EXPLAIN
> SELECT COALESCE(str1, 0) as result
> from str_str_orc;
> SELECT COALESCE(str1, 0) as result
> from str_str_orc;
> {noformat}
> Produces different results when vectorized and not vectorized.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)