[
https://issues.apache.org/jira/browse/HIVE-8488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14175720#comment-14175720
]
Gopal V commented on HIVE-8488:
-------------------------------
Let me re-emphasize that comment - hash code controls bucket map-joins and SMB
joins today.
We need to fix this so that old inserted data which has {{CLUSTERED BY
varchar_column}} can be queried against a newly inserted partition with the
same schema.
Otherwise people will have to re-insert all existing data for JOINs to keep
working correctly.
> hash() doesn't match between string and char/varchar
> ----------------------------------------------------
>
> Key: HIVE-8488
> URL: https://issues.apache.org/jira/browse/HIVE-8488
> Project: Hive
> Issue Type: Bug
> Components: UDF
> Reporter: Jason Dere
> Assignee: Jason Dere
> Attachments: HIVE-8488.1.patch
>
>
> {noformat}
> hive> select * from tab1;
> OK
> val_484 val_484 val_484
> hive> select hash(c1), hash(c2), hash(c3) from tab1;
> OK
> 230901778 1973712113 1973712113
> {noformat}
> This may throw off users expecting string/varchar/char types to be fairly
> interchangeable.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)