[ 
https://issues.apache.org/jira/browse/HIVE-8488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14175720#comment-14175720
 ] 

Gopal V commented on HIVE-8488:
-------------------------------

Let me re-emphasize that comment - hash code controls bucket map-joins and SMB 
joins today.

We need to fix this so that old inserted data which has {{CLUSTERED BY 
varchar_column}} can be queried against a newly inserted partition with the 
same schema.

Otherwise people will have to re-insert all existing data for JOINs to keep 
working correctly.

> hash() doesn't match between string and char/varchar
> ----------------------------------------------------
>
>                 Key: HIVE-8488
>                 URL: https://issues.apache.org/jira/browse/HIVE-8488
>             Project: Hive
>          Issue Type: Bug
>          Components: UDF
>            Reporter: Jason Dere
>            Assignee: Jason Dere
>         Attachments: HIVE-8488.1.patch
>
>
> {noformat}
> hive> select * from tab1;
> OK
> val_484       val_484 val_484
> hive> select hash(c1), hash(c2), hash(c3) from tab1;
> OK
> 230901778     1973712113      1973712113
> {noformat}
> This may throw off users expecting string/varchar/char types to be fairly 
> interchangeable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to