[ https://issues.apache.org/jira/browse/HIVE-8488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14175720#comment-14175720 ]
Gopal V commented on HIVE-8488:
-------------------------------

Let me re-emphasize that comment - hash code controls bucket map-joins and SMB joins today. We need to fix this so that old inserted data which has {{CLUSTERED BY varchar_column}} can be queried against a newly inserted partition with the same schema. Otherwise people will have to re-insert all existing data for JOINs to keep working correctly.

> hash() doesn't match between string and char/varchar
> ----------------------------------------------------
>
>                 Key: HIVE-8488
>                 URL: https://issues.apache.org/jira/browse/HIVE-8488
>             Project: Hive
>          Issue Type: Bug
>          Components: UDF
>            Reporter: Jason Dere
>            Assignee: Jason Dere
>         Attachments: HIVE-8488.1.patch
>
>
> {noformat}
> hive> select * from tab1;
> OK
> val_484 val_484 val_484
> hive> select hash(c1), hash(c2), hash(c3) from tab1;
> OK
> 230901778 1973712113 1973712113
> {noformat}
> This may throw off users expecting string/varchar/char types to be fairly
> interchangeable.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
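
To make the bucketing concern concrete, here is a minimal sketch of how two different hash codes for the same logical value can land rows in different buckets. It assumes the bucket number is derived as the non-negative hash code modulo the bucket count; that rule, the class name {{BucketSketch}}, the helper {{bucketFor}}, and the bucket count of 4 are illustrative assumptions, not taken from the issue. Only the two hash values come from the {{hash(c1)}} and {{hash(c2)}}/{{hash(c3)}} output quoted above.

```java
public class BucketSketch {
    // Assumed bucketing rule (not confirmed by the issue text):
    // clear the sign bit, then take the remainder by the bucket count.
    static int bucketFor(int hashCode, int numBuckets) {
        return (hashCode & Integer.MAX_VALUE) % numBuckets;
    }

    public static void main(String[] args) {
        // Hash values reported in the issue for the same value "val_484".
        int hashC1 = 230901778;        // hash(c1)
        int hashC2 = 1973712113;       // hash(c2) and hash(c3)

        // Hypothetical bucket count, chosen only for illustration.
        int numBuckets = 4;

        int bucketC1 = bucketFor(hashC1, numBuckets);
        int bucketC2 = bucketFor(hashC2, numBuckets);

        System.out.println("bucket for hash(c1): " + bucketC1);
        System.out.println("bucket for hash(c2): " + bucketC2);
        System.out.println("same bucket: " + (bucketC1 == bucketC2));
    }
}
```

Under these assumptions the two hash codes map to different buckets, so a join that only compares matching buckets would miss rows whose keys are equal as strings. The same reasoning applies in reverse if the hash function changes: data bucketed with the old hash would no longer line up with newly inserted data.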