Gopal V created HIVE-7266: ----------------------------- Summary: Optimized HashTable with vectorized map-joins results in String columns extending Key: HIVE-7266 URL: https://issues.apache.org/jira/browse/HIVE-7266 Project: Hive Issue Type: Bug Components: Vectorization Affects Versions: 0.14.0 Reporter: Gopal V Assignee: Jitendra Nath Pandey
The following query returns different results when both vectorized mapjoin and the new optimized hashtable are enabled. {code} hive> set hive.vectorized.execution.enabled=false; hive> select s_suppkey, n_name from supplier, nation where s_nationkey = n_nationkey limit 25; ... 316869 JAPAN 1636869 RUSSIA 1096869 IRAN 7236869 RUSSIA 2276869 INDIA 8516869 ARGENTINA 2636869 MOZAMBIQUE 3836869 ROMANIA 2616869 FRANCE {code} But when vectorization is enabled, the results are {code} 316869 JAPAN 1636869 RUSSIA 1096869 IRANIA 7236869 RUSSIA 2276869 INDIAA 8516869 ARGENTINA 2636869 MOZAMBIQUE 3836869 ROMANIAQUE 2616869 FRANCEAQUE {code} When the new optimized map-join hashtable is disabled as well {code} hive> set hive.vectorized.execution.enabled=true; hive> set hive.mapjoin.optimized.hashtable=false; hive> select s_suppkey, n_name from supplier, nation where s_nationkey = n_nationkey limit 25; 316869 JAPAN 1636869 RUSSIA 1096869 IRAN 7236869 RUSSIA 2276869 INDIA 8516869 ARGENTINA 2636869 MOZAMBIQUE 3836869 ROMANIA 2616869 FRANCE {code} -- This message was sent by Atlassian JIRA (v6.2#6252)