mahesh kumar behera created HIVE-25142:
------------------------------------------

             Summary: Rehashing in map join fast hash table  causing corruption 
for large keys
                 Key: HIVE-25142
                 URL: https://issues.apache.org/jira/browse/HIVE-25142
             Project: Hive
          Issue Type: Bug
          Components: Hive
            Reporter: mahesh kumar behera
            Assignee: mahesh kumar behera


In map join the hash table is created using the keys. To support rehashing, the 
keys are stored in write buffer. The hash table contains the offset of the keys 
along with the hash code. When rehashing is done, the offset is extracted from 
the hash table and then hash code is generated again. For large keys of size 
greater than 255, the key length is also stored along with the key. In case of 
fast hash table implementation the way key is extracted is not proper. There is 
a code bug and thats causing the wrong key to be extracted and causing wrong 
hash code generation. This is causing the corruption in the hash table.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to