ngsg commented on PR #5543:
URL: https://github.com/apache/hive/pull/5543#issuecomment-2493241079

   @BsoBird
   I don't know the exact answer to your question, but in my experience, the 
NPE issue did not occur in the following situation:
   1. used `hash()` instead of `murmur_hash()`. Both of them appear as `hash` 
in query plan, but only `murmur_hash` causes the NPE issue.
   2. `hive.llap.io.enabled=true` & small inputs. I don't know the exact 
reason, but I have to disable LLAP IO to reproduce this issue using qtests with 
a small file.
   3. ORC file using direct encoding. If a string column uses direct encoding, 
then ORC reader always set `isRepeating` to false.
   4. length[i] == 0 for all 1<=i<=1023. Then `Murmur3.hash32` does not access 
`vector[][]` and NPE does not occur.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to