Why do people use number of buckets as two raised to the power?

2018-12-20 Thread Devender Yadav
Hi All, Is there any benefit of keeping the number of buckets as two raised to the power e.g. 2, 4, 8, 16, 32, etc? Regards, Devender NOTE: This message may contain information that is confidential, proprietary, privileged or otherwise protected by

hive 3.1 mapjoin with complex predicate produce incorrect results

2018-12-20 Thread Andrey Zinovyev
Hi, We stumbled on some weird behaviour of mapjoin in hive 3.1 Sample schema: > create table table_data(key int, a int); > insert into table_data values (1, 1), (2, 2), (1, 3), (2, 4), (3, 5); > create table table_dict(key int, b int); > insert into table_dict values (1, 42), (2, 43); Query: