Hi all:

We found that when a join uses keys of two different types, Hive converts
all of the join keys to double.

Consider this simple query:

explain
select *
from table_a a
join table_b b on a.id = b.id;

If a.id is of type int while b.id is of type string, Hive converts both
a.id and b.id to double. When this conversion occurs, the map join becomes
very slow.

A simple workaround is to disable auto join.
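
For reference, the session setting we mean is sketched below. It assumes
the standard Hive property hive.auto.convert.join, which controls whether
common joins are automatically converted to map joins:

```sql
-- Turn off automatic conversion of common joins to map joins
-- for the current session only.
set hive.auto.convert.join=false;
```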

Does anyone know how to solve this more effectively?
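
One direction we have not verified is to cast one side explicitly so that
both keys have the same type. The query below is only a sketch. It assumes
the values in b.id are canonical integer strings (no leading zeros,
whitespace, or decimal points), because otherwise a string comparison can
give different results from the double comparison:

```sql
-- Untested sketch: cast the int key to string so that both join keys
-- are strings and no implicit conversion to double should be needed.
-- Assumes b.id holds canonical integer text such as '42',
-- not '042' or '42.0'.
explain
select *
from table_a a
join table_b b on cast(a.id as string) = b.id;
```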

My Hive version: 1.1.0+cdh5.4.7+233

Zhiwen Sun
