Hi,
I am trying to do an inner join on two tables, but the query runs for a very long time.
Tab1 - 100 GB
Tab2 - 2 GB (partitioned table, partitioned on source)
I tried the streamtable hint, but the job ran for about 3 hours. It looks like
only 1 reducer is doing the work.
I also tried a map join with increased memory, but it failed.
Please find the sample query below:
set hive.ignore.mapjoin.hint=false;
SET mapred.reduce.tasks=320;
create table ev_claim_claimline_pat_test as
select /*+ streamtable(c) */ c.*, p.col1, p.col2, p.col3
from Tab2 p
inner join Tab1 c
  on (trim(p.pid) = trim(c.p_id) and p.source = 'XYZ');
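For reference, the map join variant mentioned above would look roughly like the sketch below. The heap size shown is only an illustrative assumption, not the value actually used, and mapred.child.java.opts is assumed to be the relevant memory property on this cluster.

```sql
-- Sketch of the map join variant; Tab2 (the 2 GB table) is the one loaded into memory.
set hive.ignore.mapjoin.hint=false;
-- Illustrative heap size only (assumption, not the value from the failed run).
SET mapred.child.java.opts=-Xmx4096m;
create table ev_claim_claimline_pat_test as
select /*+ mapjoin(p) */ c.*, p.col1, p.col2, p.col3
from Tab2 p
inner join Tab1 c
  on (trim(p.pid) = trim(c.p_id) and p.source = 'XYZ');
```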
Can someone help me?
Thanks,
Karthik. B