Hi Divya

Below are some quick tips that usually help:

1. Partition your tables and filter on the partition keys when selecting
data, so that Hive reads only the partitions it needs instead of the
whole data set.

2. Also, if both tables are partitioned on the same key, include that key
in the join condition.

3. If one of the tables being joined is small, you can use the MAPJOIN
hint, which loads the small table into memory and does the join on the
map side.
Example:
SELECT /*+ MAPJOIN(b) */ a.key, a.value
FROM a JOIN b ON a.key = b.key
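
To make the three tips concrete, here is a rough sketch. The table and
column names (orders, payments, dt, order_id) are made up for illustration
and are not from your schema, so adjust them to your own tables. The two
SET properties at the end are standard Hive settings, but their defaults
differ between Hive versions, so check what your cluster uses.

```sql
-- Hypothetical tables; names and columns are assumptions for illustration.
CREATE TABLE orders (order_id BIGINT, customer_id BIGINT, amount DOUBLE)
PARTITIONED BY (dt STRING);

CREATE TABLE payments (order_id BIGINT, paid DOUBLE)
PARTITIONED BY (dt STRING);

-- Tip 1: filter on the partition column so only that partition is scanned.
SELECT order_id, amount
FROM orders
WHERE dt = '2016-01-18';

-- Tip 2: both tables are partitioned by dt, so restrict each side to the
-- same partition and keep the partition key in the join condition.
SELECT o.order_id, o.amount, p.paid
FROM orders o
JOIN payments p
  ON o.dt = p.dt AND o.order_id = p.order_id
WHERE o.dt = '2016-01-18'
  AND p.dt = '2016-01-18';

-- Tip 3 (alternative to the hint): let Hive convert joins against small
-- tables into map joins on its own. The size threshold is in bytes.
SET hive.auto.convert.join=true;
SET hive.mapjoin.smalltable.filesize=25000000;
```

Running EXPLAIN on the query is a quick way to check whether Hive actually
planned a map join and which partitions it intends to read.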

Regards
Richa

On Mon, Jan 18, 2016 at 7:07 PM, Divya Gehlot <divya.htco...@gmail.com>
wrote:

> Hi,
> Need tips/guidance to optimize (increase performance of) joins on
> billions of rows in Hive.
>
> Any help would be appreciated.
>
>
> Thanks,
> Divya
>
