Hi Divya Below are some quick tips that always helps:
1. Partition your data set and use partition keys while selecting data to reduce data set. 2.Also, if both data sets can be joined by the same partition key then use it in the join. 3. If one table being joined is a small table then you can use MAPJOIN Hint Example: SELECT /*+ MAPJOIN(b) */ a.key, a.value FROM a JOIN b ON a.key = b.key Regards Richa On Mon, Jan 18, 2016 at 7:07 PM, Divya Gehlot <divya.htco...@gmail.com> wrote: > Hi, > Need tips/guidance to optimize(increase perfomance) billion data rows > joins in hive . > > Any help would be appreciated. > > > Thanks, > Divya >