Map Side joins will use the CompositeInputFormat.  They will only really be
worth doing if one data set is small, and the other is large.

This is a good example :
http://www.congiu.com/joins-in-hadoop-using-compositeinputformat/

the trick is to google for CompositeInputFormat.compose() .... :)


On Thu, Jul 11, 2013 at 5:02 PM, Botelho, Andrew <andrew.bote...@emc.com>wrote:

> Hi,****
>
> ** **
>
> I want to perform a JOIN on two sets of data with Hadoop.  I read that the
> class CompositeInputFormat can be used to perform joins on data, but I
> can’t find any examples of how to do it.****
>
> Could someone help me out? It would be much appreciated. J****
>
> ** **
>
> Thanks in advance,****
>
> ** **
>
> Andrew****
>



-- 
Jay Vyas
http://jayunit100.blogspot.com

Reply via email to