Map Side joins will use the CompositeInputFormat. They will only really be worth doing if one data set is small, and the other is large.
This is a good example : http://www.congiu.com/joins-in-hadoop-using-compositeinputformat/ the trick is to google for CompositeInputFormat.compose() .... :) On Thu, Jul 11, 2013 at 5:02 PM, Botelho, Andrew <andrew.bote...@emc.com>wrote: > Hi,**** > > ** ** > > I want to perform a JOIN on two sets of data with Hadoop. I read that the > class CompositeInputFormat can be used to perform joins on data, but I > can’t find any examples of how to do it.**** > > Could someone help me out? It would be much appreciated. J**** > > ** ** > > Thanks in advance,**** > > ** ** > > Andrew**** > -- Jay Vyas http://jayunit100.blogspot.com