Have you seen Pig? http://incubator.apache.org/pig/
It generates Hadoop code and is more query-like, and (as far as I remember) includes union, join, etc.

Tim

On Fri, 2008-02-22 at 09:13 -0800, Chuck Lan wrote:
> Hi,
>
> I'm currently looking into how to better scale the performance of our
> calculations involving large sets of financial data. It is currently using
> a series of Oracle SQL statements to perform the calculations. It seems to
> me that the MapReduce algorithm may work in this scenario. However, I
> believe I would need to perform some denormalization of data in order for this
> to work. Do I have to? Or is there a good way to implement joins within
> the Hadoop framework efficiently?
>
> Thanks,
> Chuck
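
On the join question itself: one common approach that avoids denormalizing up front is a reduce-side join, where each mapper tags its records with the table they came from and the reducer pairs them up per key. Below is a rough sketch in plain Python that only simulates the map, shuffle, and reduce phases in memory. It does not use any Hadoop or Pig API, and the table names (accounts, trades), the fields, and every function name are made up for illustration.

```python
from collections import defaultdict
from itertools import product


def map_accounts(record):
    """Map phase for the (hypothetical) accounts table: emit key and tagged value."""
    account_id, owner = record
    yield account_id, ("A", owner)


def map_trades(record):
    """Map phase for the (hypothetical) trades table: emit key and tagged value."""
    account_id, amount = record
    yield account_id, ("T", amount)


def shuffle(pairs):
    """Stand-in for the framework's shuffle: group all values by join key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_join(key, values):
    """Reduce phase: split values by source tag and pair them (inner join)."""
    owners = [v for tag, v in values if tag == "A"]
    amounts = [v for tag, v in values if tag == "T"]
    for owner, amount in product(owners, amounts):
        yield key, owner, amount


def run_join(accounts, trades):
    """Drive the three phases in memory; a real job would run them distributed."""
    pairs = []
    for record in accounts:
        pairs.extend(map_accounts(record))
    for record in trades:
        pairs.extend(map_trades(record))
    groups = shuffle(pairs)
    joined = []
    for key in sorted(groups):
        joined.extend(reduce_join(key, groups[key]))
    return joined


if __name__ == "__main__":
    accounts = [(1, "alice"), (2, "bob")]
    trades = [(1, 100.0), (1, 250.0), (3, 75.0)]
    for row in run_join(accounts, trades):
        print(row)
```

Keys that appear in only one input (account 2 and account 3 in the sample data) produce no output, so this behaves like an inner join. The reducer holds all values for a key in memory, which is fine for a sketch but would need care with heavily skewed keys.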