Re: Calculations involve large datasets

Tim Wintle Fri, 22 Feb 2008 10:12:56 -0800

Have you seen Pig?
http://incubator.apache.org/pig/

It generates Hadoop code from a more query-like language and (as far as I
remember) includes union, join, etc.
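
If you would rather write the join by hand, the usual trick is a
reduce-side join: tag each record with the table it came from, key it on
the join column, and pair the records up in the reducer. Below is a
minimal sketch of that idea in plain Python. It is not Hadoop or Pig code,
the shuffle step is simulated in memory, and the "accounts" and "trades"
tables and all function names are made up for illustration.

```python
from collections import defaultdict

# Hypothetical inputs: (account_id, owner) and (account_id, amount).
accounts = [(1, "alice"), (2, "bob"), (3, "carol")]
trades = [(1, 100.0), (1, 250.0), (2, 75.5), (4, 10.0)]


def map_accounts(record):
    # Key on the join column and tag the value with its source table.
    account_id, owner = record
    yield account_id, ("account", owner)


def map_trades(record):
    account_id, amount = record
    yield account_id, ("trade", amount)


def shuffle(pairs):
    # Stand-in for the framework's shuffle/sort: group values by key.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_join(key, values):
    # Split the tagged values back into their source tables.
    owners = [v for tag, v in values if tag == "account"]
    amounts = [v for tag, v in values if tag == "trade"]
    # Inner join: one output row per (owner, amount) pair sharing the key.
    for owner in owners:
        for amount in amounts:
            yield key, owner, amount


def run_join(accounts, trades):
    pairs = []
    for record in accounts:
        pairs.extend(map_accounts(record))
    for record in trades:
        pairs.extend(map_trades(record))
    rows = []
    for key, values in sorted(shuffle(pairs).items()):
        rows.extend(reduce_join(key, values))
    return rows


joined = run_join(accounts, trades)
```

Keys that appear in only one table (account 3 and the trade for account 4
here) drop out, as in an SQL inner join. Since the join happens after the
shuffle, the input data can stay normalized.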

Tim

On Fri, 2008-02-22 at 09:13 -0800, Chuck Lan wrote:
> Hi,
> 
> I'm currently looking into how to better scale the performance of our
> calculations involving large sets of financial data.  We currently use
> a series of Oracle SQL statements to perform the calculations.  It seems to
> me that the MapReduce algorithm may work in this scenario.  However, I
> believe I would need to perform some denormalization of data in order for
> this to work.  Do I have to?  Or is there a good way to implement joins
> within the Hadoop framework efficiently?
> 
> Thanks,
> Chuck
