Announcement of the BSP (Bulk Synchronous Parallel) package on Hadoop

2009-12-11 Thread Edward J. Yoon
Hello communities, I'm happy to announce that we've developed a new computing model called BSP (Bulk Synchronous Parallel) on top of Hadoop. Here are the slides on the topic, "Apache HAMA: An Introduction to Bulk Synchronization Parallel on Hadoop". Download a PDF file:

Re: Which Hadoop product is more appropriate for a quick query on a large data set?

2009-12-11 Thread Todd Lipcon
Hi Xueling, One important question that can really change the answer: How often does the dataset change? Can the changes be merged in bulk every once in a while, or do you need to actually update them randomly very often? Also, how fast is quick? Do you mean 1 minute, 10 seconds, 1 second,