ar 7, 2010 at 8:28 AM, Kay Kay wrote:
> On 03/06/2010 09:29 AM, Phil McCarthy wrote:
>>
>> Hi,
>>
>> I'm new to Hadoop, and I'm trying to figure out the best way to use it
>> with EC2 to make large number of calls to a web API,
>
> Consi
MapReduce to
parallelize the process. The calls will be regular HTTP requests, and
the URLs follow a known format, so can be generated easily.
This seems like it'd be a pretty common type of task, so apologies if
I've missed something obvious in the docs etc.
Cheers,
Phil McCarthy