Re: Parallelizing HTTP calls with MapReduce

2010-03-07 Thread Phil McCarthy
Mar 7, 2010 at 8:28 AM, Kay Kay wrote:
> On 03/06/2010 09:29 AM, Phil McCarthy wrote:
>>
>> Hi,
>>
>> I'm new to Hadoop, and I'm trying to figure out the best way to use it
>> with EC2 to make large number of calls to a web API,
>
> Consi

Parallelizing HTTP calls with MapReduce

2010-03-06 Thread Phil McCarthy
MapReduce to parallelize the process. The calls will be regular HTTP requests, and the URLs follow a known format, so they can be generated easily. This seems like it'd be a pretty common type of task, so apologies if I've missed something obvious in the docs etc.

Cheers,
Phil McCarthy
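
For anyone reading this thread later, one way the task described above might be approached is a map-only job using Hadoop Streaming, where each mapper reads URLs from standard input (one per line) and writes one tab-separated record per URL to standard output. The following is only a rough sketch under assumptions, not something proposed in the thread: the URL template, the `{id}` placeholder, and the function names `generate_urls`, `fetch`, and `map_lines` are all hypothetical, and retries and rate limiting for the web API are left out.

```python
import sys
import urllib.request
from typing import Callable, Iterable, Iterator


def generate_urls(template: str, ids: Iterable[int]) -> Iterator[str]:
    """Build the job's input: one URL per id (the template is hypothetical)."""
    for i in ids:
        yield template.format(id=i)


def fetch(url: str, timeout: float = 10.0) -> str:
    """Make a plain HTTP GET and return the response body as text."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def map_lines(lines: Iterable[str],
              fetcher: Callable[[str], str] = fetch) -> Iterator[str]:
    """Mapper logic: for each input URL, emit 'url<TAB>status<TAB>body'."""
    for line in lines:
        url = line.strip()
        if not url:
            continue  # skip blank input lines
        try:
            body = fetcher(url)
            status = "ok"
        except Exception as exc:  # one failed call should not kill the task
            body = str(exc)
            status = "error"
        # Collapse whitespace so each record stays on one tab-separated line.
        body = " ".join(body.split())
        yield f"{url}\t{status}\t{body}"


def main() -> None:
    """Entry point for a streaming mapper: stdin in, stdout out."""
    for record in map_lines(sys.stdin):
        print(record)
```

A mapper script built from this would end with a call to `main()`, and the input file could be produced once up front by writing the output of `generate_urls` to a text file. Since no aggregation is needed, the job would presumably run with zero reducers. Note that how many URLs each mapper receives depends on how the input is split, so a small input file may end up on a single mapper unless the input format is chosen with that in mind.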