On Sep 27, 2011, at 12:09 PM, Kevin Burton wrote:

> Is it possible to connect the output of one map reduce job so that it is the
> input to another map reduce job.
>
> Basically… then reduce() outputs a key, that will be passed to another map()
> function without having to store intermediate data to the filesystem.
>

Currently there is no way to pipeline jobs in such a manner. With hadoop-0.23 it's doable, but it will take more effort.

Arun