Hi,

I have 6 node cluster, and the scenario is as follows :-

I have one map reduce job which will write file1 in HDFS.
I have another map reduce job which will write file2 in  HDFS.
In the third map reduce job I need to use file1 and file2 to do some
computation and output the value.

What is the best way to store file1 and file2 in HDFS so that they could be
used in third map reduce job.

Thanks,
Hitarth

Reply via email to