some speed wrote:
I was wondering if it was possible to read the input for a map function from
2 different files:
1st file ---> user-input file from a particular location(path)
2nd file=---> A resultant file (has just one <key,value> pair) from a
previous MapReduce job. (I am implementing a chain MapReduce function)
Now, for every <key,value> pair in the user-input file, I would like to use
the same <key,value> pair from the 2nd file for some calculations.
I think you can use DistributedCache for distributing your second file
among maps.
Please see more documentation at
http://hadoop.apache.org/core/docs/current/mapred_tutorial.html#DistributedCache
Thanks
Amareshwari
Is it possible for me to do so? Can someone guide me in the right direction
please?
Thanks!