My humble experience: I would prefer specifying the files in
command line using -files option, then treat them explicitly in
the Mapper configure or setup function using
File f1 = new File("file1name");
File f2 = new File("file2name");
Cause I am not 100% sure how does distributed cached det
Hi,
The Java API offers a DistributedCache class which lets you do this.
The usage is detailed at
http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/filecache/DistributedCache.html
On Fri, May 4, 2012 at 5:11 PM, i...@christianherta.de
wrote:
> Hello,
> I have written a chain of
Hello,
I have written a chain of map-reduce jobs which creates a Mapfile. I want
to use the Mapfile in a proximate map-reduce job via distributed cache.
Therefore I have to create an archive file of the folder with holds the
/data and /index files.
In the documentation and in the Book "Hadoop the