Use DistributedCache.addCacheFile() when you're setting up the JobConf, and
DistributedCache.getLocalCacheFiles() in your map or reduce methods.
There is a simple example here:
http://hadoop.apache.org/core/docs/r0.15.3/mapred_tutorial.html
Have a look at the last example!
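
In case it helps, here is a minimal sketch of how the two calls fit together. It is not taken from the tutorial. The Hadoop calls appear only as comments, because they need the Hadoop jars on the classpath. The runnable part is a plain-Java helper that reads a side file from a local path, which is what you would do with a path returned by getLocalCacheFiles(). SideFileLoader, parsePatterns, loadPatterns and the path /user/me/patterns.txt are hypothetical names made up for illustration. I believe the tutorial's last example fetches the local files in configure(JobConf) rather than in map() itself, but check the tutorial to be sure.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper for illustration; it is not part of Hadoop.
class SideFileLoader {

    // Job setup side (needs the Hadoop jars, so shown as a comment only;
    // the HDFS path is an assumed example):
    //   DistributedCache.addCacheFile(new URI("/user/me/patterns.txt"), conf);
    //
    // Task side, for instance in configure(JobConf job):
    //   org.apache.hadoop.fs.Path[] cached = DistributedCache.getLocalCacheFiles(job);
    //   Set<String> patterns = SideFileLoader.loadPatterns(
    //       java.nio.file.Paths.get(cached[0].toString()));

    // Keeps each non-blank line once, trimmed, in first-seen order.
    static Set<String> parsePatterns(Iterable<String> lines) {
        Set<String> patterns = new LinkedHashSet<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (!trimmed.isEmpty()) {
                patterns.add(trimmed);
            }
        }
        return patterns;
    }

    // Reads a local file (UTF-8) and parses it with parsePatterns.
    static Set<String> loadPatterns(Path localFile) {
        try {
            return parsePatterns(Files.readAllLines(localFile));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Loading the file once per task and keeping the result in a field avoids re-reading it for every record.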
Cheers,
Pi
jerrro wrote
You can either pack the files with your job jar or, if the files are
large, use the distributed cache. See http://wiki.apache.org/hadoop/FAQ#8.
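
For the first option, a rough sketch of reading a file that was packed into the job jar might look like the following. It assumes the jar contents are on the task's classpath, so the file can be opened as a classpath resource. BundledFileReader and readLines are hypothetical names, not Hadoop APIs.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; it is not part of Hadoop.
class BundledFileReader {

    // Returns the lines of a classpath resource (such as a file packed into
    // the job jar), or null if no resource with that name is found.
    static List<String> readLines(String resourceName) {
        InputStream in = BundledFileReader.class.getClassLoader()
                .getResourceAsStream(resourceName);
        if (in == null) {
            return null;
        }
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            List<String> lines = new ArrayList<>();
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
            return lines;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

This keeps everything in one jar, which is convenient for small files. For large ones the jar gets heavy, which is where the distributed cache comes in.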
Hairong
-----Original Message-----
From: jerrro [mailto:[EMAIL PROTECTED]]
Sent: Thursday, January 24, 2008 8:06 AM
To: [EMAIL PROTECTED]
Subject: hado
Hi Jerrro,
take a look at
http://hadoop.apache.org/core/docs/r0.15.3/mapred_tutorial.html#DistributedCache
The DistributedCache looks like what you are searching for. I think the
interesting part is the example
http://hadoop.apache.org/core/docs/r0.15.3/mapred_tutorial.html#Example%3A+WordCoun