On Tue, Nov 1, 2011 at 12:52 AM, Joey Echeverria <j...@cloudera.com> wrote: > Yes, you can read the file in the configure() (old api) and setup() > (new api) methods. The data can be saved in a variable that will be > accessible to every call to map().
If I understood the question correctly, the situation is to keep a big HDFS file in memory for each mapper. I know that for small files, the DistributedCache is a solution. Bibek