Hi all,

My Reducers need to load a huge HashMap from data stored in HDFS. This data was partitioned by a previous map/reduce job. The complete data set does not fit into the main memory of a single Reducer machine, but loading only the matching partition would suffice. The problem is that the "correct" partition is determined by the Partitioner that feeds the current Reducers. I don't see how a Reducer can find out, in its configure() method, which partition it will receive from the Partitioner, i.e. which partition to load from HDFS into the HashMap.
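For concreteness, here is a sketch of what I'd like to do. I've seen the property name "mapred.task.partition" mentioned as carrying the reduce task's partition index in the old mapred API, but I'm not sure that's guaranteed, so treat it as an assumption; a plain Properties object stands in for the JobConf here, and the base path is made up:

```java
import java.util.Properties;

public class PartitionLookup {
    // In a real Reducer.configure(JobConf conf), the partition index would
    // hopefully come from conf.getInt("mapred.task.partition", -1)
    // (assumption, not verified); a Properties object stands in here.
    static int partitionOf(Properties conf) {
        return Integer.parseInt(conf.getProperty("mapred.task.partition", "-1"));
    }

    // Build the HDFS path of the side-data file written by the previous job
    // for this partition (part-00000, part-00001, ...). Base path is
    // illustrative only.
    static String sideDataPath(String base, int partition) {
        return String.format("%s/part-%05d", base, partition);
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        conf.setProperty("mapred.task.partition", "3");
        int p = partitionOf(conf);
        // Load only this partition's file into the HashMap in configure().
        System.out.println(sideDataPath("/user/juergen/lookup", p));
    }
}
```

If the partition index really is exposed this way, configure() could open just that one part file and fill the HashMap from it.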

Maybe someone has a good idea.

Regards,
Jürgen
