Re: MultipleOutputs or Partitioner

2010-05-10 Thread Some Body
, Iterable values, Context context) {
    mos.write("txt", getSubKey(key,0), new FloatWritable(average.floatValue()), getSubKey(key,2)+".txt");
}

Alan

- original message
Subject: Re: MultipleOutputs or Partit
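
The snippet above names each output file from part of the reducer key, but the thread never shows getSubKey itself. The following is only a sketch of what such a helper might look like, assuming keys are '_'-separated fields (host, variable, date) as in the original question. The class name SubKeyDemo, the method outputFileName, and the sample key are hypothetical and are not taken from the poster's code.

```java
// Hypothetical sketch: the thread does not show getSubKey, so this assumes
// keys are '_'-separated fields such as host, variable, date.
final class SubKeyDemo {

    // Returns the index-th '_'-separated field of the key (0-based).
    static String getSubKey(String key, int index) {
        String[] fields = key.split("_");
        if (index < 0 || index >= fields.length) {
            throw new IllegalArgumentException(
                "no field " + index + " in key: " + key);
        }
        return fields[index];
    }

    // Builds the per-record output name the same way the snippet does:
    // third field of the key plus a ".txt" suffix.
    static String outputFileName(String key) {
        return getSubKey(key, 2) + ".txt";
    }

    public static void main(String[] args) {
        String key = "hostA_VarX_2010-05-01"; // illustrative key only
        System.out.println(getSubKey(key, 0) + " -> " + outputFileName(key));
    }
}
```

With a helper of this shape, every record sharing the same third field would be written under the same base name, regardless of which reducer handles it.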

Re: MultipleOutputs or Partitioner

2010-05-10 Thread Alex Kozlov
Hi Alan,

On Mon, May 10, 2010 at 5:08 AM, Some Body wrote:
> Hi,
>
> I'm trying to understand how to generate multiple outputs in my reducer
> (using 0.20.2+228).
> Do I need MultipleOutput or should I partition my output in the mapper?
>
>

The question is scalability. If you are OK with runnin

Re: MultipleOutputs or Partitioner

2010-05-10 Thread Sonal Goyal
Hi Alan,

You can use MultipleOutputFormat. You can override the generateFileName... methods to get the functionality you want.

A partitioner controls how data moves from the mapper to the reducer, so if you take that approach, you will have to specify the number of reducers as the number of files
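
To make the partitioner option concrete, here is a rough, Hadoop-free sketch of the routing decision a custom partitioner would make. It mirrors the shape of Hadoop's Partitioner.getPartition(key, value, numPartitions) but is a plain static method so it can run on its own. The class name FieldPartitionDemo, the choice of the third '_'-separated field, and the sample keys are assumptions for illustration, not anything stated in the thread.

```java
// Hypothetical sketch of partitioning on one field of the key.
// In a real job this logic would live in a subclass of Hadoop's Partitioner;
// it is a plain static method here so that it runs without Hadoop.
final class FieldPartitionDemo {

    // Picks a partition (reducer index) from the third '_'-separated field,
    // using the usual non-negative-hash-modulo scheme.
    static int getPartition(String key, int numPartitions) {
        String[] fields = key.split("_");
        if (fields.length < 3) {
            throw new IllegalArgumentException("expected 3+ fields: " + key);
        }
        return (fields[2].hashCode() & Integer.MAX_VALUE) % numPartitions;
    }

    public static void main(String[] args) {
        int reducers = 4; // assumed reducer count
        String[] keys = {
            "hostA_VarX_2010-05-01", // illustrative keys only
            "hostB_VarY_2010-05-01",
            "hostA_VarX_2010-05-02",
        };
        for (String key : keys) {
            System.out.println(key + " -> reducer " + getPartition(key, reducers));
        }
    }
}
```

Keys that share the chosen field always reach the same reducer, and therefore the same part file. Hashing alone does not guarantee one file per distinct value, though: two different values can collide on the same partition, so an exact one-to-one layout would need an explicit value-to-partition mapping and a reducer count equal to the number of files.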

MultipleOutputs or Partitioner

2010-05-10 Thread Some Body
Hi,

I'm trying to understand how to generate multiple outputs in my reducer (using 0.20.2+228).
Do I need MultipleOutput or should I partition my output in the mapper?

My reducer currently gets key/val input pairs like this which all end up in my part_r_ file.

hostA_VarX_2010-05-01_mor