Great, thanks for that hint - for some reason I expected that behavior to be a feature of the MultipleTextOutputFormat class - doing so solved my problem! Thanks!!

Here my code (I wanted to specifically omit outputting the key however still having a file per key) if anyone is interested:

.. in my ToolRunner:

conf.setOutputFormat(KeyBasedMultipleTextOutputFormat.class);


...

static class KeyBasedMultipleTextOutputFormat extends MultipleTextOutputFormat<Text, Text> {
                
        @Override
        /** Ensures that no key is written to the file */
        protected Text generateActualKey(Text key, Text value) {
              return null;
        }
        /** Generates a filename that reflects the value of the key */
protected String generateFileNameForKeyValue(Text key, Text v, String name) {
        
              return key.toString().substring(0, key.toString().length());
        }
}


~Florian

On Oct 29, 2008, at 12:12 AM, Mice wrote:

Did you override generateFileNameForKeyValue?

2008/10/29 Florian Leibert <[EMAIL PROTECTED]>:
Thanks Mice,
tried using that already - however this doesn't yield the desired results - upon output collection (using the OutputCollector), it still produces only one output file (note, I only have one input file, not multiple input files,
but want a file per key for the output...)

Thanks

Code Fragment:
jobConfig.setOutputFormat(MultipleTextOutputFormat.class);


On Oct 28, 2008, at 9:09 PM, Mice wrote:

MultipleOutputFormat meets your need. It is in 0.18.1.

2008/10/29 Florian Leibert <[EMAIL PROTECTED]>:

Hi,
for convenience reasons, I was wondering if there is a simple way to
produce
one output file per key in the Reducer?

Thanks,
Florian




Reply via email to