subject:"RE\: Partitioning Reducer Output"

Re: Partitioning Reducer Output

2010-04-05 Thread Eric Sammer

On Mon, Apr 5, 2010 at 4:04 PM, rakesh kothari wrote: > Thanks for the insights. > > My use case is more around sending the reducer output to subdirectories > representing date partitions. > > For example if the base reducer output directory is /hdfs/root/reducer/ and > if there are two records e

RE: Partitioning Reducer Output

2010-04-05 Thread rakesh kothari

hoping if there is a workaround. Thanks, -Rakesh Date: Mon, 5 Apr 2010 08:45:13 -0700 From: erez_k...@yahoo.com Subject: Re: Partitioning Reducer Output To: mapreduce-user@hadoop.apache.org A partitioner can be used to control how keys are distributed across reducers (overriding the default hash(

Re: Partitioning Reducer Output

2010-04-05 Thread Erez Katz

on these files can be moved around to other folders... hope it helps, Erez Katz --- On Mon, 4/5/10, David Rosenstrauch wrote: From: David Rosenstrauch Subject: Re: Partitioning Reducer Output To: mapreduce-user@hadoop.apache.org Date: Monday, April 5, 2010, 7:35 AM On 04/02/2010 08:32 PM,

Re: Partitioning Reducer Output

2010-04-05 Thread David Rosenstrauch

On 04/02/2010 08:32 PM, rakesh kothari wrote: Hi, What's the best way to partition data generated from Reducer into multiple = directories in Hadoop 0.20.1. I was thinking of using MultipleTextOutputFor= mat but that's not backward compatible with other API's in this version of = hadoop. Thank