Re: Multiple Output Formats

2011-07-27 Thread Alejandro Abdelnur
Roger, Or you can take a look at Hadoop's MultipleOutputs class. Thanks. Alejandro On Tue, Jul 26, 2011 at 11:30 PM, Luca Pireddu wrote: > On July 26, 2011 06:11:33 PM Roger Chen wrote: > > Hi all, > > > > I am attempting to implement MultipleOutputFormat to write data to multiple > > files

Re: Multiple Output Formats

2011-07-26 Thread Luca Pireddu
On July 26, 2011 06:11:33 PM Roger Chen wrote: > Hi all, > > I am attempting to implement MultipleOutputFormat to write data to multiple > files dependent on the output keys and values. Can somebody provide a > working example with how to implement this in Hadoop 0.20.2? > > Thanks! Hello, I ha

Re: Multiple Output Formats

2011-07-26 Thread Harsh J
Gotcha, my bad then. The Hadoop distribution I use provides a backported MO, so I overlooked this particular issue while replying. Still, the warning holds as the versions roll ahead. But I believe the refactor would not be that much of a pain, so perhaps it's a no-worry. On Wed, Jul 27, 201

Re: Multiple Output Formats

2011-07-26 Thread Roger Chen
The problem I'm facing right now is with the configuration needed for MultipleOutputs, because JobConf is deprecated now and I am unable to do the equivalent with Configuration. I set the configuration of the job with: Job job = new Job(getConf()); but when I try to use this line in my config

Re: Multiple Output Formats

2011-07-26 Thread Harsh J
Roger, Beyond Ayon's example answer, I'd like you to note that the newer API will *not* carry a supported MultipleOutputFormat, as it has been made obsolete in favor of MultipleOutputs, which is much easier to use, is thread-safe, and also carries an example to look at, at [1]. [1] - http://hadoop.a

Re: Multiple Output Formats

2011-07-26 Thread Ayon Sinha
t: Tuesday, July 26, 2011 9:11 AM Subject: Multiple Output Formats Hi all, I am attempting to implement MultipleOutputFormat to write data to multiple files dependent on the output keys and values. Can somebody provide a working example with how to implement this in Hadoop 0.20.2? Thanks! -- Rog

Multiple Output Formats

2011-07-26 Thread Roger Chen
Hi all, I am attempting to implement MultipleOutputFormat to write data to multiple files dependent on the output keys and values. Can somebody provide a working example with how to implement this in Hadoop 0.20.2? Thanks! -- Roger Chen UC Davis Genome Center
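
For readers skimming the thread, here is a minimal sketch of the idea being asked about: choosing the output file from each record's key. It is plain Java with no Hadoop classes, so it is not the MultipleOutputFormat or MultipleOutputs API itself. The class name KeyedFileWriter, the fileNameFor rule, and the sample keys are all hypothetical. In the old org.apache.hadoop.mapred API, the corresponding hook is, as far as I know, generateFileNameForKeyValue on MultipleTextOutputFormat; please check that against the javadoc for your release.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.Collections;

// Plain-Java illustration only; no Hadoop classes are used here.
public class KeyedFileWriter {

    // Hypothetical naming rule: one file per key, unsafe characters replaced by '_'.
    static String fileNameFor(String key) {
        return key.replaceAll("[^A-Za-z0-9_-]", "_") + ".txt";
    }

    // Appends the value as one line to the file selected by the record's key.
    static void write(Path dir, String key, String value) throws IOException {
        Path target = dir.resolve(fileNameFor(key));
        Files.write(target, Collections.singletonList(value), StandardCharsets.UTF_8,
                StandardOpenOption.CREATE, StandardOpenOption.APPEND);
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("keyed-output");
        write(dir, "chr1", "read-a");
        write(dir, "chr2", "read-b");
        write(dir, "chr1", "read-c"); // lands in the same file as read-a
        System.out.println("Wrote per-key files under " + dir);
    }
}
```

In a real job the framework, not user code, opens and closes the files, so only the naming rule would carry over.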