Re: problem when using combiner and MultipleOutputFormat

2011-10-28 Thread Harsh J
Sent: Friday, October 28, 2011 2:43 PM > To: mapreduce-user@hadoop.apache.org > Subject: RE: problem when using combiner and MultipleOutputFormat > > OK, I will prepare my code to show how it works. > > Here another question, my combiner DOES output some records, if it the rea

RE: problem when using combiner and MultipleOutputFormat

2011-10-28 Thread Xin Jing
of the issue, thanks for your help, Harsh. From: Xin Jing [xinj...@beyondfun.net] Sent: Friday, October 28, 2011 2:43 PM To: mapreduce-user@hadoop.apache.org Subject: RE: problem when using combiner and MultipleOutputFormat OK, I will prepare my code to

Re: problem when using combiner and MultipleOutputFormat

2011-10-27 Thread Harsh J
iner code, I am > using output.collect() to output my result, do I misuse it? > > From: Harsh J [ha...@cloudera.com] > Sent: Friday, October 28, 2011 2:11 PM > To: mapreduce-user@hadoop.apache.org > Subject: Re: problem when using combiner a

RE: problem when using combiner and MultipleOutputFormat

2011-10-27 Thread Xin Jing
ubject: Re: problem when using combiner and MultipleOutputFormat Xin, You probably just need to write a special Combiner class instead of reusing your Reducer class for combiner purposes. In an MR job, you need to specifically guarantee that the combiner outputs the same type of K-V pairs as the

Re: problem when using combiner and MultipleOutputFormat

2011-10-27 Thread Harsh J
Xin, You probably just need to write a special Combiner class instead of reusing your Reducer class for combiner purposes. In an MR job, you need to specifically guarantee that the combiner outputs the same type of K-V pairs as the reducer's input. Do not output to files directly from your combine