I guess you need to define a custom Partitioner to send different key ranges to different reducers (sorry, I am still using the old API, so there is probably something newer in the trunk release). Basically you segment the keys into zones: 0-10, 11-20, and so on.

Maybe check each key's hashCode() and see how to map it into one of these zones?
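Something like the sketch below might work (old org.apache.hadoop.mapred API; the key/value types, the class name, and the hard-coded MAX_KEY are my assumptions, sized to match the 15-key example in your mail, not anything built into Hadoop):

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.Partitioner;

public class RangePartitioner implements Partitioner<IntWritable, Text> {

    // Assumed largest key value; with 3 reducers this reproduces the
    // part00 = 1..5, part01 = 6..10, part02 = 11..15 layout you asked for.
    private static final int MAX_KEY = 15;

    public void configure(JobConf job) {
        // Nothing to configure in this sketch; MAX_KEY could be read
        // from the JobConf instead of being hard-coded.
    }

    public int getPartition(IntWritable key, Text value, int numPartitions) {
        // Contiguous key ranges go to consecutive reducers. Each reducer
        // sorts its own zone, so concatenating part00, part01, ... gives
        // one globally sorted sequence.
        int partition = (key.get() - 1) * numPartitions / MAX_KEY;
        // Clamp in case a key falls outside the assumed range.
        return Math.min(Math.max(partition, 0), numPartitions - 1);
    }
}

Register it on the JobConf with conf.setPartitionerClass(RangePartitioner.class). Hadoop also ships a TotalOrderPartitioner (used together with InputSampler) that samples the keys and computes these range boundaries for you instead of hard-coding them.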

Shi


On 3/22/2011 9:24 AM, JunYoung Kim wrote:
hi,

I run almost 60 reduce tasks for a single job.

The outputs of the job are part00 through part59.

Is there a way to write the rows sequentially, by sorted keys, across all of them?

Currently my outputs look like this:

part00)
1
10
12
14

part01)
2
4
6
11
13

part 02)
3
5
7
8
9

But my aim is to get the following results:

part00)
1
2
3
4
5

part01)
6
7
8
9
10

part02)
11
12
13
14
15

Is Hadoop able to support this kind of thing?

thanks
