Hi Hadoop user,

I have two map classes processing two different input files. Both map
functions have same key,value format to emit.

But Reducer called twice for same key , one for value from first map while
one for value from other map.

I am printing (key ,value) pairs in reducer  :
./container_1435326857837_0036_01_000102/stdout:Reduce:391:-1#11
./container_1435326857837_0036_01_000102/stdout:Reduce:391:3278620528725786624:5352454#-1
./container_1435326857837_0036_01_000102/stdout:Reduce:591:3278620528725852160:4194699#-1
./container_1435326857837_0036_01_000102/stdout:Reduce:591:-1#13
./container_1435326857837_0036_01_000102/stdout:Reduce:2391:-1#19
./container_1435326857837_0036_01_000102/stdout:Reduce:2391:3278620528725917696:5283986#-1
./container_1435326857837_0036_01_000102/stdout:Reduce:3291:3278620528725983232:4973087#-1

both maps emit Longwritable key and Text value.


Any idea why this is happening?
Is there any way to get hash values generated by hadoop for keys emitted by
mapper?

Thanks
Ravikant

Reply via email to