Hi Hadoop user, I have two map classes processing two different input files. Both map functions have same key,value format to emit.
But Reducer called twice for same key , one for value from first map while one for value from other map. I am printing (key ,value) pairs in reducer : ./container_1435326857837_0036_01_000102/stdout:Reduce:391:-1#11 ./container_1435326857837_0036_01_000102/stdout:Reduce:391:3278620528725786624:5352454#-1 ./container_1435326857837_0036_01_000102/stdout:Reduce:591:3278620528725852160:4194699#-1 ./container_1435326857837_0036_01_000102/stdout:Reduce:591:-1#13 ./container_1435326857837_0036_01_000102/stdout:Reduce:2391:-1#19 ./container_1435326857837_0036_01_000102/stdout:Reduce:2391:3278620528725917696:5283986#-1 ./container_1435326857837_0036_01_000102/stdout:Reduce:3291:3278620528725983232:4973087#-1 both maps emit Longwritable key and Text value. Any idea why this is happening? Is there any way to get hash values generated by hadoop for keys emitted by mapper? Thanks Ravikant