Hi,
If I have big gzip files (much larger than the block size), will M/R split a
single file into multiple blocks and send them to different mappers?
The behavior I currently see is that one map is still opened per file (and not
per block).

I would also appreciate it if you could share your experience in choosing a
block size (compared to HDFS size and to job processing size).


Thanks,
Lior
