Hi, If I have big gzip files (>> block size), will M/R split a single file into multiple blocks and send them to different mappers? The behavior I currently see is that one map is still opened per file (and not per block).
I would also appreciate it if you could share your experience in choosing a block size (relative to the HDFS size and to the job's processing size). Thanks, Lior