I require each input file to be processed by each mapper as a whole.

I subclass c.o.a.h.mapreduce.lib.input.TextInputFormat and override
isSplitable() to invariably return false.

The job is configured to use this subclass as the input format class via
setInputFormatClass(). The job runs without error, yet the logs reveal
files are still processed line by line by the mappers.

Any help would be greatly appreciated,
Thanks

Reply via email to