What file format do you want to use ?

If it's Text or SequenceFile, or any other existing derivative of 
FileInputFormat, just override isSplittable and rely on the actual RecordReader.

Arun

On Aug 17, 2011, at 3:58 PM, Zhixuan Zhu wrote:

> I'm new Hadoop and currently using Hadoop 0.20.2 to try out some simple
> tasks. I'm trying to send each whole file of the input directory to the
> mapper without splitting them line by line. How should I set the input
> format class? I know I could derive a customized FileInputFormat class
> and override the isSplitable function. But I have no idea how to
> implement around the record reader. Any suggestion or a sample code will
> be greatly appreciated. 
> 
> Thanks in advance,
> Grace

Reply via email to