What file format do you want to use ? If it's Text or SequenceFile, or any other existing derivative of FileInputFormat, just override isSplittable and rely on the actual RecordReader.
Arun On Aug 17, 2011, at 3:58 PM, Zhixuan Zhu wrote: > I'm new Hadoop and currently using Hadoop 0.20.2 to try out some simple > tasks. I'm trying to send each whole file of the input directory to the > mapper without splitting them line by line. How should I set the input > format class? I know I could derive a customized FileInputFormat class > and override the isSplitable function. But I have no idea how to > implement around the record reader. Any suggestion or a sample code will > be greatly appreciated. > > Thanks in advance, > Grace