Do you guys have a copy of Tom White's Hadoop book available? There is an excellent example of a WholeFileInputFormat which definitly works with Hadoop-0.20.0. What do you mean by 'it doesn't seem to work'? Exceptions, unexpected output, ... ?
Regards, Thomas Am 11.03.2010 10:24, schrieb HypOo: > > I have the same problem, I need to asign a whole file per map but I don't > know how to do that. > I've tried to create a new WholeFileFormat.class and override the method > isSplitable() but it doesn't seems to work.. > Have you achieved to do this ? > I'm using hadoop 0.20.2 > > > > stolikp wrote: > > > > I've got some text files in my input directory and I want to pass each > > single text file (whole file not just a line) to a map (one file per one > > map). How can I do this ? TextInputFormat splits text into lines and I do > > not want this to happen. > > I tried: > > > http://hadoop.apache.org/common/docs/r0.20./streaming.html#How+do+I+process+files%2C+one+per+map%3F > > but it doesn't work for me, compiler doesn't know what > > NonSplitableTextInputFormat.class is. > > I'm using hadoop 0.20.1 > > > > -- > View this message in context: > http://old.nabble.com/Passing-whole-text-file-to-a-single-map-tp27287649p27860526.html > Sent from the Hadoop core-user mailing list archive at Nabble.com. >