how to skip a mapper

2012-09-10 Thread Anit Alexander
Hello list, Is it possible to start the mapper from a particular byte location in a file which is in hdfs? Regards, Anit

Re: how to skip a mapper

2012-09-10 Thread Harsh J
Anit, Yes this is possible (and actually does happen in regular MR scenario anyway - when the input is split across several locations). You'll need a custom InputFormat#getSplits implementation to do this (create input splits with the first offset itself set to the known offset location, instead