Re: InputSplits in Mapper

2010-06-05 Thread Torsten Curdt
> No, there isn't an API for that.
Bummer.
> The data is actually available in HDFS, but
> it is considered an internal format and in particular has changed
> substantially between 0.20 and 0.21/trunk.
Na ...I was after an API for this. Since I control the splits from a custom input format, I c

Re: InputSplits in Mapper

2010-06-05 Thread Torsten Curdt
Hey
> I don't know if there is a way to get them, but I believe you shouldn't need
> to do so. Each Mapper is created for a split it is supposed to work on and
> should not be aware of other splits - that is basically why MapReduce is
> such an effective pattern - each map and reduce task can be

Re: InputSplits in Mapper

2010-06-05 Thread Owen O'Malley
On Jun 5, 2010, at 11:15 AM, Torsten Curdt wrote:
> but is there a way to get a list of all InputSplits?
No, there isn't an API for that. The data is actually available in HDFS, but it is considered an internal format and in particular has changed substantially between 0.20 and 0.21/trunk.

Re: InputSplits in Mapper

2010-06-05 Thread Marcin Sieniek
Hi there, I don't know if there is a way to get them, but I believe you shouldn't need to do so. Each Mapper is created for a split it is supposed to work on and should not be aware of other splits - that is basically why MapReduce is such an effective pattern - each map and reduce task can b

InputSplits in Mapper

2010-06-05 Thread Torsten Curdt
I know I can get the current InputSplit inside a mapper with
    InputSplit split = context.getInputSplit();
but is there a way to get a list of all InputSplits?
cheers
--
Torsten