Re: InputSplit is confusing me .. Any clarifications ??

2010-11-28 Thread Steve Lewis
Override TextInputFormat and have protected boolean isSplitable(FileSystem fs, Path filename)
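To illustrate the idea behind that override, here is a minimal plain-Java sketch. It is not Hadoop code: `SketchFormat`, `WholeFileFormat`, `Range`, and the `getSplits(filename, fileLength, splitSize)` signature are hypothetical stand-ins, and the model ignores block locations and every other detail of the real FileInputFormat. It only shows how a format that answers "not splitable" ends up handing the whole file to a single mapper.

```java
import java.util.ArrayList;
import java.util.List;

public class SplitSketch {
    /** A byte range of one file, standing in for an input split (hypothetical type). */
    public static class Range {
        public final long start;
        public final long length;

        public Range(long start, long length) {
            this.start = start;
            this.length = length;
        }

        @Override
        public String toString() {
            return "[start=" + start + ", length=" + length + "]";
        }
    }

    /** Hypothetical stand-in for an input format; not a Hadoop class. */
    public static class SketchFormat {
        /** Hook that subclasses may override, mirroring the idea of isSplitable. */
        protected boolean isSplitable(String filename) {
            return true;
        }

        /** Cuts the file into ranges of at most splitSize bytes, or one range if not splitable. */
        public List<Range> getSplits(String filename, long fileLength, long splitSize) {
            List<Range> splits = new ArrayList<>();
            if (!isSplitable(filename)) {
                splits.add(new Range(0, fileLength)); // whole file goes to one mapper
                return splits;
            }
            for (long start = 0; start < fileLength; start += splitSize) {
                splits.add(new Range(start, Math.min(splitSize, fileLength - start)));
            }
            return splits;
        }
    }

    /** Same format, but every file is treated as a single unit. */
    public static class WholeFileFormat extends SketchFormat {
        @Override
        protected boolean isSplitable(String filename) {
            return false;
        }
    }

    public static void main(String[] args) {
        // A 250-byte file with a 100-byte split size (made-up numbers).
        System.out.println(new SketchFormat().getSplits("input.txt", 250, 100));
        System.out.println(new WholeFileFormat().getSplits("input.txt", 250, 100));
    }
}
```

With the default format the 250-byte file becomes three ranges (100, 100, 50 bytes); with the overriding subclass it becomes one range covering all 250 bytes.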

Re: InputSplit is confusing me .. Any clarifications ??

2010-11-27 Thread Arun C Murthy
Moving to mapreduce-user@, bcc common-u...@. Please use project-specific lists. Your InputSplits are defined by your InputFormat. Take a look at the 'getSplits' method in InputFormat.java. http://hadoop.apache.org/common/docs/r0.20.0/mapred_tutorial.html#Job+Input http://hadoop.apache.org/common/d

Re: InputSplit is confusing me .. Any clarifications ??

2010-11-27 Thread maha
Sorry, I typed LineInputSplit where I meant LineRecordReader. So here is my question again: > > Thanks for the reply .. although I read it has to do with "InputSplit" which > represents the data to be processed by an individual Mapper. By default it's > a LineRecordReader. > > How can I change

Re: InputSplit is confusing me .. Any clarifications ??

2010-11-27 Thread maha
Hi li, Thanks for the reply, although I read it has to do with "InputSplit", which represents the data to be processed by an individual Mapper. By default it's a LineInputSplit. How can I change this property to be FileInputSplit? Something like JobConf.set ("File.input.format","FileInptS

Re: InputSplit is confusing me .. Any clarifications ??

2010-11-26 Thread li ping
That depends on which InputFormat class you use. For example, org.apache.hadoop.mapreduce.lib.input.TextInputFormat. This TextInputFormat class breaks the file into lines; the key is the position (byte offset) of the line in the file, and the value is the text of the line. On Sat, Nov 27, 2010 at 6:08 AM, maha w
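As a rough illustration of those key/value pairs, the following plain-Java sketch walks a byte buffer and emits one (offset, line) record per line. It is only a model, not Hadoop's LineRecordReader: the class name `LineOffsetSketch` and the method `readLines` are made up for this example, and it assumes '\n'-terminated UTF-8 text held entirely in memory.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

public class LineOffsetSketch {
    /**
     * Maps the byte offset at which each line starts to the text of that line.
     * Hypothetical helper: assumes '\n' line endings and UTF-8 content.
     */
    public static Map<Long, String> readLines(byte[] data) {
        Map<Long, String> records = new LinkedHashMap<>();
        int lineStart = 0;
        for (int i = 0; i <= data.length; i++) {
            boolean endOfData = (i == data.length);
            // A line ends at a newline, or at end of data if unterminated bytes remain.
            boolean lineEnds = endOfData ? lineStart < data.length : data[i] == '\n';
            if (lineEnds) {
                String line = new String(data, lineStart, i - lineStart, StandardCharsets.UTF_8);
                records.put((long) lineStart, line); // key = offset, value = line text
                lineStart = i + 1;
            }
        }
        return records;
    }

    public static void main(String[] args) {
        byte[] data = "hello world\nbye world\n".getBytes(StandardCharsets.UTF_8);
        // prints {0=hello world, 12=bye world}
        System.out.println(readLines(data));
    }
}
```

Each record here corresponds to one call of map(key, value, ...): the key is where the line begins in the file, not a line number.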

InputSplit is confusing me .. Any clarifications ??

2010-11-26 Thread maha
Hi Everyone, What I really wish for Thanksgiving is someone giving me a clarification of how the InputSplit works. E.g. public void map(LongWritable key, Text value, OutputCollector output, Reporter reporter) throws IOException { String line = value.toString(