Hi,
Please read the map section of
http://wiki.apache.org/hadoop/HadoopMapReduce to understand how Hadoop
ends up respecting record boundaries even though blocks are cut
without regard to them. I hope it helps clear things up for you.
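
As a rough illustration of the idea on that page, here is a simplified sketch in plain Java. It is not the actual Hadoop source: the class SplitLineDemo and the helpers readSplit and nextLineStart are hypothetical names made up for this example, and the "file" is just an in-memory byte array. The assumption it demonstrates is that a reader whose split does not start at byte 0 backs up one byte and throws away everything through the next newline, while every reader finishes the last line it started even if that line runs past the end of its split.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

public class SplitLineDemo {

    // Returns the lines owned by the byte range [start, start + length) of data.
    // Hypothetical helper for illustration only, not a Hadoop API.
    static List<String> readSplit(byte[] data, int start, int length) {
        int end = Math.min(start + length, data.length);
        int pos = start;
        if (start != 0) {
            // Back up one byte and discard everything through the next newline.
            // If the split begins exactly at a line start, only the preceding
            // '\n' is discarded and the line itself is kept.
            pos = nextLineStart(data, start - 1);
        }
        List<String> lines = new ArrayList<>();
        // A line belongs to this split if it starts before the split's end;
        // it is read in full even when it extends past that end.
        while (pos < end) {
            int next = nextLineStart(data, pos);
            int lineEnd = next;
            if (lineEnd > pos && data[lineEnd - 1] == '\n') {
                lineEnd--; // drop the trailing newline from the record
            }
            lines.add(new String(data, pos, lineEnd - pos, StandardCharsets.UTF_8));
            pos = next;
        }
        return lines;
    }

    // Index just past the next '\n' at or after pos, or data.length if none.
    static int nextLineStart(byte[] data, int pos) {
        while (pos < data.length && data[pos] != '\n') {
            pos++;
        }
        return Math.min(pos + 1, data.length);
    }

    public static void main(String[] args) {
        byte[] data = "alpha\nbravo\ncharlie\ndelta\n".getBytes(StandardCharsets.UTF_8);
        int splitSize = 8; // deliberately cuts through the middle of lines
        for (int start = 0; start < data.length; start += splitSize) {
            System.out.println("split@" + start + " -> " + readSplit(data, start, splitSize));
        }
    }
}
```

With this sample data every line is returned by exactly one split, even though the 8-byte splits cut through "bravo", "charlie" and "delta".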
On Fri, Feb 10, 2012 at 10:26 PM, GUOJUN Zhu
Freddie Mac
From: Harsh J ha...@cloudera.com
Date: 02/10/2012 12:02 PM
Reply-To: mapreduce-user@hadoop.apache.org
To: mapreduce-user@hadoop.apache.org
Subject: Re: Does FileSplit respect the record boundary?
Thank you for the reply. That page helps a lot. I still have a more
specific question about the LineRecordReader constructor in Hadoop 1.0.0,
public LineRecordReader(Configuration job, FileSplit split). Does