Re: No KeyValueTextInputFormat in hadoop-0.20.2?

2010-06-20 Thread Kevin Tse
Does anybody know about this, please?

On Mon, Jun 14, 2010 at 10:21 PM, Kevin Tse kevintse.on...@gmail.com wrote:

 Hi Ted,
 I mean the new API:

 org.apache.hadoop.mapreduce.Job.setInputFormatClass(org.apache.hadoop.mapreduce.InputFormat)

 Job.setInputFormatClass() only accepts subclasses of
 org.apache.hadoop.mapreduce.InputFormat as its parameter. There are several
 such subclasses, but KeyValueTextInputFormat is not one of them.

 On Mon, Jun 14, 2010 at 10:03 PM, Ted Yu yuzhih...@gmail.com wrote:

 Have you checked
 src/mapred/org/apache/hadoop/mapred/KeyValueTextInputFormat.java ?

 On Mon, Jun 14, 2010 at 6:51 AM, Kevin Tse kevintse.on...@gmail.com wrote:

  Hi,
  I am upgrading my code from hadoop-0.19.2 to hadoop-0.20.2. During the
  process I found that hadoop-0.20.2 has no KeyValueTextInputFormat class,
  although one exists in hadoop-0.19.2. It is strange that this version of
  Hadoop does not come with this commonly used InputFormat. I have taken a
  look at the SecondarySort.java example code. It uses TextInputFormat and
  StringTokenizer to split each line, which works but feels awkward to me.

  Do I have to implement a new InputFormat myself, or is there
  a KeyValueTextInputFormat somewhere that I didn't notice?
 
  Thank you.
  Kevin Tse
 





Re: No KeyValueTextInputFormat in hadoop-0.20.2?

2010-06-20 Thread Amareshwari Sri Ramadasu
The new-API KeyValueTextInputFormat is not available in branch 0.20. It was
added in branch 0.21 through
https://issues.apache.org/jira/browse/MAPREDUCE-655.

Thanks
Amareshwari

On 6/21/10 6:52 AM, Kevin Tse kevintse.on...@gmail.com wrote:

Does anybody know about this, please?


No KeyValueTextInputFormat in hadoop-0.20.2?

2010-06-14 Thread Kevin Tse
Hi,
I am upgrading my code from hadoop-0.19.2 to hadoop-0.20.2. During the
process I found that hadoop-0.20.2 has no KeyValueTextInputFormat class,
although one exists in hadoop-0.19.2. It is strange that this version of
Hadoop does not come with this commonly used InputFormat. I have taken a
look at the SecondarySort.java example code. It uses TextInputFormat and
StringTokenizer to split each line, which works but feels awkward to me.

Do I have to implement a new InputFormat myself, or is there
a KeyValueTextInputFormat somewhere that I didn't notice?

Thank you.
Kevin Tse
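
For readers who stay on TextInputFormat, the per-line work that
KeyValueTextInputFormat would otherwise do can be sketched in plain Java.
This is only an illustration, not Hadoop code: the class name
KeyValueLineSplit and the method split are hypothetical, and the sketch
assumes the usual convention of cutting each line at the first separator
(a tab by default), with the whole line becoming the key and the value left
empty when no separator is present.

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.Map;

// Hypothetical helper, not part of Hadoop: it mimics the key/value split
// that a mapper could apply to each line delivered by TextInputFormat.
final class KeyValueLineSplit {

    // Splits a line at the FIRST occurrence of the separator.
    // Assumption: a line without a separator yields (line, "").
    static Map.Entry<String, String> split(String line, char separator) {
        int pos = line.indexOf(separator);
        if (pos < 0) {
            return new SimpleImmutableEntry<>(line, "");
        }
        return new SimpleImmutableEntry<>(
                line.substring(0, pos), line.substring(pos + 1));
    }

    public static void main(String[] args) {
        // Only the first tab separates key from value; later tabs stay in the value.
        Map.Entry<String, String> kv = split("user42\tclicked\thome", '\t');
        System.out.println("key=" + kv.getKey());
        System.out.println("value=" + kv.getValue());
    }
}
```

Unlike StringTokenizer, which breaks the line into every token, this keeps
everything after the first separator together as the value. Inside a mapper,
the line text would be passed to split and the two parts emitted as needed.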


Re: No KeyValueTextInputFormat in hadoop-0.20.2?

2010-06-14 Thread Ted Yu
Have you checked
src/mapred/org/apache/hadoop/mapred/KeyValueTextInputFormat.java ?


Re: No KeyValueTextInputFormat in hadoop-0.20.2?

2010-06-14 Thread Kevin Tse
Hi Ted,
I mean the new API:
org.apache.hadoop.mapreduce.Job.setInputFormatClass(org.apache.hadoop.mapreduce.InputFormat)

Job.setInputFormatClass() only accepts subclasses of
org.apache.hadoop.mapreduce.InputFormat as its parameter. There are several
such subclasses, but KeyValueTextInputFormat is not one of them.
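
The type mismatch described above can be illustrated without Hadoop on the
classpath. In the sketch below every type is a hypothetical stand-in:
OldApiInputFormat plays the role of the org.apache.hadoop.mapred.InputFormat
interface, NewApiInputFormat plays the role of the
org.apache.hadoop.mapreduce.InputFormat abstract class, and
setInputFormatClass only imitates the bounded Class parameter of the real
method. It is meant to show why an old-API class is rejected, not how Hadoop
implements it.

```java
// Stand-in types only; none of these are real Hadoop classes.
final class InputFormatBoundDemo {

    interface OldApiInputFormat {}          // stands in for mapred.InputFormat
    abstract static class NewApiInputFormat {} // stands in for mapreduce.InputFormat

    // An old-API format: unrelated to NewApiInputFormat in the type hierarchy.
    static class OldKeyValueTextInputFormat implements OldApiInputFormat {}

    // A new-API format: a proper subclass of NewApiInputFormat.
    static class NewTextInputFormat extends NewApiInputFormat {}

    // Imitates a setter that accepts only new-API formats via a bounded wildcard.
    static String setInputFormatClass(Class<? extends NewApiInputFormat> cls) {
        return cls.getSimpleName();
    }

    // Runtime equivalent of the compile-time bound above.
    static boolean isNewApiFormat(Class<?> cls) {
        return NewApiInputFormat.class.isAssignableFrom(cls);
    }

    public static void main(String[] args) {
        System.out.println(setInputFormatClass(NewTextInputFormat.class));
        // The next line would not compile, because the old-API class does not
        // extend NewApiInputFormat:
        // setInputFormatClass(OldKeyValueTextInputFormat.class);
        System.out.println(isNewApiFormat(OldKeyValueTextInputFormat.class));
    }
}
```

The two hierarchies share a class name but no common supertype, so having
the old-API class in the source tree does not help a job that is configured
through the new API.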
