Re: Questions about dfs and MapRed in the Hadoop.

2010-01-05 Thread Ed Mazur
Hi Pedro, I can answer a couple of these.

On Tue, Jan 5, 2010 at 5:46 PM, psdc1978 wrote:
> 1 - What is the difference between the classes
> org.apache.hadoop.mapred.Reducer.java and
> org.apache.hadoop.mapreduce.Reducer.java? In which cases is each of
> the two reducers used?
>
> 2 - The same question

Questions about dfs and MapRed in the Hadoop.

2010-01-05 Thread psdc1978
Hi list, I downloaded Hadoop 0.20.1 and am now looking at the MapReduce source. I have the following questions: 1 - What is the difference between the classes org.apache.hadoop.mapred.Reducer.java and org.apache.hadoop.mapreduce.Reducer.java? In which cases is each of the two reducers used?

Re: Only running hadoop Map tasks

2010-01-05 Thread Owen O'Malley
On Jan 5, 2010, at 9:13 AM, psdc1978 wrote:

> 1 - I would like to see the output that the maps produce in my example. Is it possible to have Hadoop run only the map tasks, excluding the reduce tasks?

Set the number of reduce tasks to 0.

> 2 - The output of the Maps is written into a tempora
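
As an illustration of the "set the number of reduce tasks to 0" suggestion: in Hadoop 0.20 this is typically done with `JobConf.setNumReduceTasks(0)` (or the `mapred.reduce.tasks` property), after which the map output is written directly to the job's output path without being sorted or grouped. The sketch below is not Hadoop code. It is a toy model in plain Java with no Hadoop dependency, and the names `MapOnlyDemo`, `map`, and `run` are hypothetical, chosen only to show what kind of output to expect in each case.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model only: imitates the effect of zero reduce tasks, not Hadoop itself.
class MapOnlyDemo {

    // Hypothetical map function: emit (word, 1) for every word in the line.
    static List<String[]> map(String line) {
        List<String[]> out = new ArrayList<>();
        for (String word : line.trim().split("\\s+")) {
            if (!word.isEmpty()) {
                out.add(new String[] {word, "1"});
            }
        }
        return out;
    }

    // With numReduceTasks == 0 the raw map output is the job output:
    // one record per emitted pair, in emission order, with no grouping.
    // Otherwise the pairs are grouped by key (sorted) and summed,
    // standing in for the shuffle and reduce phases.
    static List<String> run(List<String> lines, int numReduceTasks) {
        List<String[]> mapOutput = new ArrayList<>();
        for (String line : lines) {
            mapOutput.addAll(map(line));
        }

        List<String> result = new ArrayList<>();
        if (numReduceTasks == 0) {
            for (String[] kv : mapOutput) {
                result.add(kv[0] + "\t" + kv[1]);
            }
            return result;
        }

        Map<String, Integer> grouped = new TreeMap<>();
        for (String[] kv : mapOutput) {
            grouped.merge(kv[0], Integer.parseInt(kv[1]), Integer::sum);
        }
        for (Map.Entry<String, Integer> e : grouped.entrySet()) {
            result.add(e.getKey() + "\t" + e.getValue());
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("b a", "a c");
        System.out.println("map only:     " + run(input, 0));
        System.out.println("with reducer: " + run(input, 1));
    }
}
```

In the map-only case every emitted pair appears as its own record, including the repeated key, which is what makes this setting handy for inspecting what the maps actually produce.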

Only running hadoop Map tasks

2010-01-05 Thread psdc1978
Hi,
1 - I would like to see the output that the maps produce in my example. Is it possible to have Hadoop run only the map tasks, excluding the reduce tasks?
2 - Is the output of the maps written to a temporary file?
3 - How is the output of the maps passed to the reduce tasks? Is using a

Map-Reduce Framework Reduce output records Counter error?

2010-01-05 Thread Yonggang Qiao
The number from the "Reduce output records" counter doesn't match the actual number of records in the output files, although after rerunning it, it did match. Any idea what could be wrong? Thanks, Yonggang