MapReduce Child don't exit?

2009-11-11 Thread Ted Xu
Hi all, we are using hadoop-0.19.1 on about 200 nodes. We find that many slaves keep Child processes running even after the job is done. Here is an example; the process has been running since August 9! > 1000 24625 1 0 Aug09 ? 00:00:38 (...java... classpath) > org.apache.hadoop.mapred.Chi

Can I wire a mapper and reducer class?

2009-11-11 Thread Java Developer
I want to know if I can "wire" mapper and reducer classes. By "wire" I mean using inversion of control (or dependency injection). For example, I see in a lot of books and on the Map/Reduce tutorial page (http://hadoop.apache.org/common/docs/r0.20.0/mapred_tutorial.html) something like this: con

Re: Hadoop Job Performance as MapReduce count increases?

2009-11-11 Thread Aaron Kimball
That makes sense. It's worth pointing out that tasks are scheduled on a "pull" basis -- tasktrackers ask for more work if they have free slots for tasks -- so it is not a given that all nodes will receive the same number of tasks. If some tasks take considerably longer (or some nodes are faster/slo
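To illustrate the "pull" point above, here is a minimal sketch in Python. It is a simplified model, not Hadoop's actual scheduler: the function name `simulate_pull_scheduling`, the `node_speeds` list, and the fixed per-task work are all assumptions made for the example. Each free slot takes the next task from a shared queue, so faster nodes end up running more tasks.

```python
import heapq


def simulate_pull_scheduling(num_tasks, node_speeds, slots_per_node=2, task_work=1.0):
    """Count tasks per node when free slots pull work from a shared queue.

    Hypothetical model: node_speeds[i] is the relative speed of node i,
    and every task needs the same amount of work (task_work).
    """
    # Each heap entry is (time the slot becomes free, node index).
    # All slots start free at time 0.
    free_at = [(0.0, node)
               for node in range(len(node_speeds))
               for _ in range(slots_per_node)]
    heapq.heapify(free_at)

    assigned = [0] * len(node_speeds)
    for _ in range(num_tasks):
        # The slot that frees up earliest asks for (pulls) the next task.
        now, node = heapq.heappop(free_at)
        assigned[node] += 1
        # That slot is busy until the task finishes on this node.
        heapq.heappush(free_at, (now + task_work / node_speeds[node], node))
    return assigned


if __name__ == "__main__":
    # Node 1 is twice as fast as node 0; one slot each.
    print(simulate_pull_scheduling(12, [1.0, 2.0], slots_per_node=1))
```

With equal speeds the tasks split evenly; with unequal speeds the faster node pulls more of them, which is why per-node task counts are not guaranteed to match.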

Re: Hadoop Job Performance as MapReduce count increases?

2009-11-11 Thread Rob Stewart
Hi Aaron, your response was very useful indeed, thank you very much. OK, I've documented the scenario (relevant to my experiments) where the cluster is very small, only 10 nodes. I have uploaded this section only to: http://linuxsoftwareblog.com/Hadoop/small_cluster_scenario.png Can I ask, d

using job client in mapreduce example

2009-11-11 Thread Rajan Dev
Hello, I am using the Hadoop wordcount examples and trying to use the JobClient API to get the status of map and reduce, as I want to build my own status bar for the MapReduce job instead of using JobTracker.jsp. I used two JVMs to run the jars but was not successful and got this exception in co