Hi all,
We are using hadoop-0.19.1 on about 200 nodes. We find that many slaves
keep Child processes running even after the job is done.
Here is an example: the process has been running since "AUGUST 09"!
> 1000 24625 1 0 Aug09 ?00:00:38 (...java... classpath)
> org.apache.hadoop.mapred.Chi
I want to know if I can "wire" mapper and reducer classes. By "wire" I mean
to use inversion of control (or dependency injection). For example, I see in
a lot of books and on the Map/Reduce tutorial page (
http://hadoop.apache.org/common/docs/r0.20.0/mapred_tutorial.html),
something like this:
con
That makes sense. It's worth pointing out that tasks are scheduled on a
"pull" basis -- tasktrackers ask for more work if they have free slots for
tasks -- so it is not a given that all nodes will receive the same number of
tasks. If some tasks take considerably longer (or some nodes are
faster/slo
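
To make the "pull" point concrete, here is a toy simulation (not Hadoop code). It assumes one task slot per node, at least one node, and a fixed per-task duration for each node; the class name PullSchedulingSketch and the method simulate are made up for illustration. Whichever node frees its slot first pulls the next task, so a faster node ends up running more tasks than a slower one.

```java
import java.util.PriorityQueue;

public class PullSchedulingSketch {

    // Hypothetical helper: returns how many tasks each node ends up running.
    // millisPerTask[i] is the (assumed constant) time node i needs per task.
    static int[] simulate(long[] millisPerTask, int totalTasks) {
        int[] tasksRun = new int[millisPerTask.length];
        // Each entry is {timeWhenSlotIsFree, nodeIndex}; earliest free slot first,
        // ties broken by node index so the run is deterministic.
        PriorityQueue<long[]> freeSlots = new PriorityQueue<>(
            (a, b) -> a[0] != b[0] ? Long.compare(a[0], b[0]) : Long.compare(a[1], b[1]));
        for (int node = 0; node < millisPerTask.length; node++) {
            freeSlots.add(new long[] {0L, node});
        }
        for (int task = 0; task < totalTasks; task++) {
            // The node whose slot frees up first "asks" for the next task.
            long[] slot = freeSlots.poll();
            int node = (int) slot[1];
            tasksRun[node]++;
            slot[0] += millisPerTask[node];
            freeSlots.add(slot);
        }
        return tasksRun;
    }

    public static void main(String[] args) {
        // Node 0 takes 1s per task, node 1 takes 2s per task (assumed values).
        int[] counts = simulate(new long[] {1000L, 2000L}, 30);
        for (int node = 0; node < counts.length; node++) {
            System.out.println("node " + node + " ran " + counts[node] + " tasks");
        }
    }
}
```

A real tasktracker has several slots and reports in on heartbeats, so actual task counts will not split this cleanly; the sketch only shows why equal counts per node are not guaranteed.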
Hi Aaron,
Your response was very useful indeed, thank you very much.
OK, I've documented the scenario (relevant to my experiments), where the
cluster is very small, only 10 nodes.
I have uploaded only this section to:
http://linuxsoftwareblog.com/Hadoop/small_cluster_scenario.png
Can I ask, d
Hello,
I am using the Hadoop wordcount example and trying to use the JobClient API
to get the status of map and reduce, as I am trying to build my own status
bar for the MapReduce job instead of using JobTracker.jsp.
I used two JVMs for running the jars but was not successful and got
this exception in co