Hi,

For a monitoring project, I am trying to find out how to do the following
in Hadoop MapReduce programmatically, ideally from the client interface
(JobClient etc.):

1) Given a taskID (or taskattemptID), figure out the ID of the node/server
(IP address or hostname) where the map/reduce task is running.

2) From the map method of my mapper class (which extends Mapper), get a hook
to the Partitioner class. Essentially, I want to count, as the map task
progresses, the map output bytes that each reducer task will fetch and
process.
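
To make item 2 concrete, here is a minimal sketch of the per-reducer
bookkeeping I have in mind. It is plain Java with no Hadoop dependency; the
class name PartitionByteCounter and its methods are made up for illustration.
It assumes the job uses the default HashPartitioner, whose formula it copies,
and it approximates record size by the UTF-8 length of key plus value, which
ignores serialization and compression overhead. In a real mapper the partition
number should come from the job's actual Partitioner, which is exactly the
hook I am asking about.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper: tracks approximate map output bytes per reducer.
class PartitionByteCounter {
    private final long[] bytesPerReducer;

    PartitionByteCounter(int numReduceTasks) {
        if (numReduceTasks <= 0) {
            throw new IllegalArgumentException("numReduceTasks must be positive");
        }
        this.bytesPerReducer = new long[numReduceTasks];
    }

    // Same formula as Hadoop's default HashPartitioner. Note that a String
    // key here hashes differently from a Hadoop Text key, so the partition
    // numbers are only illustrative.
    static int partitionFor(Object key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    // Call once per emitted (key, value) pair, next to the write in map().
    void record(String key, String value) {
        int partition = partitionFor(key, bytesPerReducer.length);
        // Assumption: UTF-8 payload length stands in for the serialized size.
        bytesPerReducer[partition] +=
                key.getBytes(StandardCharsets.UTF_8).length
                + value.getBytes(StandardCharsets.UTF_8).length;
    }

    // Returns a copy so callers cannot modify the running totals.
    long[] snapshot() {
        return bytesPerReducer.clone();
    }
}
```

The idea would be to create one counter per map task with the job's number of
reduce tasks, call record() for every pair the mapper emits, and report
snapshot() periodically to the monitoring side.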

*Please note I am using Hadoop 1.0.4*

Any help or hint would be greatly appreciated!

Thanks
