Hello Craig,
We have two HOD questions:
(1) In our current Torque PBS setup, the number of nodes requested by
HOD (-l nodes=X) corresponds to the number of CPUs allocated; however,
these CPUs can be spread across several partially filled or empty nodes.
Unfortunately, HOD does not appear to honour the number of processors
actually allocated to the job by Torque PBS.
Just FYI: at Yahoo! we've configured Torque to allocate whole nodes for
the number specified to HOD. In other words, the number corresponds to
nodes, not processors. This has proved simpler to manage. I don't recall
the exact setting right now, but I think you can make Torque behave this
way (i.e. not treat processors as individual nodes).
For example, a currently running HOD session shows up in qstat as:

  104544.trmaster  user  parallel  HOD  4178  8  --  --  288:0  R  01:48
  node29/2+node29/1+node29/0+node17/2+node17/1+node18/2+node18/1+node19/1
However, inspecting the JobTracker UI shows that node19 has both "Max
Map Tasks" and "Max Reduce Tasks" set to 2, when I think node19 (with
only one allocated CPU) should be allowed only one map task.
While HOD does not do this automatically, note that since you are
bringing up a Map/Reduce cluster on the allocated nodes, you can pass
Map/Reduce parameters with which to bring up the cluster at allocation
time. The relevant option is --gridservice-mapred.server-params (-M for
short). Please refer to
http://hadoop.apache.org/core/docs/r0.19.0/hod_user_guide.html#Options+for+Configuring+Hadoop
for details.
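For example, an allocation capping each TaskTracker at one slot might
look like this (illustrative only; the -M option name is from the user
guide linked above, but the cluster directory, node count, and values
are assumptions for nodes with a single allocated CPU):

```sh
# Allocate an 8-node cluster, limiting every TaskTracker to 1 map slot
# and 1 reduce slot. -M is shorthand for
# --gridservice-mapred.server-params.
hod allocate -d ~/hod-clusters/test -n 8 \
  -M mapred.tasktracker.map.tasks.maximum=1 \
  -M mapred.tasktracker.reduce.tasks.maximum=1
```

Note that these server-params apply uniformly to all TaskTrackers in
the cluster; they cannot vary per node.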
I believe that for each node, HOD should determine (using the
information in $PBS_NODEFILE) how many CPUs on that node are allocated
to the HOD job, and then set mapred.tasktracker.map.tasks.maximum
appropriately on each node.
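The per-node CPU count is easy to derive, since Torque writes one line
per allocated CPU to $PBS_NODEFILE. A minimal sketch (not HOD code;
slots_per_node is a hypothetical helper):

```python
# Count how many CPUs each node contributes to a Torque job, given the
# contents of $PBS_NODEFILE (one line per allocated CPU: a node that
# appears three times contributed three CPUs).
from collections import Counter

def slots_per_node(nodefile_lines):
    """Map node name -> number of CPUs allocated on that node."""
    return Counter(line.strip() for line in nodefile_lines if line.strip())

# Example matching the qstat allocation shown earlier:
lines = ["node29", "node29", "node29", "node17", "node17",
         "node18", "node18", "node19"]
print(slots_per_node(lines))  # node29: 3, node17: 2, node18: 2, node19: 1
```

Each count could then feed that node's
mapred.tasktracker.map.tasks.maximum.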
(2) In our InputFormat, we use numSplits to determine how many map
tasks the job's files should be split into. However, HOD overrides
neither the mapred.map.tasks property nor mapred.reduce.tasks, even
though they should be set based on the number of available TaskTrackers
and/or nodes in the HOD session.
Can this not be set via the Hadoop job's configuration? Again, HOD
cannot do this automatically at present, but you could use
hod.client-params to generate a client-side hadoop-site.xml that applies
these settings to all jobs submitted to the cluster.
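A sketch of that approach (the hod.client-params option name is as
above; the comma-separated key=value syntax, cluster directory, and
values are assumptions and should be checked against the user guide):

```sh
# Bake job-level parallelism defaults into the client-side
# hadoop-site.xml generated for this cluster directory, so every job
# submitted through it picks them up.
hod allocate -d ~/hod-clusters/test -n 8 \
  --hod.client-params=mapred.map.tasks=16,mapred.reduce.tasks=4
```

Individual jobs can still override these values in their own JobConf.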
Hope this helps some.
Thanks,
Hemanth