Hi Edward, Thanks for the reply. That was my observation too.
Is there any other way to improve performance in a single node pseudo distribution mode ? Say when I have a hama cluster of 3 machines, does it mean it is ideal to have number of tasks (NumBspTasks) to that of the cluster size ? I see in the code when we do not set num tasks for a given job. value is taken from either from site or from default or from cluster size. Is there any other knobs that I can use to improve performance when in clusterd/distributed mode ? Regards, Mahesh Babu On Fri, Aug 23, 2013 at 3:39 PM, Edward J. Yoon <[email protected]>wrote: > The number of partitions is equal to the number of tasks. You might > not able to improve job performance by increasing tasks number on > single machine. It is like a lot of cooks in the bistro. > > On Fri, Aug 23, 2013 at 6:17 PM, Mahesh Babu <[email protected]> wrote: > > Hi, > > > > I am running some SSSP routines in pseudo distributed mode now. However, > > time taken to compute minDist is increasing when increasing NumBspTask. > And > > it reduces when I reduce this site configuration. > > > > > > I wanted to understand this API a bit more : i.e. BSPJob.setNumBspTasks() > > can somebody help me understand this.. > > does it relate to number of threads or does it any way influence > number > > of partitions > > and the reason why my time measurements are so. > > > > > > Also is there any other configuration/property that I can try to improve > > the speed of SSSP ? > > > > > > Regards, > > Mahesh Babu > > > > -- > Best Regards, Edward J. Yoon > @eddieyoon >
