I see... so if I have a cluster with n nodes, there is no way for me to have
it spawn on just 2 of those nodes, or just one of those nodes? And
furthermore, there is no way for me to have it spawn on just a subset of the
processors? Or am I misunderstanding?

Also, when you say "specify the number of tasks for each node", are you
referring to specifying the number of mappers and reducers I can spawn on
each node?

-SM

On Sun, Sep 7, 2008 at 8:29 PM, Mafish Liu <[EMAIL PROTECTED]> wrote:

> On Mon, Sep 8, 2008 at 2:25 AM, Sandy <[EMAIL PROTECTED]> wrote:
>
> > Hi,
> >
> > This may be a silly question, but I'm strangely having trouble finding an
> > answer for it (perhaps I'm looking in the wrong places?).
> >
> > Suppose I have a cluster with n nodes each with m processors.
> >
> > I wish to test the performance of, say, the wordcount program on k
> > processors, where k is varied from k = 1 ... nm.
>
>
> You can specify the number of tasks for each node in your hadoop-site.xml
> file.
> So you can vary k over k = n, 2*n, ..., m*n instead of k = 1 ... nm.
>
>
> > How would I do this? I'm having trouble finding the proper command line
> > option in the commands manual (
> > http://hadoop.apache.org/core/docs/current/commands_manual.html)
> >
> >
> >
> > Thank you very much for your time.
> >
> > -SM
> >
>
>
>
> --
> [EMAIL PROTECTED]
> Institute of Computing Technology, Chinese Academy of Sciences, Beijing.
>
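
The arithmetic in the quoted reply can be sketched in a few lines of Python. This is only an illustration, assuming every node is configured with the same per-node task limit t (1 <= t <= m); the function name and the sample values of n and m are made up for the example and do not come from Hadoop itself.

```python
def reachable_slot_counts(n_nodes, max_tasks_per_node):
    """Return the total number of concurrent tasks across the cluster
    for each uniform per-node limit t = 1 .. max_tasks_per_node.

    Assumes the same limit is applied on every node, so the totals are
    always multiples of n_nodes.
    """
    return [n_nodes * t for t in range(1, max_tasks_per_node + 1)]


if __name__ == "__main__":
    n, m = 4, 2  # hypothetical cluster: 4 nodes, 2 processors each
    print(reachable_slot_counts(n, m))
```

With a uniform per-node limit, only multiples of n are reachable, which is why values such as k = 1 are not covered by this approach.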
