Hi Huang,

Boris's answer should work fine. If it would be useful for you to have a
single command line tool to start up a pseudo-distributed cluster for
testing, please comment on this JIRA:

http://issues.apache.org/jira/browse/MAPREDUCE-987
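
In case it helps, here is a rough sketch of Boris's steps (quoted below) as a
shell script. It only creates the per-datanode directories and a minimal
hdfs-site.xml, then prints the start command for each datanode instead of
running it. The parent directory, the datanode count and the port numbers are
made up for illustration, and the property names are the 0.20-era ones as I
remember them (newer releases call the first one dfs.datanode.data.dir), so
please check them against the hdfs-default.xml of your version. You would
still need to copy core-site.xml, log4j.properties and so on into each
directory.

```shell
#!/bin/sh
# Sketch: one config directory per datanode, as described in the quoted steps.
# DIR, NUM_DN and the port offsets are assumptions, not Hadoop defaults.
set -eu

DIR="${DIR:-/tmp/multi-dn}"   # hypothetical parent directory
NUM_DN="${NUM_DN:-3}"         # hypothetical number of datanodes

i=1
while [ "$i" -le "$NUM_DN" ]; do
  conf="$DIR/dn$i"
  mkdir -p "$conf/log" "$conf/pid" "$conf/data" "$conf/tmp"

  # Each datanode gets its own storage directory and its own ports, so that
  # several of them can run side by side on one machine.
  cat > "$conf/hdfs-site.xml" <<EOF
<?xml version="1.0"?>
<configuration>
  <property><name>hadoop.tmp.dir</name><value>$conf/tmp</value></property>
  <property><name>dfs.data.dir</name><value>$conf/data</value></property>
  <property><name>dfs.datanode.address</name><value>0.0.0.0:$((50010 + i))</value></property>
  <property><name>dfs.datanode.http.address</name><value>0.0.0.0:$((50075 + i))</value></property>
  <property><name>dfs.datanode.ipc.address</name><value>0.0.0.0:$((50020 + i))</value></property>
</configuration>
EOF

  # Print the start command rather than running it; paste it into a shell in
  # the Hadoop install directory once the config directory looks right.
  echo "bin/hdfs --config $conf datanode > $conf/log/datanode_log$i.log 2>&1 &"
  i=$((i + 1))
done
```

Run it with something like DIR=$HOME/dn-test NUM_DN=2 sh make-dn-conf.sh (the
script name is just a placeholder).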

-Todd

On Fri, Sep 25, 2009 at 10:19 AM, Boris Shkolnik <bo...@yahoo-inc.com> wrote:

> You can run multiple data nodes on the same machine.
> You should create a separate config directory for each dn.
> The following needs to be created in each one:
>   hdfs-site.xml
>   pid/log/data/tmp dirs
>   log4j.properties
>   masters/slaves
> And then start these data nodes, with something like this:
>   bin/hdfs --config $DIR/dn$i datanode > $DIR/dn$i/log/datanode_log$i.log 2>&1 &
>
>
> Or you can look at MiniDFSCluster
>
>
> On 9/24/09 11:00 PM, "Huang Qian" <skysw...@gmail.com> wrote:
>
> > Hi guys,
> >
> > I just want to simulate a cluster with Hadoop on my laptop, so I chose the
> > pseudo-distributed mode. The example is running well, but now I want to
> > test getting data from different machines. Unfortunately, I have not found
> > anything on that topic yet. Can Hadoop fit my needs in pseudo-distributed
> > mode? Thank you all~
> >
> > All the best
> > Qian
>
>
