This page probably needs updating and/or simplifying. If you want to run
Nutch in fully or pseudo distributed mode, all it takes is to set up Hadoop
separately following the instructions on (Hadoop
Tutorial<http://hadoop.apache.org/common/docs/stable/>),
modify the config files in $NUTCH/conf and compile with 'ant job', then
move to $NUTCH/runtime/deploy/bin and use the commands as explained in
the Nutch
Tutorial <http://wiki.apache.org/nutch/NutchTutorial>. If you want to run
Nutch in local mode, then jump straight to $NUTCH/runtime/local and follow
the steps from the Nutch tutorial.

I'd recommend to test your config in local mode first to make sure that
everything behaves as expected then run Nutch in pseudo-distributed mode
which makes it easier to monitor the progress of your crawl using the
Hadoop UIs + you can get the most of your machine by having more than 1
mapper or reducer going at the same time.

HTH

Julien

On 31 May 2012 15:36, John <[email protected]> wrote:

> Dear everyone,
> Can anybody help me for use the nutch1.4+hadoop1.0.3 and solr3.4 ?
> I have readed the follow link
> http://wiki.apache.org/nutch/NutchHadoopTutorial
> but I cannot understand
> cp -R /path/to/build/* /nutch/search
> ------------------
>  Kind Regards,
> John Tang
>  http://hitangjun.com
> http://t.qq.com/hitangjun




-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble

Reply via email to