This page probably needs updating and/or simplifying. If you want to run Nutch in fully or pseudo distributed mode, all it takes is to set up Hadoop separately following the instructions on (Hadoop Tutorial<http://hadoop.apache.org/common/docs/stable/>), modify the config files in $NUTCH/conf and compile with 'ant job', then move to $NUTCH/runtime/deploy/bin and use the commands as explained in the Nutch Tutorial <http://wiki.apache.org/nutch/NutchTutorial>. If you want to run Nutch in local mode, then jump straight to $NUTCH/runtime/local and follow the steps from the Nutch tutorial.
I'd recommend to test your config in local mode first to make sure that everything behaves as expected then run Nutch in pseudo-distributed mode which makes it easier to monitor the progress of your crawl using the Hadoop UIs + you can get the most of your machine by having more than 1 mapper or reducer going at the same time. HTH Julien On 31 May 2012 15:36, John <[email protected]> wrote: > Dear everyone, > Can anybody help me for use the nutch1.4+hadoop1.0.3 and solr3.4 ? > I have readed the follow link > http://wiki.apache.org/nutch/NutchHadoopTutorial > but I cannot understand > cp -R /path/to/build/* /nutch/search > ------------------ > Kind Regards, > John Tang > http://hitangjun.com > http://t.qq.com/hitangjun -- * *Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

