Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by PaulDhaliwal: http://wiki.apache.org/nutch/NutchTutorial ------------------------------------------------------------------------------ {{{ bin/nutch }}} This will display the documentation for the Nutch command script. + + Good! You are almost ready to crawl. You need to give your crawler a name. This is required. + 1. Open up $NUTCH_HOME/conf/nutch-default.xml file + 2. Search for {{{http.agent.name}}} , and give it value 'YOURNAME Spider' + 3. Optionally you may also set {{{http.agent.url}}} and {{{http.agent.email}}} properties. Now we're ready to crawl. There are two approaches to crawling: