Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "JavaDemoApplication" page has been changed by Cristian Vulpe. http://wiki.apache.org/nutch/JavaDemoApplication?action=diff&rev1=11&rev2=12 -------------------------------------------------- <description /> </property> }}} - Place this copy of nutch-site.xml and a copy of common-terms.utf8 (from the conf directory in the Nutch distribution) in the WEB-INF/classes directory of the web application that you're deploying. For a standalone application, the mentioned files have to be available in the classpath. + Place this copy of nutch-site.xml and a copy of common-terms.utf8 (from the conf directory in the Nutch distribution) in the WEB-INF/classes directory of the web application that you're deploying. For a standalone application, the mentioned files have to be available in the CLASSPATH. In some cases, you might want to have extra-flexibility by using runtime configuration parameters. This can be achieved using variable substitution. For example, the nutch-site.xml might look like this: + + {{{ + <?xml version="1.0"?> + <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> + + <configuration> + <property> + <name>plugin.folders</name> + <value>${nutch.site.plugin.folders} + </value> + <description /> + </property> + + <property> + <name>searcher.dir</name> + <value>${nutch.site.searcher.dir}</value> + <description /> + </property> + </configuration> + }}} + + and run the java application using the appropriate parameters: + {{{ + -Dnutch.site.plugin.folders="c:\tools\crawlers\apache-nutch-1.1-bin\plugins" + -Dnutch.site.searcher.dir="c:\tools\crawlers\apache-nutch-1.1-bin\crawl" + }}} + + === CLASSPATH Configuration === You also need to make sure that the following jars are placed in WEB-INF/lib (this assumes usage of Nutch 0.9):

