Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by MiddleForkMaps:
http://wiki.apache.org/nutch/GettingNutchRunningWithDebian

------------------------------------------------------------------------------
  '''It is NOT necessary to run ''''~/local/tomcat/bin/catalina.sh start'''' as noted elsewhere in the WIKI, nor is it necessary to start tomcat/catalina from any particular location'''[[BR]]

  == Configure File and Webapp Paths ==
- Under Debian Etch, the Catalina configuration files are located under '''/etc/tomcat5.5/policy.d'''. At runtime they are combined into a single file, '''/usr/share/tomcat5.5/conf/catalina.policy'''. Do not edit the latter, as it will be overwritten.[[BR]]
+ Under Debian Etch, the Catalina configuration files are located under '''/etc/tomcat5.5/policy.d'''. At runtime they are combined into a single file, ''/usr/share/tomcat5.5/conf/catalina.policy''. Do not edit the latter, as it will be overwritten.[[BR]]
+ At the end of /etc/tomcat5.5/policy.d/04webapps.policy include the following code:[[BR]]
+ ''grant codeBase "file:/usr/share/tomcat5.5-webapps/-" {
+     permission java.util.PropertyPermission "user.dir", "read";
+     permission java.util.PropertyPermission "java.io.tmpdir", "read,write";
+     permission java.util.PropertyPermission "org.apache.*", "read,execute";
+     permission java.io.FilePermission "/usr/local/nutch/crawls/-" , "read";
+     permission java.io.FilePermission "/var/lib/tomcat5.5/temp", "read";
+     permission java.io.FilePermission "/var/lib/tomcat5.5/temp/-", "read,write,execute,delete";
+     permission java.lang.RuntimePermission "createClassLoader", "";
+     permission java.security.AllPermission;
+ };
+ ''
+ '''Warning: The last line here was necessary in order to make things work for me. If anybody can supply a more restrictive permission set, please do so!!! The effects of this are unknown'''
+ 
+ == Acquire, install and configure Nutch ==
+ Follow '''ONLY''' the section ''Getting Started'' in the Nutch tutorial at http://lucene.apache.org/nutch/tutorial8.html
+ === Configure for multiple, independent site crawls and searches ===
+ Given two sites, site1 and site2, which you wish to crawl/index (and later search) independently from each other:[[BR]]
+ ''#cp -rp conf conf.site1''[[BR]]
+ ''#cp -rp conf conf.site2''[[BR]]
+ 
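
The per-site configuration copies in the change above can be generalised to any number of sites. The following is only a sketch, not part of the wiki page: the helper name make_site_confs is hypothetical, and it assumes the Nutch directory passed as the first argument contains the stock conf directory. It skips sites whose copy already exists, because repeating ''cp -rp conf conf.site1'' against an existing target would nest a second conf directory inside it.

```shell
#!/bin/sh
# Sketch: create one independent copy of Nutch's conf directory per site.
# make_site_confs is a hypothetical helper name, not part of Nutch.
# Usage: make_site_confs NUTCH_DIR SITE [SITE...]
make_site_confs() {
    nutch_dir=$1
    shift
    for site in "$@"; do
        target="$nutch_dir/conf.$site"
        # Skip existing copies so a rerun does not create conf.<site>/conf.
        if [ -e "$target" ]; then
            echo "skipping $target (already exists)" >&2
            continue
        fi
        # Same command as on the wiki page, applied once per site.
        cp -rp "$nutch_dir/conf" "$target"
    done
}
```

For the two sites named on the page, this might be called as ''make_site_confs /usr/local/nutch site1 site2'', where /usr/local/nutch is assumed to be the Nutch install directory (the policy file above only confirms that crawls live under it).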