Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by MiddleForkMaps:
http://wiki.apache.org/nutch/GettingNutchRunningWithDebian

------------------------------------------------------------------------------
  '''It is NOT necessary to run ''''~/local/tomcat/bin/catalina.sh start'''' as 
noted elsewhere in the WIKI, nor is it necessary to start tomcat/catalina from 
any particular location'''[[BR]]
  
  == Configure File and Webapp Paths ==
- Under Debian Etch, the Catalina configuration files are located under 
'''/etc/tomcat5.5/policy.d'''  At runtime they are combined into a single file, 
'''/usr/share/tomcat5.5/conf/catalina.policy'''  Do not edit the latter, as it 
will be overwritten.[[BR]]
+ Under Debian Etch, the Catalina configuration files are located under 
'''/etc/tomcat5.5/policy.d'''.  At runtime they are combined into a single file, 
''/usr/share/tomcat5.5/conf/catalina.policy''.  Do not edit the latter, as it 
will be overwritten.[[BR]]
+ At the end of /etc/tomcat5.5/policy.d/04webapps.policy include the following 
code:[[BR]]
  
+ ''grant codeBase "file:/usr/share/tomcat5.5-webapps/-" {
+     permission java.util.PropertyPermission "user.dir", "read";
+     permission java.util.PropertyPermission "java.io.tmpdir", "read,write";
+     permission java.util.PropertyPermission "org.apache.*", "read";
+     permission java.io.FilePermission "/usr/local/nutch/crawls/-" , "read";
+     permission java.io.FilePermission "/var/lib/tomcat5.5/temp", "read";
+     permission java.io.FilePermission "/var/lib/tomcat5.5/temp/-", "read,write,execute,delete";
+     permission java.lang.RuntimePermission "createClassLoader", "";
+     permission java.security.AllPermission;
+ };
+ ''
+ '''Warning:  The last permission line (java.security.AllPermission) was 
necessary in order to make things work for me.  It grants the webapps every 
permission, so the security implications are unknown.  If anybody can supply a 
more restrictive permission set, please do so!'''
+ 
+ == Acquire, install and configure Nutch ==
+ Follow '''ONLY''' the section ''Getting Started'' in the Nutch tutorial at 
http://lucene.apache.org/nutch/tutorial8.html
+ === Configure for multiple, independent site crawls and searches ===
+ Given two sites, site1 and site2, which you wish to crawl and index (and later 
search) independently of each other, make a separate copy of the conf directory 
for each:[[BR]]
+  ''#cp -rp conf conf.site1''[[BR]]
+  ''#cp -rp conf conf.site2''[[BR]]
+ 
+ 
+ 
+ 
+ 
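The per-site copies of the conf directory shown above could also be made with a small shell function, which is handy once there are more than two sites. This is only a sketch: the function name `make_site_confs` and its arguments are made up for illustration and are not part of Nutch, and it assumes the Nutch directory passed in contains the stock `conf` directory.

```shell
# Hypothetical helper (not part of Nutch): make_site_confs NUTCH_DIR SITE...
# Copies NUTCH_DIR/conf to NUTCH_DIR/conf.SITE for each site name given.
make_site_confs() {
    nutch_dir=$1
    shift
    for site in "$@"; do
        target="$nutch_dir/conf.$site"
        if [ -e "$target" ]; then
            # cp -r into an existing directory would nest conf/ inside it,
            # so leave existing copies untouched.
            echo "skipping $target: already exists" >&2
            continue
        fi
        # -r copies recursively; -p preserves mode, ownership and timestamps.
        cp -rp "$nutch_dir/conf" "$target"
    done
}
```

It might be called as `make_site_confs /usr/local/nutch site1 site2`, where `/usr/local/nutch` is an assumed install location; substitute the directory in which Nutch was actually unpacked.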
