Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by MiddleForkMaps:
http://wiki.apache.org/nutch/GettingNutchRunningWithDebian

------------------------------------------------------------------------------
   ''# /etc/init.d/tomcat5.5 start''[[BR]]
   ''# /etc/init.d/tomcat5.5 stop''[[BR]]
  '''It is NOT necessary to run ''''~/local/tomcat/bin/catalina.sh start'''' as 
noted elsewhere in the WIKI, nor is it necessary to start tomcat/catalina from 
any particular location'''[[BR]]
- Tomcat5.5 under Debian Etch listens to port 8180, not 8080, so pointing your 
browser to http://mysite:8180 will bring up the Tomcat home page, if everything 
is functioning properly.[[BR]]
+ Tomcat5.5 under Debian Etch listens on port 8180, not 8080, so pointing your 
browser to http://blahblah:8180 will bring up the Tomcat home page if 
everything is functioning properly.[[BR]]
  === Grant Yourself Tomcat Manager Permissions ===
  Edit ''/usr/share/tomcat5.5/conf/tomcat-users.xml'' and include the 
following:[[BR]]
    {{{<user username="myname" password="mypassword" roles="manager"/>}}}
@@ -71, +71 @@

  == Install Multiple Copies of Nutch under Tomcat5.5 and Prepare for Searching 
==
  Under Debian Etch & Tomcat5.5 the webapps path is located at[[BR]]
   ''/usr/share/tomcat5.5-webapps''[[BR]]
- '''Contrary to the Nutch tutorial(s) it is NOT NECESSARY to remove the ROOT 
context
+ '''Contrary to the Nutch tutorial(s) it is NOT NECESSARY to remove the ROOT 
context nor is it desirable.''' It was noted above that the Tomcat Manager 
allows us to view and control our multiple applications.  Removing ROOT would 
break this functionality.[[BR]]
+ Create two new folders under /usr/share/tomcat5.5-webapps, and explode the 
nutch war file into each:
+ {{{ # cd /usr/share/tomcat5.5-webapps
+ # mkdir site1
+ # mkdir site2
+ # cp /usr/local/nutch/nutch-0.8.1.war site1
+ # cp /usr/local/nutch/nutch-0.8.1.war site2
+ # cd site1; jar xvf nutch-0.8.1.war; rm nutch-0.8.1.war; cd ..
+ # cd site2; jar xvf nutch-0.8.1.war; rm nutch-0.8.1.war; cd ..
+ }}}
+ === Configure the site1 and site2 webapps ===
+ In site1/WEB-INF/classes/nutch-site.xml, set the searcher.dir property so 
that it points to your crawl directory under /usr/local/nutch:[[BR]]
+ {{{<property>
+   <name>searcher.dir</name>
+   <value>/usr/local/nutch/crawls/site1</value>
+ </property>
+ }}}
+ And repeat for site2.[[BR]]
+ Create site1.xml and site2.xml under /usr/share/tomcat5.5-webapps by 
modifying the distribution nutch-site.xml:
+ {{{
+ <Context path="/site1" docBase="/usr/share/tomcat5.5-webapps/site1"
+    debug="0" privileged="true" allowLinking="true">
+ </Context>
+ }}}
+ And repeat for site2.[[BR]]
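
The two context files differ only in the site name, so they could be generated rather than typed by hand. The following is a minimal sketch, assuming the directory layout described on this page; `write_context` is a hypothetical helper name, not something shipped with Nutch or Tomcat.

```shell
#!/bin/sh
# Hypothetical helper: write a Tomcat context file for one site.
#   $1 = site name (e.g. site1)
#   $2 = webapps directory that holds the exploded site folders
write_context() {
    site="$1"
    webapps="$2"
    # Same attributes as the hand-written example on this page.
    cat > "$webapps/$site.xml" <<EOF
<Context path="/$site" docBase="$webapps/$site"
   debug="0" privileged="true" allowLinking="true">
</Context>
EOF
}

# Usage (as root), with the Debian Etch path from this page:
#   for s in site1 site2; do
#       write_context "$s" /usr/share/tomcat5.5-webapps
#   done
```

Passing the webapps directory as an argument keeps the helper easy to try out in a scratch directory before touching the real Tomcat tree.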
+ Create symbolic links to these files under 
/usr/share/tomcat5.5/conf/Catalina/localhost:
+ {{{
+ ln -s /usr/share/tomcat5.5-webapps/site1.xml \
/usr/share/tomcat5.5/conf/Catalina/localhost/site1.xml
+ ln -s /usr/share/tomcat5.5-webapps/site2.xml \
/usr/share/tomcat5.5/conf/Catalina/localhost/site2.xml
+ }}}
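
With more than a couple of sites, the linking step can be looped. This is only a sketch under the assumptions of this page (context files in the webapps directory, links in Catalina/localhost); `link_contexts` is an illustrative name, not an existing tool.

```shell
#!/bin/sh
# Hypothetical helper: link each site's context file into Tomcat's
# per-host configuration directory.
#   $1 = directory holding the <site>.xml context files
#   $2 = Tomcat conf/Catalina/localhost directory
#   remaining arguments = site names
link_contexts() {
    src="$1"
    dest="$2"
    shift 2
    for site in "$@"; do
        # -f replaces a stale link left over from an earlier attempt.
        ln -sf "$src/$site.xml" "$dest/$site.xml"
    done
}

# Usage (as root), with the paths from this page:
#   link_contexts /usr/share/tomcat5.5-webapps \
#       /usr/share/tomcat5.5/conf/Catalina/localhost site1 site2
```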
+ === Restart Tomcat ===
+ {{{ /etc/init.d/tomcat5.5 restart }}}
+ Revisit the Tomcat Manager.  You should see new entries for site1 and site2 
and, with luck, their ''Running'' status should show as ''True''.
  
+ = Search Your Sites! =
+ 
+ Point your browser to http://blahblah:8180/site1 and conduct a search.  [[BR]]
+ Point your browser to http://blahblah:8180/site2 and conduct another search.  
[[BR]]
+ If everything was configured properly you should see independent results 
representing independent searches on independent crawls.
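
The same check can be made from a shell before opening a browser. The sketch below assumes `curl` is installed and reuses the placeholder host `blahblah` and port 8180 from this page; `site_url` and `check_sites` are hypothetical helper names.

```shell
#!/bin/sh
# Build the URL for one site from host, port and site name.
site_url() {
    printf 'http://%s:%s/%s\n' "$1" "$2" "$3"
}

# Print "<site> <HTTP status>" for each site name given.
#   $1 = host, $2 = port, remaining arguments = site names
check_sites() {
    host="$1"
    port="$2"
    shift 2
    for site in "$@"; do
        url=$(site_url "$host" "$port" "$site")
        # -L follows Tomcat's redirect to the trailing-slash URL;
        # -w prints only the final status code.
        code=$(curl -s -L -o /dev/null -w '%{http_code}' "$url")
        echo "$site $code"
    done
}

# Usage:
#   check_sites blahblah 8180 site1 site2
```

A 200 status only suggests that the webapp is deployed and answering. Whether each site returns results from its own crawl still has to be confirmed by running a search in the browser.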
+ 
+ FIN.
+ 

_______________________________________________
Nutch-cvs mailing list
Nutch-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-cvs
