Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by RandallLucas:
http://wiki.apache.org/nutch/GettingNutchRunningWithUbuntu

------------------------------------------------------------------------------
  {{{
  [EMAIL PROTECTED]:/opt# ./jdk-1_5_0_04-linux-amd64.bin
  }}}
+ 
+ ''You might also want to follow the instructions for Debian-izing the Sun JDK:'' http://plugindoc.mozdev.org/faqs/distronotes/ubuntu-x86.html#java-sun
  
  Let's put JAVA_HOME in our ~/.bash_profiles, and source said ~/.bash_profiles for root and nutch
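
As a rough illustration of the JAVA_HOME step, the sketch below appends the export lines to a profile file only if they are not already there. The function name `add_java_home` is hypothetical, and the path `/opt/jdk1.5.0_04` is an assumption about where the installer above unpacks the JDK; check the actual directory under /opt before using it.

```shell
# Hypothetical helper: add JAVA_HOME to a profile file exactly once.
# $1 = profile file (e.g. ~/.bash_profile), $2 = JDK directory (assumed path).
add_java_home() {
    profile="$1"
    jdk_dir="$2"
    # Skip the append if an export for JAVA_HOME is already present.
    if ! grep -q '^export JAVA_HOME=' "$profile" 2>/dev/null; then
        printf 'export JAVA_HOME=%s\n' "$jdk_dir" >> "$profile"
        # Single quotes keep $JAVA_HOME and $PATH literal in the file.
        printf 'export PATH="$JAVA_HOME/bin:$PATH"\n' >> "$profile"
    fi
}
```

Run it once as root and once as nutch, for example `add_java_home ~/.bash_profile /opt/jdk1.5.0_04`, then `. ~/.bash_profile` to load the setting into the current shell.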
  
@@ -110, +112 @@

  Follow the nutch tutorial (http://lucene.apache.org/nutch/tutorial.html) to build an index, or for a simple index:
  
  {{{
+ [EMAIL PROTECTED]:~/nutch/trunk $ mkdir urls
- [EMAIL PROTECTED]:~/nutch/trunk $ echo 'http://lucene.apache.org/nutch/' > urls
+ [EMAIL PROTECTED]:~/nutch/trunk $ echo 'http://lucene.apache.org/nutch/' > urls/nutch
  [EMAIL PROTECTED]:~/nutch/trunk $ perl -pi -e 's|MY.DOMAIN.NAME|lucene.apache.org/nutch|' \
    conf/crawl-urlfilter.txt
  [EMAIL PROTECTED]:~/nutch/trunk $ bin/nutch crawl urls -dir crawl.test -depth 3
