Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by RandallLucas: http://wiki.apache.org/nutch/GettingNutchRunningWithUbuntu ------------------------------------------------------------------------------ {{{ [EMAIL PROTECTED]:/opt# ./jdk-1_5_0_04-linux-amd64.bin }}} + + ''You might also want to follow the instructions for Debian-izing the Sun JDK:'' http://plugindoc.mozdev.org/faqs/distronotes/ubuntu-x86.html#java-sun Let's put JAVA_HOME in our ~/.bash_profiles, and source said ~/.bash_profiles for root and nutch @@ -110, +112 @@ Follow the nutch tutorial (http://lucene.apache.org/nutch/tutorial.html) to build a index, or for a simple index: {{{ + [EMAIL PROTECTED]:~/nutch/trunk $ mkdir urls - [EMAIL PROTECTED]:~/nutch/trunk $ echo 'http://lucene.apache.org/nutch/' > urls + [EMAIL PROTECTED]:~/nutch/trunk $ echo 'http://lucene.apache.org/nutch/' > urls/nutch [EMAIL PROTECTED]:~/nutch/trunk $ perl -pi -e 's|MY.DOMAIN.NAME|lucene.apache.org/nutch|' \ conf/crawl-urlfilter.txt [EMAIL PROTECTED]:~/nutch/trunk $ bin/nutch crawl urls -dir crawl.test -depth 3