Hi Thomas,
I just improved the description on how to run Nutch from the source package in
http://wiki.apache.org/nutch/NutchTutorial
If you are using Nutch 2.x, you should follow
http://wiki.apache.org/nutch/Nutch2Tutorial
Thanks,
Sebastian
On 11/01/2012 10:54 AM, Markus Jelsma wrote:
> Hi,
>
> There are binary versions of 1.5.1 but not 2.x.
> http://apache.xl-mirror.nl/nutch/1.5.1/
>
> About the scripts, you have to build nutch and then go to runtime/local
> directory to run bin/nutch.
>
> Cheers
>
>
> -Original message-
>> From:Dr. Thomas Zastrow
>> Sent: Thu 01-Nov-2012 10:45
>> To: user@nutch.apache.org
>> Subject: Information about compiling?
>>
>> Dear all,
>>
>> I found the following tutorial on the web:
>>
>> http://wiki.apache.org/nutch/NutchTutorial
>>
>> It starts with a binary version of Nutch. Unfortunateley, I didn't
>> found any binary version, just the source code on the web page? So, I
>> downloaded the latest version and compiled it with "ant". Everything
>> seems to work, but I'm a little bit confused about the paths and how I
>> should go on?
>>
>> Following the tutorial, I have to change some files, but they exist in
>> several versions:
>>
>> find . -iname regex-urlfilter.txt
>> ./runtime/local/conf/regex-urlfilter.txt
>> ./conf/regex-urlfilter.txt
>>
>> The same goes for the "nutch" command, I'm not sure which one is the
>> right one. When I execute /src/bin/nutch with the following parameters:
>>
>> ./nutch crawl /opt/crawls/ -dir /opt/crawls/ -depth 3 -topN 5
>>
>> I got an error which I understand that the script can not find the jar files:
>>
>> Exception in thread "main" java.lang.NoClassDefFoundError:
>> org/apache/nutch/crawl/Crawler
>> Caused by: java.lang.ClassNotFoundException: org.apache.nutch.crawl.Crawler
>> at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>> at java.security.AccessController.doPrivileged(Native Method)
>> at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>> at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>> at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>> at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
>> Could not find the main class: org.apache.nutch.crawl.Crawler.
>> Program will exit.
>>
>>
>> Any help would be nice ;-)
>>
>> Best regards and thank you for the software!
>>
>> Tom
>>
>>
>> --
>> Dr. Thomas Zastrow
>> Süsser Str. 5
>> 72074 Tübingen
>>
>> www.thomas-zastrow.de
>>