Re: Information about compiling?

2012-11-01 Thread Sebastian Nagel
Hi Thomas,

I just improved the description on how to run Nutch from the source package in
 http://wiki.apache.org/nutch/NutchTutorial

If you are using Nutch 2.x, you should follow
 http://wiki.apache.org/nutch/Nutch2Tutorial

Thanks,
Sebastian


On 11/01/2012 10:54 AM, Markus Jelsma wrote:
> Hi,
> 
> There are binary versions of 1.5.1 but not 2.x.
> http://apache.xl-mirror.nl/nutch/1.5.1/
> 
> About the scripts: you have to build Nutch and then go to the
> runtime/local directory to run bin/nutch.
> 
> Cheers
>  
>  
> -Original message-
>> From:Dr. Thomas Zastrow 
>> Sent: Thu 01-Nov-2012 10:45
>> To: user@nutch.apache.org
>> Subject: Information about compiling?
>>
>> Dear all,
>>
>> I found the following tutorial on the web:
>>
>> http://wiki.apache.org/nutch/NutchTutorial
>>
>> It starts with a binary version of Nutch. Unfortunately, I didn't
>> find any binary version, just the source code on the web page. So I
>> downloaded the latest version and compiled it with "ant". Everything
>> seems to work, but I'm a little bit confused about the paths and how I
>> should go on.
>>
>> Following the tutorial, I have to change some files, but they exist in  
>> several versions:
>>
>>   find . -iname regex-urlfilter.txt
>> ./runtime/local/conf/regex-urlfilter.txt
>> ./conf/regex-urlfilter.txt
>>
>> The same goes for the "nutch" command, I'm not sure which one is the  
>> right one. When I execute /src/bin/nutch with the following parameters:
>>
>> ./nutch crawl /opt/crawls/ -dir /opt/crawls/ -depth 3 -topN 5
>>
>> I got an error, which I understand to mean that the script cannot find the jar files:
>>
>> Exception in thread "main" java.lang.NoClassDefFoundError:  
>> org/apache/nutch/crawl/Crawler
>> Caused by: java.lang.ClassNotFoundException: org.apache.nutch.crawl.Crawler
>>  at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>>  at java.security.AccessController.doPrivileged(Native Method)
>>  at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>>  at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>>  at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>>  at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
>> Could not find the main class: org.apache.nutch.crawl.Crawler.   
>> Program will exit.
>>
>>
>> Any help would be nice ;-)
>>
>> Best regards and thank you for the software!
>>
>> Tom
>>
>>
>> -- 
>> Dr. Thomas Zastrow
>> Süsser Str. 5
>> 72074 Tübingen
>>
>> www.thomas-zastrow.de
>>



RE: Information about compiling?

2012-11-01 Thread Markus Jelsma
Hi,

There are binary versions of 1.5.1 but not 2.x.
http://apache.xl-mirror.nl/nutch/1.5.1/

About the scripts: you have to build Nutch and then go to the
runtime/local directory to run bin/nutch.
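
For illustration, the step above can be sketched as a small shell helper. This is only a sketch under assumptions: the function name nutch_launcher and the example path /path/to/apache-nutch are hypothetical, and it assumes the build has already been run in the source directory (with "ant", as in the original question). It checks that runtime/local/bin/nutch exists and is executable, and prints its path.

```shell
# Hypothetical helper: locate the runnable Nutch script in a built source tree.
# Argument 1: the unpacked Nutch source directory (defaults to the current one).
nutch_launcher() {
  src_dir="${1:-.}"
  launcher="$src_dir/runtime/local/bin/nutch"
  if [ -x "$launcher" ]; then
    # The build output is present: print the script to run.
    printf '%s\n' "$launcher"
  else
    # Nothing under runtime/local yet, so the source tree was probably not built.
    echo "No executable runtime/local/bin/nutch under $src_dir; build Nutch first." >&2
    return 1
  fi
}

# Example use (the path is a placeholder):
#   cd /path/to/apache-nutch/runtime/local && bin/nutch
#   nutch_launcher /path/to/apache-nutch
```

The copy of the script under src/bin is not checked here on purpose, since the advice above is to run the one under runtime/local.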

Cheers
 
 
-Original message-
> From:Dr. Thomas Zastrow 
> Sent: Thu 01-Nov-2012 10:45
> To: user@nutch.apache.org
> Subject: Information about compiling?
> 
> Dear all,
> 
> I found the following tutorial on the web:
> 
> http://wiki.apache.org/nutch/NutchTutorial
> 
> It starts with a binary version of Nutch. Unfortunately, I didn't
> find any binary version, just the source code on the web page. So I
> downloaded the latest version and compiled it with "ant". Everything
> seems to work, but I'm a little bit confused about the paths and how I
> should go on.
> 
> Following the tutorial, I have to change some files, but they exist in  
> several versions:
> 
>   find . -iname regex-urlfilter.txt
> ./runtime/local/conf/regex-urlfilter.txt
> ./conf/regex-urlfilter.txt
> 
> The same goes for the "nutch" command, I'm not sure which one is the  
> right one. When I execute /src/bin/nutch with the following parameters:
> 
> ./nutch crawl /opt/crawls/ -dir /opt/crawls/ -depth 3 -topN 5
> 
> I got an error, which I understand to mean that the script cannot find the jar files:
> 
> Exception in thread "main" java.lang.NoClassDefFoundError:  
> org/apache/nutch/crawl/Crawler
> Caused by: java.lang.ClassNotFoundException: org.apache.nutch.crawl.Crawler
>  at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>  at java.security.AccessController.doPrivileged(Native Method)
>  at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>  at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
> Could not find the main class: org.apache.nutch.crawl.Crawler.   
> Program will exit.
> 
> 
> Any help would be nice ;-)
> 
> Best regards and thank you for the software!
> 
> Tom
> 
> 
> -- 
> Dr. Thomas Zastrow
> Süsser Str. 5
> 72074 Tübingen
> 
> www.thomas-zastrow.de
>