Re: [Nutch-dev] working with nutch

2004-10-09 Thread Matthias Jaekle
/nutch$ ./nutch crawl Go to the nutch dir. Try: bin/nutch crawl Matthias --- This SF.net email is sponsored by: IT Product Guide on ITManagersJournal Use IT products in your business? Tell us what you think of them. Give us Your Opinions, Get Free

[Nutch-dev] working with nutch

2004-10-09 Thread Abdul Rahman Advany
hi, I can't get nutch to work, I set the java path. I get the following: /nutch$ ./nutch crawl run java in /usr/local/jdk1.4.2 expr: syntax error Exception in thread "main" java.lang.NoClassDefFoundError: net/nutch/tools/CrawlTool can someone help me? thnx in advace -

Re: [Nutch-dev] FastSegmentMergeTool

2004-10-09 Thread john
On Sat, Oct 09, 2004 at 11:46:09AM +0200, Andrzej Bialecki wrote: > [EMAIL PROTECTED] wrote: > > >Andrzej, > > > >Could you make paring optional too? > > s/paring/parsing/ ? Ture > > The tool does not do any parsing. It considers the data only from the > fetcher output files - other output fi

[Nutch-dev] Updatedb not work for medium-big size crawler

2004-10-09 Thread massimo
Hi, I think, after about 9 months of test on Nutch, that updatedb not work for big size crawler. I have 4 segments: one with about 390 pages and the others with size of about 1000 pages. The process to build all was: mkdir db mkdir segments bin/nutch admin db -create get http://rdf.dm

Re: [Nutch-dev] FastSegmentMergeTool

2004-10-09 Thread Andrzej Bialecki
[EMAIL PROTECTED] wrote: Andrzej, Could you make paring optional too? s/paring/parsing/ ? The tool does not do any parsing. It considers the data only from the fetcher output files - other output files like parse data, content and parse text are simply copied verbatim from input segments to the o

[Nutch-dev] Updatedb not work with NDFSÃ

2004-10-09 Thread massimo
Hi, I have tested the Nutch NDFS on local and remote machine. All work fine with inject db, generate segments, fetcher. But I have discovered that is impossible to use NDFS to updatedb. For more information see my log below. After Processing the latest document, and some time later that Finishi

[Nutch-dev] Build Failure

2004-10-09 Thread nutch-admin
init: [mkdir] Created dir: /tmp/nutch/build [mkdir] Created dir: /tmp/nutch/build/classes [mkdir] Created dir: /tmp/nutch/build/test [mkdir] Created dir: /tmp/nutch/build/test/classes [copy] Copying 4 files to /tmp/nutch/conf [copy] Copying /tmp/nutch/conf/regex-urlfilter