/nutch$ ./nutch crawl
Go to the nutch dir.
Try:
bin/nutch crawl
Matthias
---
This SF.net email is sponsored by: IT Product Guide on ITManagersJournal
Use IT products in your business? Tell us what you think of them. Give us
Your Opinions, Get Free
hi,
I can't get nutch to work, I set the java path.
I get the following:
/nutch$ ./nutch crawl
run java in /usr/local/jdk1.4.2
expr: syntax error
Exception in thread "main" java.lang.NoClassDefFoundError:
net/nutch/tools/CrawlTool
can someone help me?
thnx in advace
-
On Sat, Oct 09, 2004 at 11:46:09AM +0200, Andrzej Bialecki wrote:
> [EMAIL PROTECTED] wrote:
>
> >Andrzej,
> >
> >Could you make paring optional too?
>
> s/paring/parsing/ ?
Ture
>
> The tool does not do any parsing. It considers the data only from the
> fetcher output files - other output fi
Hi,
I think, after about 9 months of test on Nutch, that updatedb not work for
big size crawler.
I have 4 segments: one with about 390 pages and the others with size
of about 1000 pages.
The process to build all was:
mkdir db
mkdir segments
bin/nutch admin db -create
get http://rdf.dm
[EMAIL PROTECTED] wrote:
Andrzej,
Could you make paring optional too?
s/paring/parsing/ ?
The tool does not do any parsing. It considers the data only from the
fetcher output files - other output files like parse data, content and
parse text are simply copied verbatim from input segments to the o
Hi,
I have tested the Nutch NDFS on local and remote machine. All
work fine with inject db, generate segments, fetcher.
But I have discovered that is impossible to use NDFS to updatedb.
For more information see my log below.
After Processing the latest document, and some time later that
Finishi
init:
[mkdir] Created dir: /tmp/nutch/build
[mkdir] Created dir: /tmp/nutch/build/classes
[mkdir] Created dir: /tmp/nutch/build/test
[mkdir] Created dir: /tmp/nutch/build/test/classes
[copy] Copying 4 files to /tmp/nutch/conf
[copy] Copying /tmp/nutch/conf/regex-urlfilter