Nutch (distribution, "ant package") has a folder /lib/ containing lucene
jar files... As usual, specific versions of library files were tested
for production use, any upgrade in libs is not recommended...
-Original Message-
From: Michael Ji [mailto:[EMAIL PROTECTED]
Sent: Wednesday, Augu
Nutch simply uses the Lucene JAR file. Upgrading Nutch to use a new
Lucene release would involve replacing the JAR file with the new
version, and depending on the changes to Lucene itself it may involve
rebuilding indexes (to ensure normalization factors and such changes
are incorporated),
As I understand, Nutch is a crawling/searching
application based on Lucene;
Just a curious question, when Lucene has a new
version/release, how to merge Lucene to Nutch?
I didn't see an explicity Lucene Java source in Nutch
source tree. I don't think Nutch and Lucene do low
level API independent
...
Should be "clean" instead "deploy"
Hi Olena
I'm currently starting my work with Nutch. My goal is to have a topic
> specific (or at least language specific) crawler tool. Is it possible
> to apply the LanguageIdentifier plugin on webpages that are not yet
> fetched, so that e.g. only French or German pages are crawled?
No. The rea
Hi,
I was able to prepare the release today (it took longer than expected
due to build problem). I made all steps described in Release HOWTO
except two:
1) Update and deploy main Lucene site with news about release as I do
not have commit rights there. Anyone with commit rights who wants to
up