Re: [Nutch-dev] Injecting URLs from database

2005-02-16 Thread John X
On Tue, Feb 15, 2005 at 11:07:54PM -0800, David Spencer wrote: > Kelvin Tan wrote: > > >I'd like to > > > >1) inject URLs from a database > >2) add a RegexFilter for each URL such that only pages under each URL's > >TLD is indexed > > > >For the first, looking at the code, I suppose a way is to

Re: [Nutch-dev] Compilation problem

2005-02-16 Thread Olaf Thiele
Hi Christophe, please post your ant and java versions. As ant is changing rapidly between versions, this could be the problem. Using the war build works fine for me with the latest CVS update. Regards, Olaf On Wed, 16 Feb 2005 10:39:49 +0100, Christophe Noel <[EMAIL PROTECTED]> wrote: > Yes, I

Re: [Nutch-dev] Adding title and site to scoring

2005-02-16 Thread Michael Nebel
Hi, I'm afraid, I'll have to deal with the ranking the next days / weekend. So perhaps I can contribute some time and work for all of us. Before taking the wrong way, some questions in advance: - using luke to look at my indexes I see a field called - some more checking: there is a query-site-pl

[Nutch-dev] JavaCC Query Parser

2005-02-16 Thread Christophe Noel
I'd like to allow Lucene Fuzzy Query as "informatian~"... Should I modify "NutchAnalysis.jj" from net.nutch.analysis or is there an easier way to make this change ? Thanks a lot for your help. Christophe. --- SF email is sponsored by - The IT Pr

Re: [Nutch-dev] Injecting URLs from database

2005-02-16 Thread Matt Kangas
Kelvin, (1) can be achieved by instantiating WebDBInjector and calling addPage() repeatedly. This method is public in CVS. (2) is best done with PrefixURLFilter; it uses a trie datastructure, which scales much better for thousands of rules. HTH, --Matt On Tue, 15 Feb 2005 20:45:12 +0100, Kelvin

[Nutch-dev] (no subject)

2005-02-16 Thread preethavm
sir, we are happy to get ur email id while searching for our project. And also we are planning to do a _mini project_ on search engines.we will be very thankful to u if u send ur suggestions.Hoping ur reply. Thanking You

[Nutch-dev] [ nutch-Bugs-1123792 ] UpdateDatabaseTool ignores url-filters

2005-02-16 Thread SourceForge.net
Bugs item #1123792, was opened at 2005-02-16 10:42 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=1123792&group_id=59548 Category: tools Group: None Status: Open Resolution: Non

Re: [Nutch-dev] Compilation problem

2005-02-16 Thread Christophe Noel
Yes, I forgot to set JAVA_HOME :))) But now I get the following : "BUILD FAILED ./nutch/build.xml:112: ./nutch/build/plugins not found. " Thanks. == Stefan Groschupf wrote: Check that you have installed a JDK and not a JRE. As well check if your JAVA_HOME environment variable point to the JDK.