MoreLikeThis jar doesn't contain classes

2008-02-22 Thread Jonathan Ariel
Hi, I've downloaded Lucene 2.3.0 binaries and in the contrib folder I can see the Similarity package, but inside the Jar there are no classes! Downloading the sources I ran into the same issue. Am I doing something wrong? Where should I get the MoreLikeThis classes from? Thanks! Jonathan

MoreLikeThis queries

2008-02-22 Thread Jonathan Ariel
Hi, I'm trying to use MoreLikeThis but I can't find how to make a MoreLikeThis query that will return related documents given a document and some conditions, like country field in the related documents should be 1, etc. Is there any documentation on how to do this kind of queries? Thanks, Jonat

MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
Is there any way to execute a MoreLikeThis over a subset of documents? I need to retrieve a set of interesting keywords from a subset of documents and not the entire index (imagine that my index has documents categorized as A, B and C and I just want to work with those categorized as A). Right now

Re: MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
ties to > Query moreQuery = mlt.like( target); > BooleanQuery bq = new BooleanQuery(); > bq.add(moreQuery, BooleanClause.Occur.MUST); > Query restrictQuery = new TermQuery(new Term("Category", "A")); > bq.add(restrictQuery, BooleanClause.Occur.MUST); > Hits hits = is.search

Re: MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
I could have up to 2 million documents and growing. On Tue, Apr 22, 2008 at 7:29 PM, Karl Wettin <[EMAIL PROTECTED]> wrote: > Jonathan Ariel skrev: > > Is there any way to execute a MoreLikeThis over a subset of documents? I > > need to retrieve a set of interesting keyw

Re: MoreLikeThis over a subset of documents

2008-04-22 Thread Jonathan Ariel
in category A, only add the text to the > catA field. Now do MoreLikeThis on catA. This assumes you know the > categories at index time, of course. > Redundant but workable. > > -Glen > > 2008/4/22 Jonathan Ariel <[EMAIL PROTECTED]>: > > Is there any way to execute a M

Re: MoreLikeThis over a subset of documents

2008-04-23 Thread Jonathan Ariel
hange MoreLikeThis to receive a set of term frequencies, instead of an IndexReader, and use that to do all the process. Anyone knows if a document contains for his fields the term frequencies? On Wed, Apr 23, 2008 at 7:46 AM, Karl Wettin <[EMAIL PROTECTED]> wrote: > Jonathan Ariel skrev: > &g

MoreLikeThis patch to support boost factor

2008-04-23 Thread Jonathan Ariel
This is a patch I made to be able to boost the terms with a specific factor beside the relevancy returned by MoreLikeThis. This is helpful when having more then 1 MoreLikeThis in the query, so words in the field A (i.e. Title) can be boosted more than words in the field B (i.e. Description). Any f

Re: boosting relevance of certain documents

2008-04-25 Thread Jonathan Ariel
How are you analyzing the searchable field? On Fri, Apr 25, 2008 at 12:49 PM, Daniel Freudenberger < [EMAIL PROTECTED]> wrote: > Hello, > > > > I'm using lucene within a new project and I'm not sure about how to solve > the following problem: My index consists of the two attributes "id" and > "se

Re: boosting relevance of certain documents

2008-04-25 Thread Jonathan Ariel
n Fri, Apr 25, 2008 at 2:30 PM, Daniel Freudenberger < [EMAIL PROTECTED]> wrote: > I'm using the StandardAnalyzer - hope this answers your question (I'm > quite > new to the lucene thing) > > -Original Message- > From: Jonathan Ariel [mailto:[EMAIL PROTECTED]

Indexing

2007-08-22 Thread Jonathan Ariel
Hi, I'm new to this list. So first of all Hello to everyone! So right now I have a little issue I would like to discuss with you. Suppose that your are in a really big application where the data in your database is updated really fast. I reindex lucene every 5 min but since my application lists ev

Re: Indexing

2007-08-22 Thread Jonathan Ariel
RAMDir, > but that's "an exercise for the reader", > > You could also define the problem away and just live > with a 5 minute latency. > > Best > Erick > > On 8/22/07, Jonathan Ariel <[EMAIL PROTECTED]> wrote: > > > > Hi, > > I'm ne