Maximum indexable data

2002-03-07 Thread Herman Chen
Hi, Is there a limit for the amount of data indexable by a segment? If so is there a limit for searching? i.e. can I give MultiSearcher several indices that are all close to the maximum size. Thanks. -- Herman

Combining FuzzyQueries

2002-03-07 Thread Christian Schrader
I have the following problem. When I create a Fuzzyquery: FuzzyQuery fuzzy = new FuzzyQuery(new Term("categoryName", "test")); and add it to a new BooleanQuery finalQuery = new BooleanQuery(); finalQuery.add(fuzzy,false,false); fuzzy1.toString("contents"); gives me categoryName:test~ which

Moving an index from one Directory to another?

2002-03-07 Thread Avi Drissman
To improve speed for our application, I'm trying to move our index from the disk to a RAM directory when our database is opened, and move it back to the disk when the database is closed. The code looks something like this: -open- Directory diskDirectory = FSDirectory.getDirectory(index

Re: What type of indexer is Lucene? Question reworded.

2002-03-07 Thread Brandon Jockman
You should read the Lucene FAQ http://www.lucene.com/cgi-bin/faq/faqmanager.cgi ). Most of your questions (over the past few days) are already answered there. -B - Original Message - From: "Melissa Mifsud" <[EMAIL PROTECTED]> To: "Lucene User" <[EMAIL PROTECTED]> Sent: Thursday, March 0

RE: What type of indexer is Lucene? Question reworded.

2002-03-07 Thread Alex Murzaku
In http://www.lucene.com/publications.html you will find many answers to your questions. The papers "An Object-Oriented Architecture for Text Retrieval" and "Optimizations for Dynamic Inverted Index Maintenance" will be especially helpful for your understanding of Lucene. -Original Message---

Re: What type of indexer is Lucene? Question reworded.

2002-03-07 Thread Joshua O'Madadhain
Melissa: These questions are answered in the Lucene FAQ, which is located at http://www.lucene.com/cgi-bin/faq/faqmanager.cgi However, if I correctly understand your fundamental question, my understanding is that Lucene basically uses the vector model of IR. Joshua [EMAIL PROTECTED] Per Obs

Re: Lucene throws an ArrayIndexOutOfBoundsException() if the first te rm in my query string is a stopWord

2002-03-07 Thread Otis Gospodnetic
Hm, I've got the latest Lucene (from CVS) and don't have this issue. The query I tried on our index is: +title:of +title:someotherwordthatDOESgetmeresults Otis --- "Biswas, Goutam_Kumar" <[EMAIL PROTECTED]> wrote: > Dear Lucene Users > > Lucene throws an ArrayIndexOutOfBoundsException()

Lucene throws an ArrayIndexOutOfBoundsException() if the first term in my query string is a stopWord

2002-03-07 Thread Biswas, Goutam_Kumar
Dear Lucene Users Lucene throws an ArrayIndexOutOfBoundsException() if the first term in my query string is a stopWord. Why is it so ? I'm making AND as the default mode of search. So I'm adding an AND operator between each term of my query. That is if my query is 'cats dogs' I'm

Re: Lucene-created files

2002-03-07 Thread Melissa Mifsud
Thanks for your help!! I'll let you know how it goes. Melissa. - Original Message - From: "Mark Tucker" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Wednesday, March 06, 2002 5:53 PM Subject: RE: Lucene-created files This document is by no means complete, bu

What type of indexer is Lucene? Question reworded.

2002-03-07 Thread Melissa Mifsud
Hi again! I should really reword my question as follows: On which criteria are relevant documents chosen given a particular query and once retrieved, how are these documents ranked? The techniques by which this is done will then determine what type of IR model Lucene implements. Thanks agai

RE: Support for russian morphology in Lucene

2002-03-07 Thread Alex Murzaku
Real morphology (finding the root for all the forms of a word) in Russian might not be that easy since in Russian you have both prefixes (aspect) and suffixes (case, number, conjugation) that inflect a word. But, there are already efforts to write stemmers (suffix strippers) for Russian following

Re: phrase query and slop factor

2002-03-07 Thread Norbert Pabiƛ
Rather not. Terms in phrase query are ordered and slop factor specifies how much unordered they can be. Otis Gospodnetic wrote: > > Wouldn't that depend on how far from each other you wanted to allow > them to be? If you have a document with 100 words indexed and you are > searching for "first