documentation on Lucene

2002-04-03 Thread Suhas Indra
Hello folks, I am new to Lucene search engine. I have read about the power of Lucene in indexing and search. I just browsed through the site http://jakarta.apache.org/lucene to find about the documentation on Lucene classes. But I was unable to find the required information(about the abstract cla

storing index in third party database.

2002-04-03 Thread amithnz
Hi all I want to index the datas which I already stored in a thirdparty database table and develop a search facility using lucene. I am thinking of storing this indexes back to the database in another table. I know for this we have to create a 'directory' which do all the indexing operations,

Re: compiling lucene

2002-04-03 Thread Otis Gospodnetic
JavaCC 2.1 works, too. This is how I have it set up: [otis@linux2 otis]$ ls -al /usr/local/.version/javacc2.1/ total 44 drwxrwxr-x6 otis otis 4096 Jan 28 06:50 . drwxr-xr-x 20 otis otis 4096 Apr 2 23:32 .. drwxrwxr-x3 otis otis 4096 Jan 28 06:50 bin

Re: storing index in third party database.

2002-04-03 Thread Otis Gospodnetic
If you want to store indices in a database search the mailing list archives for SqlDirectory. Once I considered using it for one application at work, so I asked its author about performance. The answer was that it doesn't perform all that well when the index grows, if I recall correctly. Conseq

Re: storing index in third party database.

2002-04-03 Thread Karl Øie
without having investigated the problem much i would think that a SQL database would be a very bad match for lucene as most of lucene's working is creating key's for words and documents and then creating indexes of these keys. for these purposes a SQL database is an unecessary overhead, not eve

RE: storing index in third party database.

2002-04-03 Thread David Elworthy
> -Original Message- > From: Karl Øie [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, April 03, 2002 10:00 AM > To: Lucene Users List > Subject: Re: storing index in third party database. > > > without having investigated the problem much i would think that a SQL > database would be a ver

Re: storing index in third party database.

2002-04-03 Thread Karl Øie
hm puting lucene on top of BDB would actually be quite cool. it would provide lucene with recovery and transaction handling but as far as i have seen in the lucene implementation of Directory it pushes back a inputstream, for BDB this would require us to iterate over the keys and gener

Re: documentation on Lucene

2002-04-03 Thread Peter Carlson
Hi Suhas, I think you have checked out everything (except maybe the FAQ's). Do you have a specific question? --Peter On 4/2/02 2:40 AM, "Suhas Indra" <[EMAIL PROTECTED]> wrote: > Hello folks, > > I am new to Lucene search engine. I have read about the power of Lucene in > indexing and search

(ZipDirectory) RE: storing index in third party database.

2002-04-03 Thread Spencer, Dave
Another interesting variation - possibly - is storing the index in a zip file (thus we'd have "ZipDirectory"). Then, say, the index would be in one on-disk-file (thus, "easier to manage") and in some applet applications the whole thing could easily/efficiently be loaded. Just did a test and a on

RE: search similar docs?

2002-04-03 Thread Spencer, Dave
Can't you feed the text of the orig/matching doc to the search engine as a query and see what docs it returns? Then a "similar" has words in common w/ the orig doc. I've done this kind of monster query with some of our internal systems - we have support mail that comes in, and all kinds of intran

What does it take to optimize?

2002-04-03 Thread Robert A. Decker
My index is larger than it should be. My deletable file has entries. I'm trying to optimize the index, but it just doesn't seem to be doing anything. Here's how I'm trying to optimize: IndexWriter writer = new IndexWriter(pathToIndex, fragmentAnalyzer,

RE: What does it take to optimize?

2002-04-03 Thread Aruna Raghavan
Hi, I would call close after I call optimize... -Original Message- From: Robert A. Decker [mailto:[EMAIL PROTECTED]] Sent: Wednesday, April 03, 2002 1:15 PM To: Lucene Users List Subject: What does it take to optimize? My index is larger than it should be. My deletable file has entries.

Case Sensitivity

2002-04-03 Thread Alan Weissman
What can I do to configure Lucene to make in case insensitive? Thanks, Alan -- To unsubscribe, e-mail: For additional commands, e-mail:

Re: (ZipDirectory) RE: storing index in third party database.

2002-04-03 Thread Peter Carlson
I am doing some testing on managing the underlying data in a zip archive and found that there is about a 15ms hit to use a zip vs. grabbing directly from file system. This is very reasonable for me vs. having to manage moving around a 1000's + files. --Peter On 4/3/02 9:10 AM, "Spencer, Dave"

RE: Case Sensitivity

2002-04-03 Thread Aruna Raghavan
Hi, I worked around the problem by converting everything to lowercase in my code prior to indexing into lucene and also prior to searching for a string. Ofcourse, I also had to use pattern matching to change bool operators such as ANDs and ORs to uppercase again because lucene expects those to be

RE: Case Sensitivity

2002-04-03 Thread Joshua O'Madadhain
Alan, Aruna: The built-in solution is to use LowerCaseFilter in your Analyzer. (The SimpleAnalyzer, StopAnalyzer, and StandardAnalyzer classes already do this; see the Lucene API docs to see which filters each uses.) The FAQ includes an example implementation of an Analyzer if you want to build

RE: Case Sensitivity

2002-04-03 Thread Aruna Raghavan
Hi, I am using StandardAnalyzer - the problem was with wildcard queries being case sensitive. Even with Standard Analyzer, you have to worry about case sensitivity in this case. Thanks for the tip on example Analyzer, I will take a peek. -Original Message- From: Joshua O'Madadhain [mailto

Re: Case Sensitivity

2002-04-03 Thread Peter Carlson
You can use the standard analyzer. This lower cases all the words (it uses the lowerCaseFilter). Note this also uses the stop word filter so your results may vary. Also when you index, be sure to use text instead of keyword as the field type since the keyword doesn't go through the filter. --Pe

Rationale for having boolean operators as ALL CAPS

2002-04-03 Thread Peter Carlson
Hi, Can someone tell me the rationale for having the boolean operator only work if they are all caps? Thanks --Peter -- To unsubscribe, e-mail: For additional commands, e-mail:

Objects as search results

2002-04-03 Thread Kelvin Tan
Here's a topic which to my recollection (surprisingly) hasn't been brought up: Assuming development in an object-oriented environment, it's a fair assumption that the eventual target of searching is an object. How are developers making this happen? Are all fields of the objects indexed and displa

Querying multiple fields of a index

2002-04-03 Thread Harpreet S Walia
Hi, Is it possible to query multiple fields of a given index and get the result based on this combined query. i.e for example if i want to serach for a word "lucene" in the title field and the word "engine" in the summary filed and want the results based on these words . How can i achieve this

Re: compiling lucene

2002-04-03 Thread Victor Hadianto
> JavaCC 2.1 works, too. > This is how I have it set up: Yes, to confirm, a list member pointed out earlier that I have to _install_ JavaCC first, serve me right not redaing tfm. Sorry for the noise -- Victor Hadianto --- Every cloud engenders not a storm. -- William Shakespeare,