Lucene used

2002-05-01 Thread Mark Ayad
Hi All, I just started to use the free Java IDE from Eclipse, www.eclipse.org. I've noticed that the M5 version uses Lucene as the search engine for the help system. It just indexed my help plugin, with over 500M of HTML and PDF documents placed in a zip, in under 5 min!! Since all eclipse is

Re: Homogeneous vs Heterogeneous indexes (was: FileNotFoundException)

2002-05-01 Thread Dmitry Serebrennikov
Subject: Re: Homogeneous vs Heterogeneous indexes (was: FileNotFoundException) From: petite_abeille <[EMAIL PROTECTED]> Date: Wed, 1 May 2002 08:37:51 +0200 To: "Lucene Users List" <[EMAIL PROTECTED]> On Wednesday, May 1, 2002, at 12:41 AM, Dmitry Serebrennikov wrote: > - the numb

Re: FileNotFoundException: Too many open files

2002-05-01 Thread Dmitry Serebrennikov
PA, > On average, there seem to be less than one hundred Lucene files per index. You are probably past this point by now, but since I didn't see anyone pick up on this, I wanted to respond. "Less than a hundred" is definitely too many files for a Lucene index, unless you have a very large nu

Re: term search speeds

2002-05-01 Thread Otis Gospodnetic
Caching? The OSes usually cache recently opened files... Otis --- a person <[EMAIL PROTECTED]> wrote: > Does anyone know exactly why, when searching for a term, the engine is > much slower on the first search of a term than on subsequent searches > of the same term? > > Thanks

term search speeds

2002-05-01 Thread a person
Does anyone know exactly why, when searching for a term, the engine is much slower on the first search of a term than on subsequent searches of the same term? Thanks

Re: indexing PDF files

2002-05-01 Thread Otis Gospodnetic
> > Hm, this should be a FAQ. > > Maybe it should... ;-) It is now. > > Check Lucene contributions page, there are some starting points > there, > > Well, this seems to be a very popular request... In fact I need > something like that also. Unfortunately, there seems to be no > authoritative

Re: Options for sorting on an integer or date

2002-05-01 Thread Otis Gospodnetic
Hello, --- Joel Bernstein <[EMAIL PROTECTED]> wrote: > At my company we are trying to decide on a new search engine. > I am very impressed with what I see with Lucene and am thinking very > seriously of not going with AltaVista, FAST etc... :) > One of the things that is very important to us is sorting

Options for sorting on an integer or date

2002-05-01 Thread Joel Bernstein
At my company we are trying to decide on a new search engine. I am very impressed with what I see with Lucene and am thinking very seriously of not going with AltaVista, FAST etc... One of the things that is very important to us is sorting by an integer or by a date, which Lucene currently cannot do. So

Searching UNICODE

2002-05-01 Thread Hyong Ko
Hello, I think there's something wrong with the QueryParser.jj file. I downloaded lucene-1.2-rc4-src and compiled successfully with JAVA_UNICODE_ESCAPE=true and DEBUG_TOKEN_MANAGER = true. My output debug info for Indexing looked okay. It showed the correct byte arrays in UTF8. However, when I

Re: Search all fields

2002-05-01 Thread Peter Carlson
Thanks, I'll fix it. --Peter On 5/1/02 6:54 AM, "Armbrust, Daniel C." <[EMAIL PROTECTED]> wrote: > There's a cut and paste error on that contributions page, with the link for > multiple field searching. > > It reads (notice the two http://'s) in the link > > href="http://http://www.mail-arc

RE: Search all fields

2002-05-01 Thread Armbrust, Daniel C.
There's a cut and paste error on that contributions page, with the link for multiple field searching. It reads (notice the two http://'s) in the link href="http://http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg00775.html"> http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg0

Re: indexing PDF files

2002-05-01 Thread Peter Carlson
I don't know what they have to offer, but I think Adobe has something. Here is something I just found on the topic from Adobe's site. How can I license Acrobat Viewer to distribute with my own products or to use in my custom Java development? How much will it cost to license? Adobe Acrobat Viewe

Re: Search all fields

2002-05-01 Thread Peter Carlson
There is an example of how to do this in the contributions section of the website (it's toward the bottom). --Peter On 5/1/02 5:22 AM, "Christoph Kiehl" <[EMAIL PROTECTED]> wrote: > > is it somehow possible to simply search all indexed fields, without > explicitly naming them in parse()? Or is

Search all fields

2002-05-01 Thread Christoph Kiehl
Hi, is it somehow possible to simply search all indexed fields, without explicitly naming them in parse()? Or is there a method to get all fields ever indexed? Thanks Christoph