Re: search for words separated by spaces

2002-03-13 Thread Winton Davies
nd look for just >I am a programmer >what is the result set returning? >Is it AND of all words or OR? >or is it just picking up the firs word "I" and searching for it? >Thanks again! >Aruna. > >-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >Fo

Re: Build index using RAMDirectory "out of memory" errors

2002-02-25 Thread Winton Davies
ting the "out of memory" errors ? > > > > >-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> -- Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mat

Re: results sorting

2002-02-19 Thread Winton Davies
gt; >> Also, if your collection is small, you can probably afford to simply >> enumerate all hit documents and sort them as you wish. >> >> Doug >> >> -- >> To unsubscribe, e-mail: >><mailto:[EMAIL PROTECTED]> >> For additional commands, e-mail: >

Re: Lucene Query Structure

2002-02-19 Thread Winton Davies
>understanding of the exact definition of the scoring system, especially as >it relates to the semantics of queries involving multiple terms. > >Once I get a clear understanding of this issue, I would be happy to write >it up and submit it as an addition to the FAQ/docs. > >Th

Re: Lucene Query Structure

2002-02-18 Thread Winton Davies
-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> -- Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture

Re: Phrase Query

2002-02-18 Thread Winton Davies
ee" (notice that the >comma is missing.) > >Does Lucene remove special characters before indexing the documents? > >Thanks! > >-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> --

Re: Performance Issue

2002-02-01 Thread Winton Davies
> >For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> -- Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ ` -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Anyone run Linux JVM 1.4 Beta 3 with lucene ?

2002-01-14 Thread Winton Davies
Hi guys, I'm getting stung by JVM 1.3.1_01 on Linux, max allocation of heap is about 1.9 gb. Anyway, I'm thinking of going to 1.4 ? Anyone run Lucene under this beta ? Cheers, Winton -- Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San

Re: One Question -- Successful Deployments

2002-01-11 Thread Winton Davies
sions. Cheers, Winton -- Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <

Re: One Question -- Successful Deployments

2002-01-11 Thread Winton Davies
the changes you made >to Lucene to support the query rate that you mentioned? > >Thanks, >Otis > > >--- Winton Davies <[EMAIL PROTECTED]> wrote: >> Hi, >> >>We're (Overture/Goto) evaluating Lucene ... email me specific >> questions.

Re: indexing big files

2002-01-08 Thread Winton Davies
on > > >-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-159

Re: One Question -- Successful Deployments

2002-01-08 Thread Winton Davies
free as far as I can tell). Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Re: Error while creating term

2002-01-04 Thread Winton Davies
I think it needs to be a field name (when you create a document it has to be composed of at least one field). Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To

Re: EJB solution for Lucene

2002-01-03 Thread Winton Davies
Hotmail. >http://www.hotmail.com > > >-- >To unsubscribe, e-mail: < >mailto:[EMAIL PROTECTED]> >For additional commands, e-mail: < >mailto:[EMAIL PROTECTED]> > > > > > > >-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >For addit

Re: boolean query

2002-01-03 Thread Winton Davies
to run. > >Carmine > > >-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (

Re: Killing the ongoing indexing

2001-12-03 Thread Winton Davies
ve me firsdt 50 matchs > date 1 ), then this is perfectly respectable thing to do. Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[

RE: Parallelising a query...

2001-11-29 Thread Winton Davies
ing I implement my own hit collector (basically just ignore the score...) Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL P

RE: Parallelising a query...

2001-11-29 Thread Winton Davies
Thanks ! (one thing to cross off my list of optimizations...) Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EM

GCJ and Lucene ?

2001-11-29 Thread Winton Davies
?) Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Re: Parallelising a query...

2001-11-29 Thread Winton Davies
actly whether it is warrants investigation. Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additiona

RE: Parallelising a query...

2001-11-29 Thread Winton Davies
(actually I don't think I could get 4 on the machine --- the index is nearly a Gig each). Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e

Parallelising a query...

2001-11-28 Thread Winton Davies
Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

RE: javac -O ?

2001-11-28 Thread Winton Davies
inton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

javac -O ?

2001-11-28 Thread Winton Davies
? Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mail

Re: OutOfMemoryError

2001-11-28 Thread Winton Davies
them is at the minimum. >> >> even if the multithreading doesn't increase performance I would be glad if I >> could just once get it running again. >> >> I would be even happier if someone could give me a hint what would be the >> best way to index this

Re: OutOfMemoryError

2001-11-28 Thread Winton Davies
e size of an entry that >gets parsed for a Document is about 1Kb.) > >thanx for any help! >chantal > >-- >To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> >For additional commands, e-mail: <mailto:[EMAIL PROTECTED]> Winton Davies Lead Engineer, Overture (N

Lucene, Java and GC

2001-11-27 Thread Winton Davies
pparently Java still has issues in the area of Scavenges vs Global GC. I'm going to be running experiments, and will let you know my finding. Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-

Re: Determine the line number of a match

2001-11-22 Thread Winton Davies
HI Juergen, I believe the closest thing is the Term proximity information, but I think that is relative token position, rather than the actually offset from the original. Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work

RE: Sorting Options for Query Results

2001-11-16 Thread Winton Davies
don't need the data stored in the Index -- just the DocID would suffice, then I could reference this from a big array (I have a couple of Gigs of RAM to play with). I'm down to about 200msecs response time for 8 million short records. I need it below 50 (ideally around 5 :)). Ch

RE: Sorting Options for Query Results

2001-11-16 Thread Winton Davies
the search ? I'm not quite sure how it is used, but it seems that Hits and HitCollector don't work the same way ? Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.

RE: Efficient doc information retrieval.

2001-11-15 Thread Winton Davies
Thanks so much Doug! Looks clever, I'll try it out :) Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECT

RE: RAMdirectory from Directory ?

2001-11-15 Thread Winton Davies
ink a RAMdir would be faster than the /tmpfs ? I am not familiar with how the comparative disk/io operations would work compared with what is set up inside a RAMdir Winton >> From: Winton Davies [mailto:[EMAIL PROTECTED]] >> >> It loaded into memory, but then when I tried a sea

Re: RAMdirectory from Directory ?

2001-11-14 Thread Winton Davies
ay(MiniSearchListings.java:175) at com.go2.idxsearch.godzilla.FastGodzilla.callSearchResult(FastGodzilla. java:55) at com.go2.idxsearch.godzilla.FastGodzilla.setSortType(FastGodzilla.java: 199) Anyone have any ideas why it could fail ? Cheers, Winton >Winton Dav

Re: Efficient doc information retrieval.

2001-11-14 Thread Winton Davies
Thanks anyway ! Much appreciate you thinking about it ? Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EM

Re: Efficient doc information retrieval.

2001-11-14 Thread Winton Davies
iot Kimber wrote: >Winton Davies wrote: >> >> Hi all, > >> In my application, I have to be able to return a list of documents, >> that have been uniqified according to an accountID. The most relevant >> document for an accountID is returned, and then susequent hi

Efficient doc information retrieval.

2001-11-14 Thread Winton Davies
elf ? (b) or have a special kind of field which is ultrafast to access given "i" ? (c) or anyway to speed up the existing behaviour! Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (6

Re: RAMdirectory from Directory ?

2001-11-14 Thread Winton Davies
Many thanks Gerhard! Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: &

RAMdirectory from Directory ?

2001-11-13 Thread Winton Davies
ory from a regular Directory ? Cheers, Winton Winton Davies Lead Engineer, Overture (NSDQ: OVER) 1820 Gateway Drive, Suite 360 San Mateo, CA 94404 work: (650) 403-2259 cell: (650) 867-1598 http://www.overture.com/ -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional

RE: Indexing problem

2001-11-02 Thread Winton Davies
Hi, I implemented this, and I do have a question -- What should the MergeFactor and MaxMergeDocuments be ? Cheers, Winton >Well, I don't know if there's an archive of the list, so this what Doug >wrote: > >" >A more efficient and slightly more complex approach would be to build >lar

Maximum file size problem

2001-11-02 Thread Winton Davies
Hi, I ran into a problem earlier this week, where by an index of 8 million small documents resulted in an index file of 2GB. It turns out this is a common file system limit (some say it might be a java limit as well). Anyway, I have no idea which index file it was, but it seems that I ne

Phrase (and other) scoring?

2001-11-01 Thread Winton Davies
Hi, So my manager asked an intersting question -- is the score of a phrase search normalised or not ? Lets imagine I have queried "Deer Hunter" /SLOP = 2 and there is just one document with: "Deer and Duck Hunter" Now, this comes back with a relevance of 1.0 -- however it clearly is and is

Querying an exact string match ?

2001-10-31 Thread Winton Davies
Hi, I have a field which I need to match exactly -- Ideally it shouldn't even be tokenized, but I can't find a way to do that -- adding a field that is indexed but not tokenized, doesn't seem to work -- or at least a Term Query doesn't work. eg the field is "x y z" -- and I do a phrase qu