RE: addIndexes() Question

2004-12-23 Thread Garrett Heaver
Hi Ryan, I too am using addIndexes(), albeit for slightly different reasons. However, I would recommend only calling addIndexes() for fairly sizable slices, and on all slices at once. The reason I'm suggesting it is that optimize is called automagically both before and after the addIndexes method so

RE: Indexing a large number of DB records

2004-12-16 Thread Garrett Heaver
r app under a profiler to see where the time and memory are going. Otis --- Garrett Heaver <[EMAIL PROTECTED]> wrote: > Hi Homan > > I had a similar problem as you in that I was indexing A LOT of data > > Essentially how I got round it was to batch the index. >

RE: Indexing a large number of DB records

2004-12-15 Thread Garrett Heaver
Hi Homan, I had a similar problem as you in that I was indexing A LOT of data. Essentially, how I got round it was to batch the index. What I was doing was to add 10,000 documents to a temporary index, use addIndexes() to merge the temporary index into the live index (which also optimizes the live i
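
The batching pattern described in this message can be sketched roughly as follows. This is only an illustration under stated assumptions: `FakeIndex`, `addAll` and `indexInBatches` are hypothetical stand-ins invented for this sketch and are not Lucene classes or methods. In a real application the temporary and live indexes would be Lucene indexes and the merge step would be the addIndexes() call mentioned above.

```java
import java.util.ArrayList;
import java.util.List;

public class BatchedIndexSketch {

    // Hypothetical stand-in for an index: an in-memory list of documents.
    // This is NOT a Lucene class.
    static final class FakeIndex {
        final List<String> docs = new ArrayList<>();
        int merges = 0;

        // Stand-in for the merge step (addIndexes() in the message above).
        void addAll(FakeIndex other) {
            docs.addAll(other.docs);
            merges++;
        }
    }

    // Fill a temporary index with up to batchSize records, merge it into
    // the live index, then start a fresh temporary index.
    static FakeIndex indexInBatches(Iterable<String> records, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        FakeIndex live = new FakeIndex();
        FakeIndex temp = new FakeIndex();
        for (String record : records) {
            temp.docs.add(record);
            if (temp.docs.size() == batchSize) {
                live.addAll(temp);
                temp = new FakeIndex();
            }
        }
        // Merge the final, partially filled batch, if there is one.
        if (!temp.docs.isEmpty()) {
            live.addAll(temp);
        }
        return live;
    }

    public static void main(String[] args) {
        List<String> records = new ArrayList<>();
        for (int i = 0; i < 25000; i++) {
            records.add("record-" + i);
        }
        FakeIndex live = indexInBatches(records, 10000);
        // Prints "25000 documents, 3 merges"
        System.out.println(live.docs.size() + " documents, " + live.merges + " merges");
    }
}
```

With a batch size of 10,000, the live index is touched once per batch rather than once per document, which is the point of the approach.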

C# Ports

2004-12-15 Thread Garrett Heaver
I was just wondering what tools (JLCA?) people are using to port Lucene to C#, as I'd be very interested in converting things like the Snowball stemmers, WordNet etc. Thanks, Garrett

maxDoc()

2004-12-09 Thread Garrett Heaver
Can anyone please explain to me why maxDoc returns 0 when Luke shows 239,473 documents? maxDoc returns the correct number until I delete a document. And I have called optimize after the delete, but still the problem remains. Strange. Any ideas greatly appreciated. Garrett

RE: addIndexes() Size

2004-12-07 Thread Garrett Heaver
fos.info(0))); Perhaps this is resulting in an unnecessary ballooning of the index? I'll leave it for someone with a better understanding of the underlying file system to answer... Thanks Garrett -Original Message- From: Garrett Heaver [mailto:[EMAIL PROTECTED] Sent: 06 December 2004 17

RE: addIndexes() Size

2004-12-06 Thread Garrett Heaver
(and maybe 1.4.1?) that kept some index files around that were not used. Are you using Lucene 1.4.3? If not, try that and see if it helps. Erik On Dec 6, 2004, at 12:17 PM, Garrett Heaver wrote: > No there are no duplicate copies - I've the correct number when I view > th

RE: addIndexes() Size

2004-12-06 Thread Garrett Heaver
ies of the same Document. Does your temp index 'overlap' with A index in terms of Documents? If so, you will end up with multiple copies, as the addIndexes method doesn't detect and remove duplicate Documents. Otis --- Garrett Heaver <[EMAIL PROTECTED]> wrote: > Hi. > >

addIndexes() Size

2004-12-06 Thread Garrett Heaver
Hi. It's probably really simple to explain this, but since I'm not up to speed on the way Lucene stores the data I'm a little confused. I'm building an Index, which resides on Server A, with the Lucene Service running on Server B. Now, not to bore you with the details, but because of the networ