Question: Can lucene do parallel indexing?

2008-06-27 Thread David Lee
If I'm using a computer that has multiple cores, or if I want to use several computers to speed up the indexing process, how should I do that? Is there some kind of support for that in the API? David Lee

Nested Proximity searches

2008-06-30 Thread David Lee
Is it possible to do nested proximity searches with lucene? i.e. can I say I want a to be within 1 word of b and then that group to be within 4 words of c? The syntax ""a b"~1" c"~4 doesn't seem to work (since it treats the first two quotes as a pair and the later 2 as another pair).

Do Lucene Deletes delete the physical file? If yes, is there a way not to?

2008-07-02 Thread David Lee
iling list for simple questions like this? I tried googling, but didn't seem to get the information I wanted. Thanks! David Lee

Lucene Index Structure

2008-08-21 Thread David Lee
Clarification question: If I don't store term vectors, then I: -- won't have information on the position of matching terms -- I don't have the term frequency vector -- but I should still have the frequency of terms per document in the .frq file, right? So what's the difference between the term f

Re: Lucene Index Structure

2008-08-21 Thread David Lee
ley wrote: > > On Thu, Aug 21, 2008 at 7:20 PM, David Lee <[EMAIL PROTECTED]> wrote: >> >>> Clarification question: >>> >>> If I don't store term vectors, then I: >>> -- won't have information on the position of matching terms >>&

Clarification about segments

2008-08-22 Thread David Lee
So from what I understand, is it true that if mergeFactor is 10, then when I index my first 9 documents, I have 9 separate segments, each containing 1 document? And when searching, it will search through every segment? Thanks! David

Re: Clarification about segments

2008-08-25 Thread David Lee
; and > #optimize(getMergeFactor()) > (btw #optimize() is equal to optimize(1) ). > > > Best regards > Karsten > > p.s. and yes, searching goes through every segment. > > > David Lee-26 wrote: > > > > So from what I understand, is it true that if

Lucene AND queries

2008-09-25 Thread David Lee
Hi, I was wondering when lucene queries two or more terms, does that mean the time it takes will be twice as long? For example if I search +lucene +apache, then does lucene get all the documents that match 'lucene' and all the documents that match 'apache', and then combine them together? Or can it

Re: Lucene AND queries

2008-09-25 Thread David Lee
ROTECTED]> wrote: > On Thu, Sep 25, 2008 at 1:39 PM, David Lee <[EMAIL PROTECTED]> wrote: > > I was wondering when lucene queries two or more terms, does that mean the > > time it takes will be twice as long? For example if I search +lucene > > +apache, then does lucene ge

Extracting Dates

2008-10-02 Thread David Lee
t of these projects are associated to lucene, someone might know. David Lee