Re: Complex proximity clauses within Lucene QueryParser

2007-08-05 Thread Paul Elschot
Nir, As Erick said, the surround query language does the things requested. It is in contrib/surround. Mark, On Sunday 05 August 2007 06:33, Mark Miller wrote: > You might check out Qsol: http://myhardshadow.com/qsol.php > > It is my attempt at just such a parser. Some of that logic can be open

Re: Complex proximity clauses within Lucene QueryParser

2007-08-05 Thread Mark Miller
Paul Elschot wrote: I had a quick look at the sources, and I could not find a parser definition file in the svn repository. Did I miss it, and if not, could you add it? As far as qsol syntax goes, here are some suggestions. Qsol mostly has binary operators, but with the following operators also as pre

Re: Bug in Lucene 2.2.0 code? Simple code included (StringIndexOutOfBoundsException).

2007-08-05 Thread Lukas Vlcek
Mark, thanks a lot. Based on my first tests it seems that I will be able to finish my initial goal. I will be doing something like the following: for (int i = 0; i < hits.length(); i++) { String[] texts = hits.doc(i).getValues("lotid"); for (String te

Re: Bug in Lucene 2.2.0 code? Simple code included (StringIndexOutOfBoundsException).

2007-08-05 Thread Mark Miller
It's questionable whether you are losing performance. Unless you have really large docs or a nasty slow analyzer, I have found it is usually as fast or faster to reanalyze than to use TermVectors, which can be quite time-consuming to load up and assemble a TokenStream from. You might run some tes

docFreq takes long time to execute in a multiple index environment

2007-08-05 Thread tierecke
Hi there, I have my 25 indexes of 1.8GB each read with MultiReader. I try to get the document frequency of all the terms in specific documents and it takes quite a long time - a document with 1000 terms takes around 4:30 minutes to calculate all the document frequencies of its terms - and there a

Reply: Get the terms and frequency vector of an indexed but unstored field

2007-08-05 Thread Kai Hu
Do you use the flag to state that the term should be stored? Does that mean you use Field.Store.YES to store the large data? Can it reduce the data's size? -Original Message- From: tierecke [mailto:[EMAIL PROTECTED] Sent: Friday, August 3, 2007 21:35 To: java-user@lucene.apache.org Subject: Re: Get the terms and fr

Re: Get the terms and frequency vector of an indexed but unstored field

2007-08-05 Thread Dmitry
What is the advantage of using a term frequency vector? Thanks, DT www.ejinz.com Search News - Original Message - From: "Kai Hu" <[EMAIL PROTECTED]> To: Sent: Sunday, August 05, 2007 8:40 PM Subject: Reply: Get the terms and frequency vector of an indexed but unstored field you use the flag

About multi-Threads in Lucene

2007-08-05 Thread Kai Hu
Hi, How do you handle adding, updating, and deleting documents from multiple threads? Do you use synchronized?
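
The "use synchronized" idea in the question can be illustrated with a minimal sketch. It assumes a hypothetical in-memory stand-in for an index: `SynchronizedIndex` and its methods are invented for illustration and are not Lucene classes, and this is not necessarily how any poster on the thread solved it. Every mutating method takes the same intrinsic lock, so add, update, and delete calls from different threads are serialized.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for an index; not a Lucene class. */
final class SynchronizedIndex {
    // Maps a document id to its text; guarded by this object's intrinsic lock.
    private final Map<String, String> docs = new HashMap<>();

    synchronized void add(String id, String text) {
        docs.put(id, text);
    }

    // Update is delete-then-add under one lock, so no other thread sees the gap.
    synchronized void update(String id, String text) {
        docs.remove(id);
        docs.put(id, text);
    }

    synchronized void delete(String id) {
        docs.remove(id);
    }

    synchronized int size() {
        return docs.size();
    }

    /** Runs four writer threads against one shared index and returns the final size. */
    static int runDemo() {
        SynchronizedIndex index = new SynchronizedIndex();
        Thread[] workers = new Thread[4];
        for (int t = 0; t < workers.length; t++) {
            final int worker = t;
            workers[t] = new Thread(() -> {
                for (int i = 0; i < 1000; i++) {
                    String id = worker + "-" + i;
                    index.add(id, "text");
                    index.update(id, "new text");
                    if (i % 2 == 0) {
                        index.delete(id); // every second document is removed again
                    }
                }
            });
            workers[t].start();
        }
        for (Thread w : workers) {
            try {
                w.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("interrupted while waiting for workers", e);
            }
        }
        return index.size();
    }

    public static void main(String[] args) {
        System.out.println(runDemo()); // 4 workers x 500 surviving documents = 2000
    }
}
```

A single lock is the simplest choice and trades write throughput for safety; whether it is fast enough depends on the application.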

Re: Can I do boosting based on term postions?

2007-08-05 Thread Cedric Ho
Paul, Hm... even being a Lucene newbie, I can understand your solution easily. Thanks =) Shailendra, Also thank you for your efforts in helping me to do this. I did learn a lot more about the inner workings of Lucene through your examples =) Thanks, Cedric On 8/4/07, Shailendra Sharma <[EMAIL PROT

Re: About multi-Threads in Lucene

2007-08-05 Thread Patrick Kimber
Hi Kai We use the Lucene Index Accessor contribution: http://www.nabble.com/Fwd%3A-Contribution%3A-LuceneIndexAccessor-t17416.html#a47049 Patrick On 06/08/07, Kai Hu <[EMAIL PROTECTED]> wrote: > Hi, > > How do you solve the problems when add,update,delete documents > in muti-threads,us