Re: problems with lucene in multithreaded environment

2004-06-03 Thread Jayant Kumar
We conducted a test on our search for 500 requests given in 27 seconds. We noticed that in the first 5 seconds, the results were coming in 100 to 500 ms. But as the queue size kept increasing, the response time of the search increased drastically to approx 80-100 seconds. Please find enclosed jvm

Re: Writing a stemmer

2004-06-03 Thread Leo Galambos
Erik Hatcher <[EMAIL PROTECTED]> wrote: __ >> How proficient must I be in a language for which I wish to write the >> stemmer? >I would venture to say you would need to be an expert in a language to >write a decent stemmer. I'm sorry for a self-promo ;), but the stemmer of egothor proje

bonus for exact case match

2004-06-03 Thread David Spencer
Does anyone have any experiences with giving a bonus for exactly matching case in queries? One use case is in the java world maybe I want to see references to "Map" (java.util.Map) but am not interested in a (geographical) "map". I believe, in the context of Lucene, one way is to have an Analy

Re: Writing a stemmer

2004-06-03 Thread Erik Hatcher
On Jun 3, 2004, at 4:09 PM, Musku, Anil (LA) wrote: Can anyone provide some help on writing a stemmer for non-english languages? Have a look at the snowball project in the Lucene sandbox. If its non-European-based languages, I suspect it's quite complex. It's highly language dependent. How pr

Re: Writing a stemmer

2004-06-03 Thread Grant Ingersoll
Anil, I suppose it depends on how complex the language is and what is acceptable for your program. I have written a couple of stemmers that are fairly straightforward based on papers that I have read and work well for the langs. we are using. Your best bet is probably to do a literature searc

Re: FileNotFoundException when trying to indexing.

2004-06-03 Thread Prasad Ganguri
Hi Terry, Thanks for your reply. I identified the problem. I am creating a new index, but passing a parameter that the index is existing one. So, it is looking for segment file (thinking that the index is existing). You rightly pointed out that, i am not calling UseCompoundFile method. I will in

Writing a stemmer

2004-06-03 Thread Musku, Anil (LA)
Hi, Can anyone provide some help on writing a stemmer for non-english languages? How proficient must I be in a language for which I wish to write the stemmer? Regards, Anil - To unsubscribe, e-mail: [EMAIL PROTECTED] For additi

Re: problems with lucene in multithreaded environment

2004-06-03 Thread Supun Edirisinghe
I noticed delays when concurrent threads query an IndexSearcher too. our index is about 550MB with about 850,000 docs. each doc with 20-30 fields of which only 3 are indexed. Our queries are not very complex -- just 3 required term queries. this is what my test did: intialilize an array of terms

Re: FileNotFoundException when trying to indexing.

2004-06-03 Thread Terry Steichen
Prasad, I think you'll have to provide more code so we can see what's actually going on. BTW, I don't see you calling the UseCompoundFile method (unless you do it inside indexFile/Directory) - I wonder if that could be an issue? Regards, Terry PS: I run on XP/Pro just fine, so there's nothing

disableLuceneLocks system property

2004-06-03 Thread Supun Edirisinghe
why is disableLuceneLocks not in the list at http://jakarta.apache.org/lucene/docs/systemproperties.html ? is it not advisable to use anymore? is it still valid? will it be supported in version 1.4? how is the usage? java ... -DdisableLuceneLocks ... or java ... -DdisableLuceneLocks=true thanks

FileNotFoundException when trying to indexing.

2004-06-03 Thread Prasad Ganguri
I am using Lucene for buiding our document management system. I tested it in Windows2000 Professional and got successful execution. Recently, when we ported the code onto an WindowsXP Professional, we are getting the following exception. I tried to create segments folder using my code, but throwin

Re: flush an index directory

2004-06-03 Thread jt oob
If I understand your question correctly you have document, you index it, you change the document, you index the document again. This will lead to the document being in your index twice, once with each version. Searches will return hits for either the old or new version of the document. If you chan

flush an index directory

2004-06-03 Thread uddam chukmol
Hi all, I'm in real trouble with the way Lucene organizes its index. First time, i run an application to index a text. Then, I changed the text and executed it once again, but still can not get my index refreshed. Is there anyway to do this? PLEASE HELP!!! Thanks you before hand! Uddam

Re: why the score is not 1.0?

2004-06-03 Thread Erik Hatcher
Without looking at your code, a good first suggestion is to se IndexSearcher.explain(Query,docId) to see why scores are they way they are. Erik On Jun 3, 2004, at 7:21 AM, uddam chukmol wrote: Dear all, I have another trouble in one of my program using Lucene. I tried to compare the

why the score is not 1.0?

2004-06-03 Thread uddam chukmol
Dear all, I have another trouble in one of my program using Lucene. I tried to compare the same string and use the same analyzer to index it. You can see my code as following: --