return single document from duplicated documents in index

2006-06-08 Thread Alan Boo
G'day, I've two questions. Let's say the following is my index with 2 fields: title and contents title contents beer beer is good beer beer is good catsleepy dog what a cute one! beer beer

RE: Compound / non-compound index files and SIGKILL

2006-06-08 Thread Chris Hostetter
: Whom should I post to suggest adding a note to http://lucene.apache.org/java/docs/api/org/apache/lucene/document/Field.html#Field(java.lang.String,%20java.io.Reader) to say that the Reader is only actually read from when the document is added? Without internal knowledge, Documentation im

Adding Fields to Documents with UnStored Fields - crazy scheme?

2006-06-08 Thread Bob Arens
Hi, I've been handed a legacy index containing Documents with two Fields; one is a file ID, the other is contents of the file. The contents field was added using UnStored. Now, we want to add fields. Oh, the humanity! I've looked through the archives, and previously suggested methods wo

Re: IndexWriter.addIndexes & optimization

2006-06-08 Thread Yonik Seeley
On 6/8/06, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: When writing a unit test that compares RAMDirectory and FSDirectory performance for Lucene in Action I had a very hard time showing that RAMDirectory really is faster. :) For indexing, even if you open IndexWriter with an FSDirectory, it i

Re: fieldNorm

2006-06-08 Thread Otis Gospodnetic
In English, fieldNorm essentially means: "give term hits in shorter fields more weight/importance than those in longer fields". I believe the implementation is 1/sqrt(number of terms in field). Keep in mind that index-time boost is calculated into the field norm. Otis - Original Message -
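
As a rough illustration of the formula quoted in the message above, here is a minimal sketch in plain Java. It does not call Lucene; the class and method names are hypothetical, and folding the index-time boost in by simple multiplication is an assumption made for illustration. A real index may also store the norm with reduced precision, which this sketch ignores.

```java
public class FieldNormSketch {

    // Length normalization as described in the message:
    // 1 / sqrt(number of terms in the field).
    // Rejects non-positive term counts to avoid division by zero.
    public static float lengthNorm(int numTerms) {
        if (numTerms <= 0) {
            throw new IllegalArgumentException("numTerms must be positive");
        }
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    // Hypothetical helper: combines an index-time boost with the length
    // norm by multiplication (an assumption, not confirmed by the message).
    public static float fieldNorm(int numTerms, float indexTimeBoost) {
        return indexTimeBoost * lengthNorm(numTerms);
    }

    public static void main(String[] args) {
        // A 4-term field gets a larger norm than a 100-term field,
        // so a term hit in the shorter field counts for more.
        System.out.println("4 terms:   " + lengthNorm(4));
        System.out.println("100 terms: " + lengthNorm(100));
        System.out.println("4 terms, boost 2.0: " + fieldNorm(4, 2.0f));
    }
}
```

Under these assumptions, a document with few occurrences of a term in a short field can outrank one with many occurrences in a long field, which matches the behaviour asked about in the original fieldNorm question.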

fieldNorm

2006-06-08 Thread Seeta Somagani
Hi, My simple search application is returning hits ordered by the default relevance. But I noticed that when I searched, for example, for "what", the first hit has considerably fewer occurrences of the term "what" than those that follow. When I viewed the explanation for each document, all the value

Re: IndexWriter.addIndexes & optimization

2006-06-08 Thread Otis Gospodnetic
When writing a unit test that compares RAMDirectory and FSDirectory performance for Lucene in Action I had a very hard time showing that RAMDirectory really is faster. :) Just set your maxBufferedDocs to as high a number as your RAM/heap will let you, and pick a mergeFactor that is high, but do

Re: Fwd: Lucene 2.0.0 release available

2006-06-08 Thread Otis Gospodnetic
Hi Sven, Somebody asked about the same thing a few weeks back and opened a JIRA issue. It looks like the push to ibiblio servers did not happen, even though I added notes about that to the release HOWTO doc on the Wiki. My notes might be incorrect, so I commented on that JIRA issue asking fo

adding term information to Index

2006-06-08 Thread Patricio Galeas
Hello, I am trying to use Lucene to make some experiments with the distribution of words in documents. Using the TermPositionVector, one can compute some statistics about word positions (mean, standard deviation, etc.). Is it possible to store such statistical values directly in the Lucene index? For

adding term information to index ...

2006-06-08 Thread Patricio
Hello, I am trying to use Lucene to make some experiments with the distribution of words in documents. Using the TermPositionVector, one can compute some statistics about word positions (mean, standard deviation, etc.). Is it possible to store such statistical values directly in the Lucene index? For

Re: Multisearch Problem

2006-06-08 Thread Daniel Naber
On Thursday 08 June 2006 19:59, Dan Wiggin wrote: > java.lang.ArrayIndexOutOfBoundsException: -1 You get this when you access a Lucene 1.9/2.0 index with Lucene 1.4. Maybe your classpath contains two versions of Lucene. Regards Daniel -- http://www.danielnaber.de -

Multisearch Problem

2006-06-08 Thread Dan Wiggin
Hello everybody, I have a new problem that I want to share with you :D The problem is doing a search in a Multisearcher. Normally this multisearcher works well. When I start my tomcat and my two indexes are empty, and if I do a search with this index I have no exception and obviously no hits, but afte

RE: The fields in my document aren't getting analyzed

2006-06-08 Thread Satuluri, Venu_Madhav
I am sorry about the previous mail. It turns out that I was confusing the *stored* value of the field with the *indexed* value. The indexed value is what I expect it to be, alright. Thanks all, Venu -Original Message- From: Satuluri, Venu_Madhav Sent: Thursday, June 08, 2006 10:08 PM To:

RE: Property comparison possible??

2006-06-08 Thread Mordo, Aviran (EXP N-NANNATEK)
AFAIK it is not possible to perform this kind of query with Lucene. Aviran http://www.aviransplace.com -Original Message- From: Robert Haycock [mailto:[EMAIL PROTECTED] Sent: Thursday, June 08, 2006 12:59 PM To: java-user@lucene.apache.org Subject: Property comparison possible?? Is it po

Property comparison possible??

2006-06-08 Thread Robert Haycock
Is it possible to perform a search using fields instead of terms, eg. like this sql: SELECT col1, col2 FROM table1 WHERE col1 = col2 Or is this something that might be going on the whiteboard? Thanks, Rob.

The fields in my document aren't getting analyzed

2006-06-08 Thread Satuluri, Venu_Madhav
Hi all, It seems to me my Fields aren't getting analyzed before they are stored in the index. I am sure I am overlooking some obvious point here, but can't figure out what that is. I recently migrated to Lucene 2.0 from Lucene 1.4.3, and my fields used to get indexed earlier, so maybe I am missing s

SynonymsQuery

2006-06-08 Thread Vinny Ng
Dear all, I tried to use SynonymsQuery mentioned at: http://mail-archives.apache.org/mod_mbox/lucene-java-user/200603.mbox/[EMAIL PROTECTED] Compiling the given source code results in error message: "The method createWeight(Searcher) in the type Query is not applicable for the arguments (Searc

Re: Fwd: Lucene 2.0.0 release available

2006-06-08 Thread Sven Duzont
Hello, Nice job, thanks. Question: When will it be available on Maven2 repositories? Thanks --- Sven On Saturday, 27 May 2006 at 13:37:54, you wrote: DN> -- Forwarded Message -- DN> Subject: Lucene 2.0.0 release available DN> Date: Saturday 27 May 2006 05:57 DN> From: Doug

RE: Compound / non-compound index files and SIGKILL

2006-06-08 Thread Rob Staveley (Tom)
OK, I see it now. It all happens in DocumentWriter.invertDocument. I should have looked at the Lucene source. IndexWriter.addDocument -> DocumentWriter.addDocument -> DocumentWriter.invertDocument Many thanks, Chris. Whom should I post to suggest adding a note to http://lucene.apache.org/java/d