RE: "Catalog" backend for document stored fields?

2006-11-14 Thread Robichaud, Jean-Philippe
[sorry for the long delay for my answer, we are having some issues with our mail server...] Thanks for your comment. Yes it would make sense if the log files were not so big. In fact, I'm only indexing a subset of the log information. Because I store the information in Lucene, it is easier and f

RE: "Catalog" backend for document stored fields?

2006-11-06 Thread Robichaud, Jean-Philippe
[sorry for the long delay for my answer, we are having some issues with our mail server...] Thanks for your comment. Yes it would make sense if the log files were not so big. In fact, I'm only indexing a subset of the log information. Because I store the information in Lucene, it is easier and f

RE: "Catalog" backend for document stored fields?

2006-10-23 Thread Robichaud, Jean-Philippe
age- From: Mike Klaas [mailto:[EMAIL PROTECTED] Sent: Friday, October 20, 2006 5:00 PM To: java-user@lucene.apache.org Subject: Re: "Catalog" backend for document stored fields? On 10/20/06, Robichaud, Jean-Philippe <[EMAIL PROTECTED]> wrote: > 3- Any ideas on how

"Catalog" backend for document stored fields?

2006-10-20 Thread Robichaud, Jean-Philippe
Hello to all of you! I'm using Lucene to index millions a relatively small documents. In fact, I'm indexing logs from a transaction-based application. Each document represents what happened inside during 'transaction'. Each of them is composed by 5-6 main 'states' which are themselves compos

Optimizing insertion of duplicate documents

2005-09-06 Thread Robichaud, Jean-Philippe
Hi Everyone, I have a special scenario where I frequently want to insert duplicates documents in the index. For example, I know that I want 400 copies of the same document. (I use the docboost of something else so I can't just add one document and set the docboost to 400). I would like to hac

Writting a custom query/indexSearcher

2005-07-18 Thread Robichaud, Jean-Philippe
Hi everyone. I need a special query type that looks like a phrase query but with special logic inside (like allowing inversions of certain terms only and not of others, special score manipulation on certain 'events', ...) I wonder what approach I should take? How does someone build a custom q

Getting the "directory/location" of an IndexReader/IndexWriter

2005-06-16 Thread Robichaud, Jean-Philippe
Hi Everyone. I'm currently in a situation where I have multiples indexSearcher opened at the same, each on different indices. They are kept inside a "IndicesManager" that export getSearcherAtLocation/FreeSearcher method. I would like to be able to log the "path" used by a searcher I'm about to "c

RE: Need a way to set a result limit on a particular field

2005-06-15 Thread Robichaud, Jean-Philippe
It may be simpler and more effective to use the Hits object and keep the number of time each host was actually "returned" to the user and skip it if the limit has been reach. This way, if your users just look at the 10-20 highest hits, you will save you a lot of processing time, especially if you

Queries boost and scoring problems

2005-06-15 Thread Robichaud, Jean-Philippe
Ok, I know that usually, the scores returned by Lucene do not mean "really" something. But in my case, it does, I play with the similarity and bla bla bla... Now my concern is that the Query.setBoost() does not always seems to affect the score. I've built a simple test (code completely at the e

SimilarityDelegator examples ?

2005-06-10 Thread Robichaud, Jean-Philippe
Hi Everyone, I've been using Lucene a lot and I would like to know how the SimilarityDelegator should be used. I would like to override only the lengthNorm member of the DefaultSimilarity and I understand that this is exactly the purpose of SimilarityDelegator ? Am I right? Does this class

RE: ACLs and Lucene

2005-05-30 Thread Robichaud, Jean-Philippe
What about: http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/src/java/org/apache/luce ne/index/ParallelReader.java?rev=169859&view=markup Jp -Original Message- From: Bruce Ritchie [mailto:[EMAIL PROTECTED] Sent: Monday, May 30, 2005 11:26 AM To: java-user@lucene.apache.org Subject: RE:

RE: PerFieldSimilarity

2005-05-05 Thread Robichaud, Jean-Philippe
oug Cutting [mailto:[EMAIL PROTECTED] Sent: Wednesday, May 04, 2005 5:10 PM To: java-user@lucene.apache.org Subject: Re: PerFieldSimilarity Robichaud, Jean-Philippe wrote: > How cool, I did not knew that... that may help me... If I understand you > correctly, I can create a boolean que

RE: PerFieldSimilarity

2005-05-04 Thread Robichaud, Jean-Philippe
java-user@lucene.apache.org Subject: Re: PerFieldSimilarity Robichaud, Jean-Philippe wrote: > Again, I can change > the similarity of the reader at run-time and issue specific queries, summing > the score myself, but that is pretty inefficient. You can also specify a Similarity implementation

RE: PerFieldSimilarity

2005-05-04 Thread Robichaud, Jean-Philippe
From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Tuesday, May 03, 2005 7:40 PM To: java-user@lucene.apache.org Subject: Re: PerFieldSimilarity On May 3, 2005, at 5:57 PM, Robichaud, Jean-Philippe wrote: > Hi Everyone, > > I've been searching the archive without success

RE: Implementation of a ScoreObject ?

2005-05-03 Thread Robichaud, Jean-Philippe
ril 27, 2005 12:30 PM To: java-user@lucene.apache.org Subject: Re: Implementation of a ScoreObject ? Robichaud, Jean-Philippe wrote: >Probably the simplest/ideal schema of the ScoreObject would be something >like a hashtable with Term being the keys and a TermScoreObject the value. >The

PerFieldSimilarity

2005-05-03 Thread Robichaud, Jean-Philippe
Hi Everyone, I've been searching the archive without success to answer this one: is it possible to specify one similarity class per field, just like we can do with an analyzer ? I know I can change the similarity of the searcher, but that restrict me to break some complex queries into different

Implementation of a ScoreObject ?

2005-04-27 Thread Robichaud, Jean-Philippe
Hi Everyone, Lucene is incredible for a lot of reasons. I've been using it for the past months and it served me quite well. I'm using the subversion snapshots, which I update every now and then. Almost every functionality I need is already present and well implemented, but sadly

RE: SPECIFIC HIT

2005-03-15 Thread Robichaud, Jean-Philippe
Hi Guys, It is somewhat difficult to suggest something useful without more details. If you a pretty sure of the quality of the query, then here is my suggestion: Index the documents with an extra field called "last_word" that will contains the last word in the document. So from your exa

How to set individual boost factor to each word in a phrase query ?

2005-03-02 Thread Robichaud, Jean-Philippe
Hi everyone. I've been playing with Lucene a lot in the past few months for an important project. We are using the raw score returned by Lucene (we created a custom similarity) as a part of a confidence score calculation. My problem is exactly what the subject line of this email says: How to s