Re: "Deleting" documents without deleting them

2010-03-16 Thread Daniel Noll
On Tue, Mar 16, 2010 at 20:45, Rene Hackl-Sommer wrote: > Hi Daniel, > > Unless you have only a few documents and a small index, I don't think never > calling optimize is going to be a means you should rely upon. > > What about if you reindexed the documents you are deleting, adding a field > wit

OutOfMemory ParallelMultisearcher

2010-03-16 Thread Jamie
Hi there, I have an index that is 36 GB in size. When I perform eight simultaneous searches (driven by JMeter) on the index, an OutOfMemory error occurs. Since I need to potentially search across multiple indexes and those indexes can change from one search query to the next, each user has

Re: score and multiValued fields

2010-03-16 Thread Erick Erickson
Have you looked at: http://lucene.apache.org/java/2_4_0/scoring.html? Even though it's for 2.4, I don't think there are any relevant changes for 3.x... I'm pretty sure that your supposition 2 is the right one. HTH Erick On Tue, Mar 16, 2010 at 2:58

score and multiValued fields

2010-03-16 Thread Marc Sturlese
I would like to know how Lucene deals with the score on multiValued fields. I am wondering whether: 1) a score is computed per field and the maximum among them wins, or 2) all terms of all fields (from the multivalued field) influence each other to compute the score. Let's say I have a document with a m

Re: "Deleting" documents without deleting them

2010-03-16 Thread Rene Hackl-Sommer
I cannot comment on the "marked-as-deleted" documents, but for the approach I outlined: this might impact the scores. I prefer to say 'impact' instead of 'skew', because to me 'skew' would imply that the original scores are some kind of ideal state which is distorted. I don't think this is nece

Re: Increase number of available positions?

2010-03-16 Thread Erick Erickson
Sure. I'd start a new thread though, referencing this one and outlining why none of the solutions you tried worked. Erick On Tue, Mar 16, 2010 at 4:35 AM, Rene Hackl-Sommer wrote: > Hi Guys, > > Thanks for the input! I am now going to put in some work to see how things > fare. > > Should I p

Re: "Deleting" documents without deleting them

2010-03-16 Thread TCK
Wouldn't these excluded/filtered documents skew the scores even though they are supposed to be marked as deleted? Don't the idf values used in scoring depend on the entire document set and not just the matching hits for a query? Thanks, TCK On Tue, Mar 16, 2010 at 5:45 AM, Rene Hackl-Sommer wr
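
As a worked illustration of the question above, assuming the idf formula used by Lucene's DefaultSimilarity in the 2.x/3.x line, and using made-up counts (1000 documents in the index, 500 of them hidden by a filter, a term that occurs in 9 visible and 90 hidden documents):

```latex
\[
\mathrm{idf}(t) = 1 + \ln\!\left(\frac{\mathrm{numDocs}}{\mathrm{docFreq}(t) + 1}\right)
\]
% Hidden documents still counted (whole index): numDocs = 1000, docFreq = 99
\[
\mathrm{idf}_{\text{index}}(t) = 1 + \ln\frac{1000}{100} = 1 + \ln 10 \approx 3.30
\]
% Only visible documents counted: numDocs = 500, docFreq = 9
\[
\mathrm{idf}_{\text{visible}}(t) = 1 + \ln\frac{500}{10} = 1 + \ln 50 \approx 4.91
\]
```

Under these assumed numbers the term looks more common, and therefore scores lower, when the hidden documents remain in the index statistics. Whether that matters depends on how the hidden documents differ from the visible ones.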

Re: "Deleting" documents without deleting them

2010-03-16 Thread Rene Hackl-Sommer
Hi Daniel, Unless you have only a few documents and a small index, I don't think never calling optimize is going to be a means you should rely upon. What about if you reindexed the documents you are deleting, adding a field with the value "true"? This would imply that either 1) all fields

Re: "Deleting" documents without deleting them

2010-03-16 Thread Michael McCandless
An incidental merge will delete them. I think you'll have to maintain your own filter... but it shouldn't be that large? I.e., it's as large as the deleted-docs BitVector would be anyway... except that the docs never go away. Mike On Mon, Mar 15, 2010 at 11:20 PM, Daniel Noll wrote: > Hi all. > > I'm
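
A minimal sketch of the "maintain your own filter" idea, using only java.util.BitSet rather than any Lucene API. The class name HiddenDocs and its methods are hypothetical, and the sketch assumes that integer document ids stay stable for as long as the bit set is in use, which the thread does not confirm:

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

/** Hypothetical helper (not part of Lucene): one bit per document id. */
class HiddenDocs {
    // A set bit means "treat this document as deleted" without removing it.
    private final BitSet hidden = new BitSet();

    void hide(int docId) {
        hidden.set(docId);
    }

    void unhide(int docId) {
        hidden.clear(docId);
    }

    boolean isHidden(int docId) {
        return hidden.get(docId);
    }

    /** Returns the hit ids that are not hidden, preserving their order. */
    List<Integer> visible(int[] hits) {
        List<Integer> result = new ArrayList<>();
        for (int docId : hits) {
            if (!hidden.get(docId)) {
                result.add(docId);
            }
        }
        return result;
    }
}
```

The memory cost is roughly one bit per document id up to the highest hidden id, which matches the observation that such a filter is about as large as a deleted-docs bit vector would be.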

Re: Increase number of available positions?

2010-03-16 Thread Rene Hackl-Sommer
Hi Guys, Thanks for the input! I am now going to put in some work to see how things fare. Should I post the question about substituting int with long on lucene-dev again, if the need arises? Thanks again, Rene On 15.03.2010 at 23:04, Steven A Rowe wrote: Hi Rene, Have you seen SpanNotQuery?