Re: best way to interest two queries?

2010-05-12 Thread mark harwood
>>two terminology questions: >>- is multiplier in the mail mentioned there the same as boost? This factor controls how many decimal places of precision are retained in the adjusted scores. Pick too low a multiplier and scores that are only differentiated by a very small value will appear equal. Pi
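
The precision point in the preview above can be illustrated with a small sketch. It assumes, purely for illustration, that the adjusted score is obtained by scaling the raw score by the multiplier and rounding to a whole number; the class `ScoreQuantizer` and the method `quantize` are hypothetical names, not part of Lucene or of the original mail.

```java
public class ScoreQuantizer {
    // Hypothetical helper (an assumption, not the poster's code):
    // scale the score by the multiplier and round, so only the precision
    // that survives the scaling is kept.
    static long quantize(float score, int multiplier) {
        return Math.round((double) score * multiplier);
    }

    public static void main(String[] args) {
        float a = 0.1234f;
        float b = 0.1239f;
        // Low multiplier: both scores collapse to the same value.
        System.out.println(quantize(a, 100) == quantize(b, 100));
        // Higher multiplier: the small difference is preserved.
        System.out.println(quantize(a, 10000) == quantize(b, 10000));
    }
}
```

With a multiplier of 100 the two scores become indistinguishable, while 10000 keeps them apart, which is the trade-off the reply describes.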

How to avoid sharing docStore files?

2010-05-12 Thread Ivan Vasilev
Hi Guys, Can anybody tell me how to avoid sharing of docStore files (term vectors & stored fields)? I mean to avoid creation of cfx files. This is important for us because we support some operations like splitting index, updating index fields (via running optimization that has some differenc

Unsupported operation in DocIdSetIterator using lucene 2.9

2010-05-12 Thread Gregory Tarr
I am using lucene 2.9 and I can't understand why a succession of un-deprecated methods calls a deprecated method in this class. The series of calls is as follows: Searcher.search(Query, Collector) IndexSearcher.search(Weight, Filter, Collector) Scorer.score(Collector) DocIdSetIterator.nextDoc() Do

Re: How to avoid sharing docStore files?

2010-05-12 Thread Michael McCandless
This isn't something you can disable in Lucene, currently. In general, how Lucene represents the index as files is "private" to it -- it's free to change from release to release. That said, we are thinking of moving away from doc stores, with LUCENE-2324. Now that both stored fields & term vecto

Return Entire field from GetBestFragment in FastVectorHighlighter

2010-05-12 Thread Midhat Ali
Is it possible to return the entire field contents instead of a fixed-size fragment? In Highlighter, there is a NullFragmenter. What's its counterpart in FastVectorHighlighter?

Re: How to avoid sharing docStore files?

2010-05-12 Thread Ivan Vasilev
Hi Michael, Thanks for your answer. What we do now: 1. Splitting indexes. We do it not by reading indexes and distributing docs in separate indexes like in MultiPassIndexSplitter. We do it by binary copying segments to different folders and then recreating the segment descriptor file for each one (we

Re: How to avoid sharing docStore files?

2010-05-12 Thread Andrzej Bialecki
On 2010-05-12 14:29, Ivan Vasilev wrote: > Hi Michael, > Thanks for your answer. > What we do now: > 1. Splitting indexes. We do it not by reading indexes and distributing > docs in separate indexes like in MultiPassIndexSplitter. We do it by > binary copying segments to different folders and then

Re: How to avoid sharing docStore files?

2010-05-12 Thread Ivan Vasilev
That's fine, Andrzej :) Doing the split in just one pass really matters for big indexes. Hope we will use it in our application. Thanks, Ivan Andrzej Bialecki wrote: On 2010-05-12 14:29, Ivan Vasilev wrote: Hi Michael, Thanks for your answer. What we do now: 1. Splitting indexes. We do it not b

How to influence the score in lucene 2.9

2010-05-12 Thread Gregory Tarr
How easy is it to influence the score of search results in lucene 2.9? The situation is that we have a large number of dated documents that match the term "john" but we want to return the latest documents when "john" is the search term. My solution to this would be to override the score such that

Re: How to influence the score in lucene 2.9

2010-05-12 Thread Erick Erickson
Why wouldn't simply sorting by date descending work in this case? If you can detect the situation ahead of time that might be simpler. Do be aware that sorting by date has its own issues if you've stored the dates with very fine resolutions (e.g. seconds), but there are well-understood ways of dea
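
One common way to reduce the cost of very fine date resolution is to index the date at a coarser granularity, for example per day. The preview is cut off, so this is not necessarily the approach the poster goes on to describe. The sketch below uses only the JDK; `DayResolution` and `toDayKey` are hypothetical names chosen for illustration.

```java
import java.time.Instant;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;

public class DayResolution {
    // Day-level key in UTC. The yyyyMMdd layout sorts lexicographically
    // in chronological order, so it can be used directly as a sort field.
    private static final DateTimeFormatter DAY =
            DateTimeFormatter.ofPattern("yyyyMMdd").withZone(ZoneOffset.UTC);

    // Collapse a millisecond timestamp to its day, so all documents from
    // the same day share one value instead of one value per second.
    static String toDayKey(long epochMillis) {
        return DAY.format(Instant.ofEpochMilli(epochMillis));
    }

    public static void main(String[] args) {
        long morning = Instant.parse("2010-05-12T08:00:00Z").toEpochMilli();
        long afternoon = Instant.parse("2010-05-12T14:29:00Z").toEpochMilli();
        System.out.println(toDayKey(morning));
        System.out.println(toDayKey(morning).equals(toDayKey(afternoon)));
    }
}
```

The trade-off is that documents from the same day are no longer ordered relative to each other by this field alone.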

TermDocs

2010-05-12 Thread roy-lucene-user
Hi guys, I've had this code for some time but am just now questioning if it works. I have a custom filter that I've been using from Lucene 1.4 through Lucene 2.2.0 and it essentially builds up a BitSet like so: for ( int x = 0; x < fields.length; x++ ) { for ( int y = 0; y < values.length; y++
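
The general shape of such a filter can be sketched without the Lucene API. This does not reconstruct the poster's truncated code: it assumes the elided loop body sets one bit per document matching each field/value pair, and it uses a hypothetical in-memory map (`postings`, keyed by "field:value") in place of a real index and TermDocs.

```java
import java.util.BitSet;
import java.util.HashMap;
import java.util.Map;

public class TermBitSetSketch {
    // Hypothetical stand-in for an index lookup: "field:value" -> matching doc ids.
    static BitSet build(Map<String, int[]> postings,
                        String[] fields, String[] values, int maxDoc) {
        BitSet bits = new BitSet(maxDoc);
        for (int x = 0; x < fields.length; x++) {
            for (int y = 0; y < values.length; y++) {
                int[] docs = postings.get(fields[x] + ":" + values[y]);
                if (docs == null) {
                    continue; // no documents contain this field/value pair
                }
                for (int doc : docs) {
                    bits.set(doc); // union: a doc passes if any pair matches
                }
            }
        }
        return bits;
    }

    public static void main(String[] args) {
        Map<String, int[]> postings = new HashMap<>();
        postings.put("author:john", new int[] {1, 3});
        postings.put("title:john", new int[] {3, 5});
        BitSet bits = build(postings,
                new String[] {"author", "title"}, new String[] {"john", "mary"}, 8);
        System.out.println(bits);
    }
}
```

Note that the nested loops produce a union over every field/value combination, so a document is accepted as soon as one pair matches.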