Re: Flex & Docs/AndPositionsEnum

2010-02-11 Thread Michael McCandless
On Wed, Feb 10, 2010 at 2:42 PM, Marvin Humphrey wrote: > On Wed, Feb 10, 2010 at 12:33:27PM -0500, Michael McCandless wrote: > >> In Lucene, skipping is done through the aggregator, > > I had a look at MultiDocsEnum in the flex blanch. It doesn't know when > sub-enum is reading skip data. I'm c

question regarding BooleanQuery:equals() method

2010-02-11 Thread Smith G
Hello All, I am writing some test cases for a custom-class which modifies incoming TermQuery and adds some other Terms and returns a BooleanQuery. As always I have used assertEquals(), which apparently uses equals() method. I found out that following two queries are different from each

Re: TREC Data and Topic-Specific Index

2010-02-11 Thread Ivan Provalov
Thank you, Robert. --- On Wed, 2/10/10, Robert Muir wrote: > From: Robert Muir > Subject: Re: TREC Data and Topic-Specific Index > To: java-user@lucene.apache.org > Date: Wednesday, February 10, 2010, 9:23 AM > Hi, so you mean around 15% and 24% > respectively? i think you could fairly > say ei

Retrieve fields that are not empty

2010-02-11 Thread mored
Hello, is there a way query syntax fo rretrieving fields which have a value? Something like myfield:NOT EMPTY. I searched the lucene docu, but didnĀ“t found anything for this issue. Thanks in advance. chris -- View this message in context: http://old.nabble.com/Retrieve-fields-that-are-not-emp

RE: Retrieve fields that are not empty

2010-02-11 Thread Uwe Schindler
Hi Chris, its only possible with TermRangeQuery/NumericRangeQuery (depending on data type, corresponding filters can also be used) with both ends null. E.g., Solr does it that way. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -O

Re: Do deleted documents affect scores?

2010-02-11 Thread Ian Lea
I'm pretty sure that the answer is no and a quick test on a small index with/without deleted docs showed no difference in the scores, using 3.0. But that was hardly a rigorous test and I don't know enough about lucene internals and scoring to give a definitive answer. Shouldn't be too hard for yo

Re: Do deleted documents affect scores?

2010-02-11 Thread Andrzej Bialecki
On 2010-02-11 17:35, Ian Lea wrote: I'm pretty sure that the answer is no and a quick test on a small index with/without deleted docs showed no difference in the scores, using 3.0. But that was hardly a rigorous test and I don't know enough about lucene internals and scoring to give a definitive

Re: Flex & Docs/AndPositionsEnum

2010-02-11 Thread Marvin Humphrey
On Thu, Feb 11, 2010 at 08:30:14AM -0500, Michael McCandless wrote: > Oh you're saying we don't know if the underlying enum actually skipped vs > just scanned? Yep. > Isn't the skip data also based on deltas? Yes, but that's internal to the skip reader, in both Lucene and Lucy/KS. When it com

RE: Do deleted documents affect scores?

2010-02-11 Thread Yuval Feinstein
Thanks Ian and Andrzej. You solved a mystery for us. -- Yuval From: Andrzej Bialecki [...@getopt.org] Sent: Thursday, February 11, 2010 6:53 PM To: java-user@lucene.apache.org Subject: Re: Do deleted documents affect scores? On 2010-02-11 17:35, Ian Lea wr

ConstantScoreQuery without filters

2010-02-11 Thread Michel Nadeau
Hi, I use ConstantScoreQuery to find all documents in an index like this: td = searcher.search(new ConstantScoreQuery(cluCF), null, md, cluSort); * cluCF is a Filter * md is int = 999 * cluSort is a Sort My problem is that I don't always have a filter (cluCF) - so sometimes its value is 'null'

Re: ConstantScoreQuery without filters

2010-02-11 Thread Michel Nadeau
I think I solved my problem - used MatchAllDocsQuery() - is that the best solution ? - Mike aka...@gmail.com On Thu, Feb 11, 2010 at 3:50 PM, Michel Nadeau wrote: > Hi, > > I use ConstantScoreQuery to find all documents in an index like this: > > td = searcher.search(new ConstantScoreQuery(clu

RE: ConstantScoreQuery without filters

2010-02-11 Thread Uwe Schindler
Yes, the fastest to get *all* documents, as it simply iterates over global TermDocs(null) without intersecting with a filter. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Michel Nadeau [mailto:aka...@