RE: Filter.getDocIdSet() returning null, and what this means for CachingWrapperFilter

2010-05-26 Thread Uwe Schindler
Can you open an issue? Null should be handled like an empty DocIdSet. This seems to be a bug in CachingWrapperFilter. To work around this, don’t return null; instead return the constant DocIdSet.EMPTY_DOCIDSET. This is the preferable solution, and maybe we will change this in Lucene 4.0, to no

Precision-recall curve with /contrib/benchmark/quality

2010-05-26 Thread calin014
Hi, I am testing some ranking methods with the /contrib/benchmark/quality package, and I was wondering if there is a simple way of building a precision-recall graph with the info gathered (maybe QualityStats.RecallPoint). Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/Pr

Re: Precision-recall curve with /contrib/benchmark/quality

2010-05-26 Thread Robert Muir
Personally I use the generated submission.txt and run it through trec_eval to get all the numbers. By default, trec_eval will dump ircl_prn. values, and you could plug them into OpenOffice. I prefer to use trec_eval, as the results from the benchmark summary often differ from trec_eval's. it would be gr
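For readers who want to see what goes into such a curve, here is a minimal sketch in plain Java. It assumes that the values being plotted are interpolated precision at the standard recall levels 0.0, 0.1, ..., 1.0, and that the ranked list contains no duplicate document IDs. The class and method names are hypothetical and are not part of the benchmark package or of trec_eval; trec_eval remains the reference for real evaluation numbers.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not part of Lucene or trec_eval.
final class PrecisionRecallSketch {

    /**
     * Returns interpolated precision at recall levels 0.0, 0.1, ..., 1.0
     * for a single query. Assumes 'ranked' has no duplicate doc IDs.
     */
    static double[] interpolatedPrecision(List<String> ranked, Set<String> relevant) {
        int n = ranked.size();
        double[] precisionAt = new double[n];
        double[] recallAt = new double[n];
        int hits = 0;
        for (int i = 0; i < n; i++) {
            if (relevant.contains(ranked.get(i))) {
                hits++;
            }
            // Precision and recall after looking at the top (i + 1) results.
            precisionAt[i] = (double) hits / (i + 1);
            recallAt[i] = relevant.isEmpty() ? 0.0 : (double) hits / relevant.size();
        }

        double[] result = new double[11];
        for (int level = 0; level <= 10; level++) {
            double target = level / 10.0;
            double best = 0.0;
            // Interpolation: the highest precision at any rank whose recall
            // is at least the target level (small epsilon for rounding).
            for (int i = 0; i < n; i++) {
                if (recallAt[i] >= target - 1e-9 && precisionAt[i] > best) {
                    best = precisionAt[i];
                }
            }
            result[level] = best;
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> ranked = Arrays.asList("d1", "d2", "d3", "d4", "d5");
        Set<String> relevant = new HashSet<String>(Arrays.asList("d1", "d3"));
        double[] curve = interpolatedPrecision(ranked, relevant);
        for (int level = 0; level <= 10; level++) {
            System.out.println((level / 10.0) + "\t" + curve[level]);
        }
    }
}
```

The two printed columns (recall level, interpolated precision) can be pasted into a spreadsheet to draw the curve. Averaging the eleven values across queries gives one curve per ranking method.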

RE: Filter.getDocIdSet() returning null, and what this means for CachingWrapperFilter

2010-05-26 Thread Uwe Schindler
I opened https://issues.apache.org/jira/browse/LUCENE-2478 and will fix soon!

IndexSearcher - open file handles by deleted files

2010-05-26 Thread Thomas Rewig
Hello, I use Lucene 2.9.1 with two indices, which alternate each day. One is live; the other is erased and rebuilt with the latest data. The problem is that the index files are deleted, but their file handles remain open. If the program (JBoss) is not restarted for some time, the dis

Re: Getting DF & IDF

2010-05-26 Thread Sethu_424
Hi, I am not sure if you are still searching for the answer to your question. If so, then please read on... You can get the DF & IDF for each of the query terms in the query as below: IndexReader reader = IndexReader.open(FSDirectory.open(new File(indexDir)), true); //Create a FilterInde
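As a rough illustration of how IDF follows from DF, here is a minimal sketch in plain Java. It assumes the formula used by Lucene's DefaultSimilarity in the 2.9/3.0 line, idf = 1 + ln(numDocs / (docFreq + 1)); the class name is hypothetical, and it returns a double where Lucene returns a float. It is not the code from the message above, which is cut off.

```java
// Hypothetical helper; mirrors the DefaultSimilarity idf formula as an assumption.
final class IdfSketch {

    /**
     * docFreq: number of documents containing the term (DF).
     * numDocs: total number of documents in the index.
     */
    static double idf(int docFreq, int numDocs) {
        // The +1 in the denominator avoids division by zero for unseen terms.
        return Math.log(numDocs / (double) (docFreq + 1)) + 1.0;
    }

    public static void main(String[] args) {
        // A term found in 1 of 4 documents.
        System.out.println(idf(1, 4));
        // A term found in 9 of 10 documents scores lower.
        System.out.println(idf(9, 10));
    }
}
```

In a real index the two inputs would typically come from the reader, for example the document frequency of a term and the total document count.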

Re: Test File locks

2010-05-26 Thread Ian Lea
No answers, but some things you could try: There have been some reports of problems with file paths being specified in different ways - UNC versus something else perhaps. Use SimpleFSLockFactory rather than the default native class. Try a recent version of Lucene unless you already are. -- Ian.

ApacheCon CFP Closes on Friday

2010-05-26 Thread Grant Ingersoll
If you are planning on submitting for ApacheCon, you have until Friday to do so. See the CFP at http://blogs.apache.org/conferences/date/20100428

Re: Precision-recall curve with /contrib/benchmark/quality

2010-05-26 Thread calin014
Thanks, that worked :)

Customer TokenFilter

2010-05-26 Thread tsuraan
I'd like to have all my queries and terms run through Unicode Normalization prior to being executed/indexed. I've been using the StandardAnalyzer with pretty good luck for the past few years, so I think I'd like to write an analyzer that wraps that, and tacks a custom TokenFilter onto the chain pr
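The normalization step itself can be sketched with the JDK alone. The example below assumes NFC is the desired form (the message does not say which form is wanted) and uses a hypothetical helper name. Wiring it into a TokenFilter that wraps StandardAnalyzer is not shown here.

```java
import java.text.Normalizer;

// Hypothetical helper; the choice of NFC is an assumption.
final class NormalizeTermSketch {

    /** Returns the term in NFC, skipping the work if it is already normalized. */
    static String normalizeTerm(String term) {
        if (Normalizer.isNormalized(term, Normalizer.Form.NFC)) {
            return term;
        }
        return Normalizer.normalize(term, Normalizer.Form.NFC);
    }

    public static void main(String[] args) {
        // "e" followed by a combining acute accent (decomposed form).
        String decomposed = "cafe\u0301";
        String composed = normalizeTerm(decomposed);
        // The composed form uses the single code point U+00E9.
        System.out.println(decomposed.length() + " -> " + composed.length());
    }
}
```

Applying the same function at both index time and query time is what makes a decomposed query term match a composed indexed term, and vice versa.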

Re: Filter.getDocIdSet() returning null, and what this means for CachingWrapperFilter

2010-05-26 Thread Daniel Noll
On Wed, May 26, 2010 at 23:30, Uwe Schindler wrote: > I opened https://issues.apache.org/jira/browse/LUCENE-2478 and will fix soon! It does sound like a good idea to not permit a null doc ID set, since there is a convenient constant for the empty result anyway. :-) And actually, it turns out tha

Re: Test File locks

2010-05-26 Thread Chris Hostetter
It would be helpful to know: 1) what version of Lucene you are using 2) what exactly line 167 of LibraryBuilder looks like (i.e., what options you are using when instantiating the IndexWriter) 3) what filesystems you are using on each of the two different machines. 4) does it really say