[EMAIL PROTECTED]

2007-05-02 Thread axel.reymonet
Unsubscribe

Re: batch indexing

2007-05-02 Thread Chris
Sorry , ask a question. You say FSDirectory is RADdirectory as least until it flushes. I cannot understand your means . May you please teach me what it means? FSDirectory stored in filesystem , and RADdirectory stored in RAM. MergeFactor and MaxBufferedDocs settings are limited and controlled

Re: batch indexing

2007-05-02 Thread Erick Erickson
For some intermediate period of time when indexing, the document are buffered in RAM. There is a complex interplay between several of the parameters to an IndexWriter that govern how many documents are indexed in RAM before being flushed to the FSDirectory. Of course, if you specify a

Re: Keyphrase Extraction

2007-05-02 Thread Mark Miller
From what I know you generally have to pay if you want something that does this really well. Or check out http://www.nzdl.org/Kea/ Unfortunately, the license is GPL. Really too bad; now that it is all Java, it would make a great combo with Lucene. - Mark mark harwood wrote: I believe the

Problem with scoring:is it absolute or relative ?

2007-05-02 Thread Paul Taylor
Hi I am having problems understanding lucenes scoring. I am using the Musicbrainz which uses Lucene to provide searching facility over its data, which put simply consists of a database about recording artists , albums and song titles I can construct a query such as: track:Minus AND

Re: Problem with scoring:is it absolute or relative ?

2007-05-02 Thread Otis Gospodnetic
Paul, That is because the Hits class is likely being used under the covers. When you use the IndexSearcher's search(...) method that returns Hits, hit scores are normalized, so they are always between 0 and 1. If you want the raw scores, and it sounds like you do, you could use a lower-level

Re: Problem with scoring:is it absolute or relative ?

2007-05-02 Thread Paul Taylor
Hi Otis thanks, so the scores are relative to the result set. Unfortunately I only have access to an xmlwebservice that let me send a lucene query so I don't have access to the various IndexSearcher methods. So it sounds like rather than just taking the top result and using that Ill have to

Email Definition in StandardTokenizer.jj

2007-05-02 Thread Winton Davies
Hey guys, Does someone who makes commits want to fix the EMAIL definition in StandardTokenizer.jj Its a not very well known exception to the naming process, that you can use a + in the middle of the email name, and the delivery agent sends the email to the prefixing name, but keeps the