Re: PhraseQuery Performance Issues [Lucene 2.9.0]

2010-03-29 Thread Rafael Turk
unsubscribe - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Apache Lucene EuroCon Call For Participation: Prague, Czech Republic May 20 & 21, 2010

2010-03-29 Thread Grant Ingersoll
Apache Lucene EuroCon Call For Participation - Prague, Czech Republic May 20 & 21, 2010 All submissions must be received by Tuesday, April 13, 2010, 12 Midnight CET/6 PM US EDT The first European conference dedicated to Lucene and Solr is coming to Prague from May 18-21, 2010. Apache Lucene Eu

Re: Problem / question concerning "Fuzzy Search"

2010-03-29 Thread Robert Muir
On Mon, Mar 29, 2010 at 10:57 AM, Benjamin Patrick Jung wrote: > > [Examples] Search term --> Subset of expected result > Cinamo~0.5 --> Cinema, Cinnamon [works] > Strawbarr~0.8 --> Strawberry[doesn't work] > > --> > As far as I understand, the "Edit distance" > (aka "Levinshtein distance")

Problem / question concerning "Fuzzy Search"

2010-03-29 Thread Benjamin Patrick Jung
Hi all, I tried to figure out how the fuzzy search implementation in Apache Lucene works and I'm kinda stuck here. --> Version : Apache Lucene 3.0.1 (JAVA) [What I want / need] I'm looking for a way to combine a prefix-, fuzzy- and wildcard query. Q: Is it possible to have a query like "user_i

Re: adapting lucene's practical scoring function

2010-03-29 Thread Grant Ingersoll
On Mar 25, 2010, at 3:07 PM, Mathias Silbermann wrote: > Dear Lucene Users, > > I'd like to use Lucene to find scientific papers in the index that are > similar to a given paper from the > index. This seems to be possible using the MoreLikeThis-feature or wrapping > the given document > in a q

Re: Lucene scalability observations with a large volatile Index

2010-03-29 Thread Michael McCandless
OK I opened https://issues.apache.org/jira/browse/LUCENE-2357 for #3. Mike On Mon, Mar 29, 2010 at 7:17 AM, Michael McCandless wrote: > On #1: Unfortunately, you cannot control the terms index divisor that > IW uses when opening its internal readers. > > Long term we need to factor out the reade

Re: Lucene scalability observations with a large volatile Index

2010-03-29 Thread Michael McCandless
On #1: Unfortunately, you cannot control the terms index divisor that IW uses when opening its internal readers. Long term we need to factor out the reader pool that IW uses... so that an app can provide its own impl that could control this (and other) settings. There's already work being done to

Lucene scalability observations with a large volatile Index

2010-03-29 Thread ajjb 936
Hi, I have some observations when using Lucene with my particular use case, I thought it may be useful to capture some of these observations. I need to create and continuously update a Lucene Index where each document adds (2 to 3) unique terms. The number of documents in the index is between 150