Re: Improving String Distance calculation performance

2010-12-28 Thread Robert Muir
On Tue, Dec 28, 2010 at 5:26 AM, Biedermann,S.,Fa. Post Direkt wrote: > Hi Robert, > > Thanks for your hint about LevenshteinAutomata. Are AutomatonQueries planned > for an upcoming release? Yes, but it's in trunk, so you can use it now... > > At the moment, we build the reference to boost docume

AW: Improving String Distance calculation performance

2010-12-28 Thread Biedermann,S.,Fa. Post Direkt
r. Furthermore, we are sensitive to query time. Do you have any ideas? -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, December 27, 2010 17:11 To: dev@lucene.apache.org Subject: Re: Improving String Distance calculation performance On Mon, Dec 27

Re: Improving String Distance calculation performance

2010-12-27 Thread Robert Muir
On Mon, Dec 27, 2010 at 10:31 AM, Biedermann,S.,Fa. Post Direkt wrote: > > As for our problem: we are trying to build reference data against which > requests shall be matched. In this case we need quite a huge amount of string > distance measurements for preparing this reference. > If this is y

AW: Improving String Distance calculation performance

2010-12-27 Thread Biedermann,S.,Fa. Post Direkt
g, 27 December 2010 16:07 To: dev@lucene.apache.org Subject: Re: Improving String Distance calculation performance Hi Biedermann: you are correct in that the comparator in spellcheck could maybe use some optimizations. But I'm curious as to why you would be doing a lot of comparison

Re: Improving String Distance calculation performance

2010-12-27 Thread Robert Muir
Hi Biedermann: you are correct in that the comparator in spellcheck could maybe use some optimizations. But I'm curious as to why you would be doing a lot of comparisons with the spellchecker? Are you using this class separately for some other purpose? The reason is that the spellchecker works l

Improving String Distance calculation performance

2010-12-27 Thread Biedermann,S.,Fa. Post Direkt
Hi, this is a re-post, because the first time I re-used another thread (sorry for any inconvenience): this is my first post to this mailing list, so I first want to say hello to all of you! You are doing a great job. In org.apache.lucene.search.FuzzyTermEnum I found an optimised implem

RE: Improving String Distance calculation performance

2010-12-27 Thread Uwe Schindler
ier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Biedermann,S.,Fa. Post Direkt [mailto:s.biederm...@postdirekt.de] > Sent: Monday, December 27, 2010 12:23 PM > To: dev@lucene.apache.org > Subject: Improving String Di

PS: Improving String Distance calculation performance

2010-12-27 Thread Biedermann,S.,Fa. Post Direkt
Oops... I forgot to say that the candidate only works if left.length() <= right.length()! -Original Message- From: Biedermann,S.,Fa. Post Direkt Sent: Monday, December 27, 2010 12:23 To: 'dev@lucene.apache.org' Subject: Improving String Distance calculation pe

Improving String Distance calculation performance

2010-12-27 Thread Biedermann,S.,Fa. Post Direkt
Hi, this is my first post to this mailing list, so I first want to say hello to all of you! You are doing a great job. In org.apache.lucene.search.FuzzyTermEnum I found an optimised implementation of the Levenshtein algorithm which makes use of the fact that the algorithm can be aborted
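
The post above is cut off in this archive view, so the following is only a rough sketch of the general idea, assuming that "aborted" means giving up once the distance can no longer stay within a caller-supplied maximum. It is not the code from FuzzyTermEnum, and the names BoundedLevenshtein, distance and maxDistance are hypothetical, chosen for illustration.

```java
// Illustrative sketch only; not taken from Lucene.
public final class BoundedLevenshtein {

    private BoundedLevenshtein() {}

    /**
     * Returns the Levenshtein distance between a and b, or -1 as soon as it
     * is certain that the distance exceeds maxDistance (hypothetical API).
     */
    public static int distance(String a, String b, int maxDistance) {
        // Keep the shorter string in 'a' so the row arrays stay small.
        if (a.length() > b.length()) {
            String tmp = a;
            a = b;
            b = tmp;
        }
        int n = a.length();
        int m = b.length();

        // The length difference is a lower bound on the distance.
        if (m - n > maxDistance) {
            return -1;
        }

        int[] prev = new int[n + 1];
        int[] curr = new int[n + 1];
        for (int i = 0; i <= n; i++) {
            prev[i] = i; // distance from the empty prefix of b
        }

        for (int j = 1; j <= m; j++) {
            curr[0] = j;
            int rowMin = curr[0];
            char bj = b.charAt(j - 1);
            for (int i = 1; i <= n; i++) {
                int cost = (a.charAt(i - 1) == bj) ? 0 : 1;
                int insertOrDelete = Math.min(curr[i - 1] + 1, prev[i] + 1);
                curr[i] = Math.min(insertOrDelete, prev[i - 1] + cost);
                rowMin = Math.min(rowMin, curr[i]);
            }
            // Every value in a later row is at least the minimum of this row,
            // so once the minimum exceeds the bound the result must too.
            if (rowMin > maxDistance) {
                return -1;
            }
            int[] swap = prev;
            prev = curr;
            curr = swap;
        }
        return prev[n] <= maxDistance ? prev[n] : -1;
    }
}
```

A call such as BoundedLevenshtein.distance("kitten", "sitting", 2) returns -1 without the caller ever seeing the exact distance, which is the point of the bound. The swap at the top also means callers do not have to pass the shorter string first.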