Re: Refactored FuzzyTermEnum

2006-10-20 Thread Otis Gospodnetic
2006 3:14:20 PM Subject: Refactored FuzzyTermEnum I refactored the org.apache.lucene.search.FuzzyTermEnum edit distance implementation. It now only uses a single pair of arrays, and those never get resized. That required turning the order of text/target around in the loops. You'll see th

Re: Refactored FuzzyTermEnum

2006-06-15 Thread Otis Gospodnetic
I'm still waiting for my employer to send in the CCLA. :( But I've got a cool name for this thing - "lube" - contrib/lube. Otis - Original Message From: Marvin Humphrey <[EMAIL PROTECTED]> To: java-dev@lucene.apache.org Sent: Wednesday, June 14, 2006 3:51:52

Re: Refactored FuzzyTermEnum

2006-06-15 Thread karl wettin
On Thu, 2006-06-15 at 11:42 +0200, karl wettin wrote: > > 2. Standard RAMDirectory: Oups, I got the headers wrong here. Again: With your fixes: 1000 fuzzy~0.1, 121 per min. 1000 fuzzy~0.2, 190 per min. 1000 fuzzy~0.3, 342 per min. 1000 fuzzy~0.4, 456 per min. 1000 fuzzy~0.5, 578 per min. 1000 f

Re: Refactored FuzzyTermEnum

2006-06-15 Thread karl wettin
On Tue, 2006-06-13 at 15:14 -0400, Bob Carpenter wrote: > I refactored the org.apache.lucene.search.FuzzyTermEnum Nice! > Does anyone have regression/performance test harnesses? I have a quite unscientific test I ran. 500 documents in corpus. One index reader. A new Index searcher for each query

Re: Refactored FuzzyTermEnum

2006-06-14 Thread Marvin Humphrey
On Jun 13, 2006, at 12:14 PM, Bob Carpenter wrote: Does anyone have regression/performance test harnesses? There's been talk of formalizing a benchmarker suite. Andrzej wrote something a while ago; I don't know if it would be appropriate for this. The benchmarker I wrote is indexing only

Re: Refactored FuzzyTermEnum

2006-06-14 Thread Chris Hostetter
: I don't know what the protocol is for one-off contributions. I'm not sure what you mean by a "one-off contribution" but the process for submiting changes/improvements/additions was recenlty wiki-ized... http://wiki.apache.org/jakarta-lucene/HowToContribute (Incidently, please feel free to sha

Refactored FuzzyTermEnum

2006-06-13 Thread Bob Carpenter
I refactored the org.apache.lucene.search.FuzzyTermEnum edit distance implementation. It now only uses a single pair of arrays, and those never get resized. That required turning the order of text/target around in the loops. You'll see that with the pair of arrays method, they get re-used hand