Re: [sqlite] Near misses

2009-07-01 Thread Alberto Simões
Hello 2009/6/26 Alberto Simões : > I am trying to find words in a dictionary stored in sqlite, and trying > a near miss approach. > For that I tried an algorithm to create patterns corresponding to > Levenshtein distance of 1 (edit distance of 1). > That means, one adition, one remotion or one sub

Re: [sqlite] Near misses

2009-06-28 Thread Eric Bohlman
Simon Slavin wrote: > On 26 Jun 2009, at 12:25pm, Alberto Simões wrote: > >> one adition, one remotion or one substitution > > I am always amazed at how well people use English. For your word > 'remotion' you probably mean 'removal' or 'omission'. You have joined > the two possibilities tog

Re: [sqlite] Near misses

2009-06-28 Thread John Machin
On 27/06/2009 7:00 AM, Jean-Christophe Deschamps wrote: > At 13:25 26/06/2009, you wrote: > ´¯¯¯ >> I am trying to find words in a dictionary stored in sqlite, and trying >> a near miss approach. >> For that I tried an algorithm to create patterns corresponding to >> Levenshtein distance of 1 (edit

Re: [sqlite] Near misses

2009-06-28 Thread Alberto Simões
On Fri, Jun 26, 2009 at 10:00 PM, Jean-Christophe Deschamps wrote: > Hi, > > I'm currently finishing an C extension offering, among other functions, > a "TYPOS" scalar operator which is meant to perform just that, and a > bit more. > > Internally, it applies a Unicode fold() function, a Unicode low

Re: [sqlite] Near misses

2009-06-26 Thread Igor Tandetnik
Alberto Simoes wrote: > On Fri, Jun 26, 2009 at 3:00 PM, Igor Tandetnik > wrote: >> Alberto Simoes wrote: >>> SELECT DISTINCT(word) FROM dict WHERE word = "ar" OR word = "ca" OR >>> word LIKE "_car" OR word LIKE "c_r" OR word = "cr" OR word LIKE >>> "_ar" OR word LIKE "ca_r" OR word LIKE "c_ar" OR

Re: [sqlite] Near misses

2009-06-26 Thread Jean-Christophe Deschamps
At 13:25 26/06/2009, you wrote: ´¯¯¯ >I am trying to find words in a dictionary stored in sqlite, and trying >a near miss approach. >For that I tried an algorithm to create patterns corresponding to >Levenshtein distance of 1 (edit distance of 1). >That means, one adition, one remotion or one subst

Re: [sqlite] Near misses

2009-06-26 Thread Alberto Simões
On Fri, Jun 26, 2009 at 3:43 PM, Simon Slavin wrote: > > On 26 Jun 2009, at 12:25pm, Alberto Simões wrote: > >> one adition, one remotion or one substitution > > I am always amazed at how well people use English.  For your word > 'remotion' you probably mean 'removal' or 'omission'.  You have joine

Re: [sqlite] Near misses

2009-06-26 Thread Alberto Simões
Hello On Fri, Jun 26, 2009 at 3:00 PM, Igor Tandetnik wrote: > Alberto Simoes wrote: >> For that, my script receives a word (say, 'car') and generated all >> possible additions and remotions, and substitutions: >> >> Additions: _car c_ar ca_r car_ >> Substitutions: _ar c_r ca_ >> remotions: ar cr

Re: [sqlite] Near misses

2009-06-26 Thread Simon Slavin
On 26 Jun 2009, at 12:25pm, Alberto Simões wrote: > one adition, one remotion or one substitution I am always amazed at how well people use English. For your word 'remotion' you probably mean 'removal' or 'omission'. You have joined the two possibilities together ! > Then, the script cons

Re: [sqlite] Near misses

2009-06-26 Thread Igor Tandetnik
Alberto Simoes wrote: > For that, my script receives a word (say, 'car') and generated all > possible additions and remotions, and substitutions: > > Additions: _car c_ar ca_r car_ > Substitutions: _ar c_r ca_ > remotions: ar cr ca > > Then, the script constructs an SQL query: > > SELECT DISTINCT(w

Re: [sqlite] Near misses

2009-06-26 Thread Martin Pfeifle
;s my understanding. Thank you for your idea, because I am also thinking of putting some fuzzy search on top of FTS. Best Martin Von: Alberto Simões An: General Discussion of SQLite Database Gesendet: Freitag, den 26. Juni 2009, 13:25:57 Uhr Betreff: [sqlite] Near m

[sqlite] Near misses

2009-06-26 Thread Alberto Simões
Hello. I am trying to find words in a dictionary stored in sqlite, and trying a near miss approach. For that I tried an algorithm to create patterns corresponding to Levenshtein distance of 1 (edit distance of 1). That means, one adition, one remotion or one substitution. For that, my script rece