[sqlite] Advice needed for fuzzy search

2009-07-02 Thread Jean-Christophe Deschamps
Hello group, I'm writing a fuzzy search extension. The current code is getting a little messy and I'm not completely satisfied by the way it works. So I'm about to rewrite it from scratch on stronger foundations. The goal is to provide a fuzzy search on _short_ fields like names, street

Re: [sqlite] Advice needed for fuzzy search

2009-07-02 Thread Simon Slavin
On 2 Jul 2009, at 2:01pm, Jean-Christophe Deschamps wrote: I need to deal with codepoints that would expand to several individual characters. Examples are ligatures or fractions. I've never seen ligatures used in French, nor in any european language, when it comes to user input. I believe

Re: [sqlite] Advice needed for fuzzy search

2009-07-02 Thread Jean-Christophe Deschamps
Simon, At 15:26 02/07/2009, you wrote: ´¯¯¯ What we need is a new version of Soundex which is written to deal with unicode instead of ASCII. `--- Umm, soundex already fails often with plain english names. It would need a whole lot of native speakers of all those languages around to come up