http://www.gedpage.com/soundex.html offers a simple explanation of what it does.
One possibility would be building a referential table with only a recordID and soundex column, unique over both; and filling that with the soundex of individual nonjunk words. So, from the titles 1 | Rain in Spain 2 | Spain's Rain you'd get 1 | R500 1 | S150 2 | S150 2 | R500 >From thereon, you can see that all the same words have been used - ignoring a >lot of spelling errors like Spian. Obviously not a magic solution, but it's a >start. ----- Original Message ----- > From: "Jerry Schwartz" <je...@gii.co.jp> > To: "Johan De Meersman" <vegiv...@tuxera.be> > Cc: "Jim McNeely" <j...@newcenturydata.com>, "mysql mailing list" > <mysql@lists.mysql.com> > Sent: Monday, 2 May, 2011 4:09:36 PM > Subject: RE: Join based upon LIKE > > [JS] I've thought about using soundex(), but I'm not quite sure how. > > I didn't pursue it much because there are so many odd terms such as > chemical > names, but perhaps I should give it a try in my infinite free time. > > > [JS] Thanks for your condolences. > > Regards, > > Jerry Schwartz > Global Information Incorporated > 195 Farmington Ave. > Farmington, CT 06032 > > 860.674.8796 / FAX: 860.674.8341 > E-mail: je...@gii.co.jp > Web site: www.the-infoshop.com > -- Bier met grenadyn Is als mosterd by den wyn Sy die't drinkt, is eene kwezel Hy die't drinkt, is ras een ezel -- MySQL General Mailing List For list archives: http://lists.mysql.com/mysql To unsubscribe: http://lists.mysql.com/mysql?unsub=arch...@jab.org