RE: Join based upon LIKE

2011-05-05 Thread Jerry Schwartz
>-Original Message- >From: Nuno Tavares [mailto:nuno.tava...@dri.pt] >Sent: Tuesday, May 03, 2011 6:21 PM >To: mysql@lists.mysql.com >Subject: Re: Join based upon LIKE > >Dear Jerry, > >I've been silently following this discussion because I've misse

Re: Join based upon LIKE

2011-05-03 Thread Nuno Tavares
p > Web site: www.the-infoshop.com > > >> -Original Message- >> From: shawn wilson [mailto:ag4ve...@gmail.com] >> Sent: Tuesday, May 03, 2011 4:08 PM >> Cc: mysql mailing list >> Subject: Re: Join based upon LIKE >> >> I'm actually enjoyi

RE: Join based upon LIKE

2011-05-03 Thread Jerry Schwartz
-mail: je...@gii.co.jp Web site: www.the-infoshop.com >-Original Message- >From: shawn wilson [mailto:ag4ve...@gmail.com] >Sent: Tuesday, May 03, 2011 4:08 PM >Cc: mysql mailing list >Subject: Re: Join based upon LIKE > >I'm actually enjoying this discussion beca

Re: Join based upon LIKE

2011-05-03 Thread shawn wilson
I'm actually enjoying this discussion because I have the same type of issue. However, I have done away with trying to do a full text search in favor of making a table with unique fields where all fields should uniquely identify the group. If I get a dupe, I can clean it up. However, like you, they

Re: Join based upon LIKE

2011-05-03 Thread Johan De Meersman
- Original Message - > From: "Jerry Schwartz" > > I'm not sure that I could easily build a dictionary of non-junk > words, since The traditional way is to build a database of junk words. The list tends to be shorter :-) Think and/or/it/the/with/like/... Percentages of mutual and non-

RE: Join based upon LIKE

2011-05-03 Thread Jerry Schwartz
>-Original Message- >From: Johan De Meersman [mailto:vegiv...@tuxera.be] >Sent: Tuesday, May 03, 2011 5:31 AM >To: Jerry Schwartz >Cc: Jim McNeely; mysql mailing list; Johan De Meersman >Subject: Re: Join based upon LIKE > > >http://www.gedpage.com/soundex.html

Re: Join based upon LIKE

2011-05-03 Thread Johan De Meersman
rom: "Jerry Schwartz" > To: "Johan De Meersman" > Cc: "Jim McNeely" , "mysql mailing list" > > Sent: Monday, 2 May, 2011 4:09:36 PM > Subject: RE: Join based upon LIKE > > [JS] I've thought about using soundex(), but I'm not q

RE: Join based upon LIKE

2011-05-02 Thread Jerry Schwartz
>-Original Message- >From: Johan De Meersman [mailto:vegiv...@tuxera.be] >Sent: Sunday, May 01, 2011 4:01 AM >To: Jerry Schwartz >Cc: Jim McNeely; mysql mailing list >Subject: Re: Join based upon LIKE > > >- Original Message - >> From: "Jerry S

Re: Join based upon LIKE

2011-05-01 Thread Johan De Meersman
- Original Message - > From: "Jerry Schwartz" > > I shove those modified titles into a table and do a JOIN ON > `prod_title` LIKE > `wild_title`. Roughly what I meant with the shadow fields, yes - keep your own set of data around :-) I have little more to offer, then, I'm afraid. The

Re: FW: Join based upon LIKE

2011-04-30 Thread Hal�sz S�ndor
2011/04/28 15:28 -0400, Jerry Schwartz No takers? And this is not real taking, because the algorithm of which I am thinking, the edit-distance (Levens(h)tein-distance) algorithm costs too much for you (see the Wikipedia entry), but it yields, I believe, much more nearly such an

Re: FW: Join based upon LIKE

2011-04-29 Thread Hal�sz S�ndor
2011/04/28 15:28 -0400, Jerry Schwartz No takers? And this is not real taking, because the algorithm of which I am thinking, the edit-distance (Levens(h)tein-distance) algorithm costs too much for you (see the Wikipedia entry). The obvious implementation takes as many steps as

RE: Join based upon LIKE

2011-04-29 Thread Jerry Schwartz
>-Original Message- >From: Jim McNeely [mailto:j...@newcenturydata.com] >Sent: Thursday, April 28, 2011 6:43 PM >To: Jerry Schwartz >Subject: Re: Join based upon LIKE > >It just smells wrong, a nicer system would have you joining on ID's of some >kind so tha

RE: Join based upon LIKE

2011-04-29 Thread Jerry Schwartz
>-Original Message- >From: Johan De Meersman [mailto:vegiv...@tuxera.be] >Sent: Friday, April 29, 2011 5:56 AM >To: Jerry Schwartz >Cc: mysql mailing list >Subject: Re: Join based upon LIKE > > >- Original Message - >> From: "Jerry Schwartz&qu

Re: Join based upon LIKE

2011-04-29 Thread Johan De Meersman
- Original Message - > From: "Jerry Schwartz" > > [JS] This isn't the only place I have to deal with fuzzy data. :-( > Discretion prohibits further comment. Heh. What you *really* need, is a LART. Preferably one of the spiked variety. > A full-text index would work if I were only looki

RE: Join based upon LIKE

2011-04-28 Thread Jerry Schwartz
>-Original Message- >From: Johan De Meersman [mailto:vegiv...@tuxera.be] >Sent: Thursday, April 28, 2011 4:18 PM >To: Jerry Schwartz >Cc: mysql mailing list >Subject: Re: Join based upon LIKE > > >- Original Message - >> From: "Jerry Schwartz&q

Re: Join based upon LIKE

2011-04-28 Thread Johan De Meersman
- Original Message - > From: "Jerry Schwartz" > > No takers? Not willingly, no :-p This is a pretty complex problem, as SQL itself isn't particularly well-equipped to deal with fuzzy data. One approach that might work is using a fulltext indexing engine (MySQL's built-in ft indices,

FW: Join based upon LIKE

2011-04-28 Thread Jerry Schwartz
No takers? -Original Message- From: Jerry Schwartz [mailto:je...@gii.co.jp] Sent: Monday, April 25, 2011 2:34 PM To: 'Mailing-List mysql' Subject: Join based upon LIKE I have to match lists of new publications against our database, so that I can replace the existing publicati

Join based upon LIKE

2011-04-25 Thread Jerry Schwartz
I have to match lists of new publications against our database, so that I can replace the existing publications in our catalog. For example, "The UK Market for Puppies in February 2011" would be a replacement for "The UK Market for Puppies in December 2010" Unfortunately, the publishers aren't