Re: Mailing Address Deduplication

2004-07-21 Thread SGreen
I know you are on the right track: standardize first, then deduplicate. In my experience, SQL is not a great language for this kind of text manipulation; standardizing sometimes requires rather complex logic. You should be able to easily justify the cost of a commercial app to stand
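A minimal sketch of the "standardize first, then deduplicate" idea, done outside SQL. The abbreviation table and the function names `standardize` and `deduplicate` are illustrative assumptions, not anything from the thread; real address standardization needs far more rules than this.

```python
import re

# Hypothetical abbreviation table; a real one would be much larger
# and locale-specific.
ABBREVIATIONS = {
    "STREET": "ST",
    "AVENUE": "AVE",
    "ROAD": "RD",
    "SUITE": "STE",
    "APARTMENT": "APT",
    "NORTH": "N",
    "SOUTH": "S",
    "EAST": "E",
    "WEST": "W",
}


def standardize(address: str) -> str:
    """Return a crude canonical form of an address string."""
    text = address.upper()
    # Replace punctuation with spaces so "St." and "St" compare equal.
    text = re.sub(r"[^A-Z0-9\s]", " ", text)
    # split() also collapses runs of whitespace.
    words = [ABBREVIATIONS.get(word, word) for word in text.split()]
    return " ".join(words)


def deduplicate(addresses):
    """Keep the first address seen for each standardized form."""
    seen = {}
    for address in addresses:
        seen.setdefault(standardize(address), address)
    return list(seen.values())
```

With these assumed rules, "123 North Main Street, Apt. 4" and "123 N. Main St apartment 4" both standardize to the same key, so only the first one survives deduplication.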

RE: Mailing Address Deduplication

2004-07-21 Thread Kevin Cowley
Validating addresses into a standard format for deduplication is a field in its own right. If you want to do this in a meaningful manner, then you'll need to validate the addresses prior to database load using a product such as Global Address, and then do the comparison with an application that

RE: Mailing Address Deduplication

2004-07-21 Thread Sunmaia
You might also find SOUNDEX useful. http://dev.mysql.com/doc/mysql/en/String_functions.html
Peter
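For illustration, here is a sketch of the common four-character American Soundex code in Python. This is an assumption about which variant is useful here: MySQL's SOUNDEX() can return strings longer than four characters and follows slightly different rules, so its output may not match this function exactly.

```python
# Map each consonant to its Soundex digit; vowels, H, W and Y have no code.
CODES = {}
for letters, digit in (
    ("BFPV", "1"),
    ("CGJKQSXZ", "2"),
    ("DT", "3"),
    ("L", "4"),
    ("MN", "5"),
    ("R", "6"),
):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character American Soundex code for a word."""
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    previous = CODES.get(letters[0], "")
    for letter in letters[1:]:
        if letter in "HW":
            # H and W do not separate letters that share a code.
            continue
        code = CODES.get(letter, "")
        if code and code != previous:
            result += code
        # A vowel resets `previous`, so a repeated code after it counts.
        previous = code
    # Pad with zeros and truncate to one letter plus three digits.
    return (result + "000")[:4]
```

Comparing codes word by word lets spelling variants such as "Main" and "Mane" match, at the cost of false positives, so it is better suited to flagging candidate duplicates than to deleting rows automatically.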

Re: Mailing Address Deduplication

2004-07-21 Thread Duncan Hill
On Wednesday 21 July 2004 15:04, Andrew Kuebler might have typed:
> I know this is not necessarily a MySQL question, but everyone on this
> listserv is always so helpful and I was wondering if anyone had any
> pointers on how to deduplicate a list of mailing address since there can
> be so many inc