I know you are on the right track: Standardize first, then deduplicate.
In my experience, SQL is not a great language for doing this kind of text
manipulation; standardizing sometimes requires some rather complex logic.
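
To make "standardize first, then deduplicate" concrete, here is a rough
sketch in Python rather than SQL. The abbreviation table and the function
names are made up for illustration, and a real cleanup would need a much
longer table plus handling for units, directions, and so on.

```python
import re

# Hypothetical abbreviation table (an assumption for this sketch);
# a real one would cover far more suffixes and spellings.
ABBREVIATIONS = {
    "road": "rd",
    "street": "st",
    "avenue": "ave",
    "drive": "dr",
}


def standardize(address):
    """Lowercase, strip punctuation, collapse whitespace, abbreviate suffixes."""
    cleaned = re.sub(r"[^\w\s]", "", address.lower())
    words = [ABBREVIATIONS.get(word, word) for word in cleaned.split()]
    return " ".join(words)


def deduplicate(addresses):
    """Keep the first address seen for each standardized form."""
    seen = {}
    for address in addresses:
        # setdefault only stores the address if the key is new.
        seen.setdefault(standardize(address), address)
    return list(seen.values())
```

With this, "12 Oak Road", "12 oak rd" and "12 Oak Rd." all standardize to
the same key, so deduplicate() keeps only the first of them.
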
You should be able to easily justify the cost of a commercial app to
stand
Subject: Mailing Address Deduplication
I know this is not necessarily a MySQL question, but everyone on this
listserv is always so helpful and I was wondering if anyone had any
pointers on how to deduplicate a list of mailing addresses since there can
be so many inconsistencies on how an address
You might also find SOUNDEX useful.
http://dev.mysql.com/doc/mysql/en/String_functions.html
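
For anyone who wants to see what a Soundex code looks like outside the
database, here is a small Python sketch of the classic four-character
Soundex. This is an assumption-laden illustration, not MySQL's code: as far
as I know, MySQL's SOUNDEX() can return strings longer than four characters
and differs in some details, so do not expect identical output.

```python
# Consonant groups used by classic American Soundex.
CODES = {}
for letters, digit in (
    ("bfpv", "1"),
    ("cgjkqsxz", "2"),
    ("dt", "3"),
    ("l", "4"),
    ("mn", "5"),
    ("r", "6"),
):
    for letter in letters:
        CODES[letter] = digit


def soundex(word):
    """Return the classic four-character Soundex code for a word."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    encoded = []
    previous = CODES.get(first, "")
    for letter in letters[1:]:
        code = CODES.get(letter, "")
        # Skip vowels and repeats of the code just emitted.
        if code and code != previous:
            encoded.append(code)
        # 'h' and 'w' do not separate letters that share a code.
        if letter not in "hw":
            previous = code
    # Pad with zeros and cut to one letter plus three digits.
    return (first.upper() + "".join(encoded) + "000")[:4]
```

Street names spelled "Maple" and "Mapel" get the same code here, which is
the kind of match Soundex is good for. It will not help with "road" vs
"rd", so it complements standardizing rather than replacing it.
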
Peter
> -----Original Message-----
> From: Duncan Hill [mailto:[EMAIL PROTECTED]]
> Sent: 21 July 2004 15:10
> To: [EMAIL PROTECTED]
> Subject: Re: Mailing Address Deduplication
>
>
On Wednesday 21 July 2004 15:04, Andrew Kuebler might have typed:
> I know this is not necessarily a MySQL question, but everyone on this
> listserv is always so helpful and I was wondering if anyone had any
> pointers on how to deduplicate a list of mailing addresses since there can
> be so many inc
I know this is not necessarily a MySQL question, but everyone on this
listserv is always so helpful and I was wondering if anyone had any
pointers on how to deduplicate a list of mailing addresses since there can
be so many inconsistencies on how an address can be written (road vs rd vs
rd., etc). I