On 13 May 2010 17:23, Robert Scott <li...@humanleg.org.uk> wrote:

> Hi all,
>
> I've been running some countrywide comparisons of the recently released OS
> Locator against the streets in OSM, using fuzzy string matching and the
> supplied bounding boxes to attempt to match each street in each dataset to
> one in the other. It's worked pretty well for most areas I tested. Of the
> ~826k named streets in OS Locator, about 424k of them have near perfect
> matches in OSM. A few tens of thousands more have what I would call spelling
> 'disagreements'. The rest of them have bad or no matches at all.
>
> I've put a description of the technique up here along with the preliminary
> results:
>
> http://humanleg.org.uk/code/oslmusicalchairs
>
> The thing I really need is suggestions for getting this data to users in a
> way that's practical to work with. It's a CSV currently.
>
> Thoughts welcome. So are bug reports of where my matching algorithm has
> gotten things wrong.
>

What about using Double Metaphone to find the spelling disagreements?
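
A rough sketch of the idea, in case it helps. Double Metaphone is not in the Python standard library, so this uses plain Soundex as a simpler stand-in; a real run would swap in a Double Metaphone implementation. The function names (soundex, phonetic_key, same_sound_different_spelling) are made up for illustration and are not taken from the script described above.

```python
# Soundex digit for each consonant; vowels, h, w and y have no digit.
_CODES = {
    letter: digit
    for digit, letters in {
        "1": "bfpv", "2": "cgjkqsxz", "3": "dt",
        "4": "l", "5": "mn", "6": "r",
    }.items()
    for letter in letters
}


def soundex(word):
    """Return the 4-character American Soundex code of a single word."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    prev = _CODES.get(first, "")
    for c in letters[1:]:
        if c in "hw":
            continue  # h and w do not separate repeated codes
        code = _CODES.get(c, "")
        if code and code != prev:
            digits.append(code)
        prev = code  # a vowel resets prev, so a repeated code counts again
    return (first.upper() + "".join(digits) + "000")[:4]


def phonetic_key(street_name):
    """Phonetic key for a street name: one Soundex code per word."""
    return tuple(soundex(w) for w in street_name.split())


def same_sound_different_spelling(a, b):
    """True if two names differ as text but share a phonetic key."""
    differ = a.strip().lower() != b.strip().lower()
    return differ and phonetic_key(a) == phonetic_key(b)
```

Pairs for which same_sound_different_spelling() is true would be candidates for the "spelling disagreement" bucket; the bounding-box check would still be needed to make sure they are the same street.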

Emilie Laffray
_______________________________________________
Talk-GB mailing list
Talk-GB@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-gb
