Rob,

Probably, I'm a little busy at the moment so not really going to get round
to doing it in the short term.  Can probably rustle up a list mapping the
ways and nodes to the incorrect postcode fairly quickly which would
probably help?

Aidan

On 1 March 2013 17:35, Rob Nickerson <rob.j.nicker...@gmail.com> wrote:

> That's an interesting list for anyone who is concerned with data
> cleansing! Some of the results are because only the first part of a
> postcode has been entered, however even these have numerous formats (e.g.
> CV3, CV3 ???, CV3 ///). For the other errors, it tends to be typos (e.g.
> CO!6 7BJ, where ! is a probably a typo of 1 - Shift+1=!), but there are
> also road names, numbers, and web URLs in the postcode tag.
>
> Would it be possible to create a list of these where we could add the
> correct postcode in a new column and then upload the new data into OSM?
>
> Rob
>
>
>
>
> On 1 March 2013 17:24, Aidan McGinley <aidmcgin+openstreet...@gmail.com>wrote:
>
>> * How accurate is the data already in OSM?
>> Interesting question Rob, as of today there's approximately 200,000 ways
>> or nodes tagged with postcodes in OSM, this is made up of about 29,000
>> unique postcodes.  Those numbers are not 100% accurate as my bounding box
>> for getting the data overlaps a bit with France and Ireland. I've removed
>> the obvious French postcodes (5 digits) there might be a few I missed
>> although I'm pretty sure the extras don't skew the numbers too much.
>>
>> I've compared the unique values from that list with the ONS dataset
>> (excluding terminated postcodes) and come up with the list linked below [1]
>>
>> There's 1119 unique invalid postcodes, which of of course doesn't account
>> for ways or nodes that are incorrectly tagged with a valid postcode but is
>> a useful stat nonetheless.  It should also be relatively easy to get those
>> cleaned up I would think.
>> Couple of notes about the data, there are a few postcodes that look like
>> they are valid (e.g. BR3 1AZ, WC2H 9BD) but they have in fact got some
>> invalid characters at the end that are not visible so that's why they are
>> listed.  It also includes postcodes in lowercase as well since it breaks
>> from the convention of uppercase postcodes, you could argue that they
>> should be in or out, but it was easier to leave them in.
>>
>> [1]
>> https://docs.google.com/file/d/0B0viaV_xKHyCNmJDY1A1X092Zkk/edit?usp=sharing
>>
>>  On 28 February 2013 23:44, Rob Nickerson <rob.j.nicker...@gmail.com>wrote:
>>
>>>  Interestingly out of the 95 you also identified 2 postcodes that are
>>> incorrect in OSM... raising the obvious questions:
>>>
>>> * How accurate is the data already in OSM?
>>> * Should imports be compared to 100% accuracy or a more realistic
>>> measure of OSM accuracy?
>>>
>>> Rob
>>>
>>> _______________________________________________
>>> Talk-GB mailing list
>>> Talk-GB@openstreetmap.org
>>> http://lists.openstreetmap.org/listinfo/talk-gb
>>>
>>>
>>
>
_______________________________________________
Talk-GB mailing list
Talk-GB@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-gb

Reply via email to