[Talk-GB] Bounty to get Lincoln mapped
Hi folks,

One of our larger web development clients at the company I work for wants a map putting on their website. Unfortunately they're in Lincoln, which is not particularly well mapped in OSM.

I want to offer £50 to anyone who will map at least 95% of roads in Lincoln, specifically those on the East side, before the end of January 2009. I'd like to offer more, but as this is out of my own pocket I thought £50 seemed reasonable. Also, I'd really like it before year's end, as this project shouldn't run any longer than that, but I realise people are busy at this time of year.

Please let me know if you're interested, or pass this on to anyone who might be.

Cheers!
___
Talk-GB mailing list
Talk-GB@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-gb
Re: [Talk-GB] Request for UK address lists for postcode extraction
> One source I am exploring is planning application listings produced by
> the local authority. Which is, I think, where you had headed?

I'm not sure of the legal situation with planning data, but if things seem fine with that then you might be interested to know that the PlanningAlerts project have developed a number of screen scrapers for various local authorities:

http://code.google.com/p/planningalerts/wiki/ExistingScrapers

Cheers,
Gregory
Re: [Talk-GB] Request for UK address lists for postcode extraction
http://en.wikipedia.org/wiki/Pdftotext might help.

Phillip Barnett

-----Original Message-----
From: [EMAIL PROTECTED] On Behalf Of Brian Quinion
Sent: 01 December 2008 17:00
To: Andy Robinson (blackadder-lists)
Cc: talk-gb@openstreetmap.org
Subject: Re: [Talk-GB] Request for UK address lists for postcode extraction

> That seems suitably horrible, although I'd not considered writing
> anything to parse the data out of PDFs. I guess I've got some extra
> work to do...
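For anyone wanting to try the PDF route, here is a minimal sketch. It assumes the `pdftotext` command-line tool is installed and on the PATH. The function names and the regular expression are illustrative only and do not come from anyone's actual tool. The pattern is a simplified approximation of the UK postcode shape, so it will miss special cases and can accept strings that are not real postcodes.

```python
import re
import subprocess

# Simplified postcode shape: 1-2 letters, a digit, an optional letter or
# digit, optional whitespace, then a digit and two letters.
# This is an approximation, not a full validator.
POSTCODE_RE = re.compile(r"\b([A-Z]{1,2}[0-9][A-Z0-9]?)\s*([0-9][A-Z]{2})\b")


def pdf_to_text(pdf_path: str) -> str:
    """Run pdftotext on a file and return the text it writes to stdout ("-")."""
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def lines_with_postcodes(text: str):
    """Return (original line, normalised postcode) for each line with a postcode."""
    found = []
    for line in text.splitlines():
        match = POSTCODE_RE.search(line.upper())
        if match:
            found.append((line.strip(), f"{match.group(1)} {match.group(2)}"))
    return found
```

Typical use would be `lines_with_postcodes(pdf_to_text("listing.pdf"))`, where `listing.pdf` is a placeholder file name. The `-layout` option keeps columns roughly aligned, which tends to help when the PDF is a table of addresses.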
Re: [Talk-GB] Request for UK address lists for postcode extraction
Andy Robinson wrote:
> Brian Quinion wrote:
>> I just need some test data that I didn't write to make sure I'm not
>> making silly assumptions! :-)
> This any use?
>
> "http://www.birmingham.gov.uk/GenerateContent?CONTENT_ITEM_ID=40870&CONTENT_ITEM_TYPE=0&MENU_ID=13170"

That seems suitably horrible, although I'd not considered writing anything to parse the data out of PDFs. I guess I've got some extra work to do...

Cheers,
--
 Brian
Re: [Talk-GB] Request for UK address lists for postcode extraction
On 01/12/2008 16:13, Andy Robinson (blackadder-lists) wrote:
> Brian Quinion wrote:
>> Probably it would be possible to filter it so not too many requests
>> went to any one site, but that still leaves the possibility that they
>> used Royal Mail's postcode finder (or similar) to find their original
>> data. Across a large number of sites you could end up doing a
>> database extraction from Royal Mail regardless.

So are those (also) infringing by making the PAF available in bulk online?

>> Address books and company mailing lists seemed like a preferable
>> source and as long as individuals' names are not included privacy
>> shouldn't be an issue.
>>
> I'd noted that too. Business directory listings (Yell, Thomson etc) or house
> price finders which are using copyright Land Registry data in the
> background.

Is Google infringing by sucking all this data into their database?

David
Re: [Talk-GB] Request for UK address lists for postcode extraction
Brian Quinion wrote:
>Sent: 01 December 2008 4:28 PM
>To: Andy Robinson (blackadder-lists)
>Cc: talk-gb@openstreetmap.org
>Subject: Re: [Talk-GB] Request for UK address lists for postcode extraction
>
>Andy Robinson wrote:
>> I'd noted that too. Business directory listings (Yell, Thomson etc) or house
>> price finders which are using copyright Land Registry data in the
>> background.
>>
>> One source I am exploring is planning application listings produced by the
>> local authority. Which is I think were you had headed?
>
>It's one of the options I hope to look at but my main idea was to get
>a tool that could do the extraction working and make it available to
>anyone who wants to provide data. Anyone with a mailing list or
>address book is a potential data source - I just hope to make it easy
>for them to submit it.
>
>I just need some test data that I didn't write to make sure I'm not
>making silly assumptions! :-)
>

This any use?

"http://www.birmingham.gov.uk/GenerateContent?CONTENT_ITEM_ID=40870&CONTENT_ITEM_TYPE=0&MENU_ID=13170"

Cheers

Andy

>--
> Brian
Re: [Talk-GB] Request for UK address lists for postcode extraction
Andy Robinson wrote:
> I'd noted that too. Business directory listings (Yell, Thomson etc) or house
> price finders which are using copyright Land Registry data in the
> background.
>
> One source I am exploring is planning application listings produced by the
> local authority. Which is, I think, where you had headed?

It's one of the options I hope to look at, but my main idea was to get a tool working that could do the extraction and make it available to anyone who wants to provide data. Anyone with a mailing list or address book is a potential data source - I just hope to make it easy for them to submit it.

I just need some test data that I didn't write to make sure I'm not making silly assumptions! :-)

--
 Brian
Re: [Talk-GB] Request for UK address lists for postcode extraction
Brian Quinion wrote:
>Sent: 01 December 2008 4:01 PM
>To: Andy Robinson (blackadder-lists)
>Cc: David Earl; talk-gb@openstreetmap.org
>Subject: Re: [Talk-GB] Request for UK address lists for postcode extraction
>
>Andy Robinson wrote:
>> David Earl wrote:
>>>On 01/12/2008 14:11, Brian Quinion wrote:
>>>> Has anyone got any suggestions, or is willing to offer any data? Even
>>>> personal address books would be useful for testing...
>>>
>>>You know all the 2,500 or so prefixes, and there are only 26 x 26 * 100
>>>combinations for the second part for each - about 200 million in all. If
>>>you feed these potential postcodes in quotes into Google UK over a long
>>>period with appropriate pauses so as not to get locked out, and look at
>>>the result for recognizable addresses (that's the tricky bit) as I'm
>>>doing in the Namefinder, you'd probably cover 75% of UK postcodes.
>>>
>> I'm curious about this. Data scraped via Google is still subject to the
>> terms of the original page it references?
>
>I looked into this and came to the conclusion that you could probably
>claim 'fair use' as long as you pulled each address from a different
>website. The trouble is that for most searches you end up on one of a
>small number of directory sites so doing any significant number is
>likely to end up as a database extraction. The results are also
>mostly limited to business addresses.
>
>Probably it would be possible to filter it so not too many requests
>went to any one site, but that still leaves the possibility that they
>used Royal Mail's postcode finder (or similar) to find their original
>data. Across a large number of sites you could end up doing a
>database extraction from Royal Mail regardless.
>
>Address books and company mailing lists seemed like a preferable
>source and as long as individuals' names are not included privacy
>shouldn't be an issue.
>

I'd noted that too. Business directory listings (Yell, Thomson etc) or house price finders which are using copyright Land Registry data in the background.

One source I am exploring is planning application listings produced by the local authority. Which is, I think, where you had headed?

Cheers

Andy
Re: [Talk-GB] Request for UK address lists for postcode extraction
Andy Robinson wrote:
> David Earl wrote:
>>On 01/12/2008 14:11, Brian Quinion wrote:
>>> Has anyone got any suggestions, or is willing to offer any data? Even
>>> personal address books would be useful for testing...
>>
>>You know all the 2,500 or so prefixes, and there are only 26 x 26 * 100
>>combinations for the second part for each - about 200 million in all. If
>>you feed these potential postcodes in quotes into Google UK over a long
>>period with appropriate pauses so as not to get locked out, and look at
>>the result for recognizable addresses (that's the tricky bit) as I'm
>>doing in the Namefinder, you'd probably cover 75% of UK postcodes.
>>
> I'm curious about this. Data scraped via Google is still subject to the
> terms of the original page it references?

I looked into this and came to the conclusion that you could probably claim 'fair use' as long as you pulled each address from a different website. The trouble is that most searches land you on one of a small number of directory sites, so doing any significant number is likely to end up as a database extraction. The results are also mostly limited to business addresses.

It would probably be possible to filter it so that not too many requests went to any one site, but that still leaves the possibility that those sites used Royal Mail's postcode finder (or similar) to find their original data. Across a large number of sites you could end up doing a database extraction from Royal Mail regardless.

Address books and company mailing lists seemed like a preferable source, and as long as individuals' names are not included privacy shouldn't be an issue.

--
 Brian
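The filtering idea (not taking too many addresses from any one site) could be sketched roughly as below. This is only an illustration: the function name, the `(url, address)` tuple shape and the default cap of one per host are assumptions, and it makes no claim about what would be legally sufficient. It also does nothing about the second concern, that the sites themselves may have taken their data from Royal Mail.

```python
from collections import Counter
from urllib.parse import urlparse


def cap_per_site(results, max_per_site=1):
    """Keep at most max_per_site (url, address) results from any one host.

    Results are kept in their original order; later results from a host
    that has already reached the cap are dropped.
    """
    seen = Counter()
    kept = []
    for url, address in results:
        host = urlparse(url).netloc.lower()
        if seen[host] < max_per_site:
            seen[host] += 1
            kept.append((url, address))
    return kept
```

Grouping by host name is the simplest choice; several host names can belong to the same directory company, so a real filter would probably need a hand-maintained mapping as well.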
Re: [Talk-GB] Request for UK address lists for postcode extraction
David Earl wrote:
>Sent: 01 December 2008 3:10 PM
>To: talk-gb@openstreetmap.org
>Subject: Re: [Talk-GB] Request for UK address lists for postcode extraction
>
>On 01/12/2008 14:11, Brian Quinion wrote:
>> Hi,
>>
>> I'm currently doing some work trying to generate postcode location
>> data for the UK using address lists and address lookup using OSM data
>> to supplement NPE. So far it seems to work quite well with the
>> address lists that I have available to me (and coping quite well with
>> ambiguous road names) but I'm limited in my data sources and most of
>> the address data is fairly consistent in both format and quality.
>>
>> So, before I open the interface to the public, I'd like to test the
>> code with some lists provided by other people.
>>
>> Does anyone have, or know of, any address lists that I would be able
>> to use for this purpose? Obviously it needs to be license compatible
>> with OSM (so please no lists generated from royal mail postcode data!)
>> and ideally I'm after data sets containing at least:
>>
>> street address (house name / number optional)
>> town / city
>> postcode
>>
>> formatted as CSV or TSV. I'm specifically not after data containing
>> the names of individuals.
>>
>> Has anyone got any suggestions, or is willing to offer any data? Even
>> personal address books would be useful for testing...
>
>Why not do it the other way round?
>
>You know all the 2,500 or so prefixes, and there are only 26 x 26 * 100
>combinations for the second part for each - about 200 million in all. If
>you feed these potential postcodes in quotes into Google UK over a long
>period with appropriate pauses so as not to get locked out, and look at
>the result for recognizable addresses (that's the tricky bit) as I'm
>doing in the Namefinder, you'd probably cover 75% of UK postcodes.
>
>Yes, its slow, but it's probably the biggest source there is. At one a
>second it would take about 6 years, but by enlisting 100 friends you'd
>do it in a month - less if it's possible to be more intelligent about it
>- for example, for the number part if there's no 14XX or 15XX I doubt
>there would be any 16s or above either, except for a few special cases.

I'm curious about this. Data scraped via Google is still subject to the terms of the original page it references?

Cheers

Andy

>
>David
Re: [Talk-GB] Request for UK address lists for postcode extraction
On 01/12/2008 14:11, Brian Quinion wrote:
> Hi,
>
> I'm currently doing some work trying to generate postcode location
> data for the UK using address lists and address lookup using OSM data
> to supplement NPE. So far it seems to work quite well with the
> address lists that I have available to me (and coping quite well with
> ambiguous road names) but I'm limited in my data sources and most of
> the address data is fairly consistent in both format and quality.
>
> So, before I open the interface to the public, I'd like to test the
> code with some lists provided by other people.
>
> Does anyone have, or know of, any address lists that I would be able
> to use for this purpose? Obviously it needs to be license compatible
> with OSM (so please no lists generated from royal mail postcode data!)
> and ideally I'm after data sets containing at least:
>
> street address (house name / number optional)
> town / city
> postcode
>
> formatted as CSV or TSV. I'm specifically not after data containing
> the names of individuals.
>
> Has anyone got any suggestions, or is willing to offer any data? Even
> personal address books would be useful for testing...

Why not do it the other way round?

You know all the 2,500 or so prefixes, and there are only 26 x 26 * 100 combinations for the second part for each - about 200 million in all. If you feed these potential postcodes in quotes into Google UK over a long period, with appropriate pauses so as not to get locked out, and look at the results for recognizable addresses (that's the tricky bit), as I'm doing in the Namefinder, you'd probably cover 75% of UK postcodes.

Yes, it's slow, but it's probably the biggest source there is. At one a second it would take about 6 years, but by enlisting 100 friends you'd do it in a month - less if it's possible to be more intelligent about it. For example, for the number part, if there's no 14XX or 15XX I doubt there would be any 16s or above either, except for a few special cases.

David
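As a quick check of the arithmetic, taking the post's own figures at face value gives 169 million candidates, roughly 5.4 years at one query per second, or about 20 days split across 100 people. That is the same order of magnitude as the "about 200 million", "about 6 years" and "a month" quoted. The sketch below reproduces that and adds a hypothetical `candidates` generator. The generator assumes the usual shape of the second part of a postcode (one digit followed by two letters), which yields 6,760 candidates per prefix, fewer than the post's count. The variable names and the example prefix are illustrative only.

```python
import itertools
import string

PREFIXES = 2_500                     # "2,500 or so prefixes" (figure from the post)
SUFFIXES_PER_PREFIX = 26 * 26 * 100  # "26 x 26 * 100" (figure from the post)

total = PREFIXES * SUFFIXES_PER_PREFIX

seconds_per_year = 365.25 * 24 * 3600
years_alone = total / seconds_per_year      # one query per second, one person
days_with_100 = total / 100 / (24 * 3600)   # same rate, split across 100 people


def candidates(prefix):
    """Yield candidate postcodes for one prefix.

    Assumption: the second part is one digit followed by two letters,
    giving 10 * 26 * 26 candidates per prefix.
    """
    letters = string.ascii_uppercase
    for digit, first, second in itertools.product(string.digits, letters, letters):
        yield f"{prefix} {digit}{first}{second}"


if __name__ == "__main__":
    print(f"{total:,} candidates")
    print(f"{years_alone:.1f} years alone, {days_with_100:.0f} days with 100 people")
    print(next(candidates("CB1")))
```

Each generated string would then be searched for in quotes, with pauses between requests, as described above.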
[Talk-GB] Request for UK address lists for postcode extraction
Hi,

I'm currently doing some work trying to generate postcode location data for the UK using address lists and address lookup using OSM data to supplement NPE. So far it seems to work quite well with the address lists that I have available to me (and copes quite well with ambiguous road names), but I'm limited in my data sources and most of the address data is fairly consistent in both format and quality.

So, before I open the interface to the public, I'd like to test the code with some lists provided by other people.

Does anyone have, or know of, any address lists that I would be able to use for this purpose? Obviously it needs to be license compatible with OSM (so please no lists generated from Royal Mail postcode data!) and ideally I'm after data sets containing at least:

street address (house name / number optional)
town / city
postcode

formatted as CSV or TSV. I'm specifically not after data containing the names of individuals.

Has anyone got any suggestions, or is willing to offer any data? Even personal address books would be useful for testing...

Cheers,
--
 Brian
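For anyone preparing a list to send, the requested shape could be checked with something like the sketch below. The column names (`street`, `town`, `postcode`), the function name and the sample rows are all made up for illustration and are not the format the actual tool expects. The postcode pattern is a simplified approximation. Any other columns, such as a name, are dropped rather than passed through.

```python
import csv
import io
import re

# Simplified postcode shape; an approximation, not a full validator.
POSTCODE_RE = re.compile(r"^[A-Z]{1,2}[0-9][A-Z0-9]?\s*[0-9][A-Z]{2}$")


def read_addresses(handle, delimiter=","):
    """Yield street/town/postcode dicts, skipping rows with a malformed postcode.

    Columns other than the three wanted ones (for example a name) are ignored.
    Pass delimiter="\t" for TSV input.
    """
    for row in csv.DictReader(handle, delimiter=delimiter):
        postcode = (row.get("postcode") or "").strip().upper()
        if not POSTCODE_RE.match(postcode):
            continue
        yield {
            "street": (row.get("street") or "").strip(),
            "town": (row.get("town") or "").strip(),
            "postcode": postcode,
        }


if __name__ == "__main__":
    # Made-up sample data: one usable row, one with no real postcode.
    sample = io.StringIO(
        "name,street,town,postcode\n"
        "A Person,1 High Street,Lincoln,ln1 1aa\n"
        "B Person,2 Low Road,Lincoln,unknown\n"
    )
    for address in read_addresses(sample):
        print(address)
```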