Re: [Talk-ca] Importing buildings in Canada

John Whelan Sat, 28 Sep 2019 11:05:42 -0700

And I totally agree. Because the Stat Can data has come from manysources the data quality is variable to put it politely. The Microsoftdata has been shown in the US to also be of variable quality. I'm notso sure about the NR Can LiDAR data hopefully it is at least consistent.

If we look at the history of the project then we can get an idea of howwe came to be where we are.

First I wanted to import all the bus stops in Ottawa because only byimporting could you ensure you had all the stops with their referencenumbers but the City of Ottawa Open Data license did not align withOSM. I was also talking to Treasury Board and explained to them thattheir Open Data license version 1 didn't align with OSM so we couldn'tuse their data. Five years later TB released their Open Data licenseversion 2 which they felt did align.

Stat Can has two types of project, pilot ones and ones that earn money.The original pilot was based on Ottawa and Gatineau and was for twoyears. Their original plan was mapathons using iD. I was impressedwhen a stat can employee managed to accurately map a building using iDduring a presentation. I hadn't thought it was possible after some ofthe efforts I'd seen in HOT mapping. Fine except that it requires alot of mappers and I think Fredrick has commented this sort of mappingwith new mappers needs a lot of clean up effort sometimes more thanrequired for an experienced mapper to map it right in the first place.Montreal has identified there just aren't enough experienced mappersavailable.

I worked at Stats Canada for a number of years. The corporate cultureis very different to OSM. It makes its money by selling data. Want toopen a new coffee bar? Stats Canada will combine its data to sell youthe ideal spot based on residents' income etc.. I had a meeting withStats Canada, City of Ottawa planning department, an Open Dataspecialist from Carlton University, someone from Metrolink who had addeddata to openstreetmap to help people find the nearest bus stop and acouple of HOT board members. We convinced Stats Canada to change thedirection of the pilot to use Open Data rather than go the mapathonroute partly for data quality reasons and partly because I didn't thinkwe could find the mappers to map the buildings completely. The Stat Caninvolvement meant the City of Ottawa was persuaded to change its OpenData license to the same as the TB one. That took time and had to go tocouncil for approval. There was a lot of discussion with the localcommunity and it was they who organised and did the import. The localgroup worked nicely together and had a range of skill sets in thegroup. I actually played more of a connecting role than anything else.

The import was challenged on the data license amongst other things buteventually the OSM legal working group was very kind and ruled thelicense was acceptable. Stats is very interested in added detail tobuildings. I was very interested that we could now import the bus stops.

I think you picked up on the fact that the buildings mapped in amapathon were less than ideal. I was involved in one in Ottawa and justtaught the new mappers to use JOSM and the building_tool. That producedmore buildings per mapper hour and they were fairly accurate. I mustconfess not every attached garage was mapped in detail.


I seem to recall Mapbox being involved in the Maperthons in some way.

The Stats Can involvement meant we saw some interest from schools. WhatI was interested in was added detail so mapped a couple of thousandbuildings in Ontario using JONM and the building_tool so details couldbe added easily. We got two addresses added. Apparently in Ontario theprovincial government has purchased ESRI for school children to learnabout GIS.

At the end of the pilot the money had run out. Stats covered some ofthe costs involved in the HOT summit that was held in Ottawa and duringthat summit phase two was launched but without any real funding.

What Stats could do though was release data from the municipalitiesunder the government Open Data license and that is what they did. AsJarek has pointed out following the import process is stressful so Ivolunteered to do the paperwork and submit the plan. There was somediscussion on talk ca and the idea surfaced to go with one plan ratherthan divide the country up. So that's what I did.

Today we have three sources of data that could be imported, and Isuspect the two that are not municipal data are more consistent. Westill have the original plan of mapathons with iD floating around.

My person view is the imported data quality is better than the mapathonapproach but to go forward from here I think it needs to be re-plannedand a new import plan(s) drawn up.

I don't think Stats have any real funding available at the moment. Theymay find an odd hour in a quiet time but its coming up to March 31th anddeadline time so I don't expect any major resources to be made availablefrom them certainly not of the data clean up variety.


Cheerio John

Pierre Béland wrote on 2019-09-28 12:20 PM:

Je comprends que c'est la saison des tomates. Mais essayons de lesutiliser pour nos conserves et non comme argument pour convaincre lesautres contributeurs ! ;)
Comme les autres l'ont exprimé, c'est à ceux qui proposent de fairedes imports de bien documenter le processus, non l'inverse. Et lesmenaces d'agir de façon impériale et négliger les communautés locales,cela ne tient évidemment pas la route.
Pour discuter sur la qualité des données, il est nécessaire de pouvoirfacilement examiner les données. Et je ne penses pas que les donnéessoient comparables d'un endroit à l'autre. La qualité des images, ladensité du bâti en milieu urbains sont autant de facteurs.
Les fichiers accessibles aussi bien pour StatCan que Microsoft sonttrès gros. Simplement pour analyser les données de nos municipalitésrespectives, il faut traiter de gros fichiers et tenter d'extraire lesdonnées. Ce qui n'est pas nécessairement facile et va bien sûr limiterla participation.
Question de donner des exemples sur les limites d'observation desimages par les technique de AI, j'ai publié des images avec les 2tweets suivants montrant des bâtiments au centre de Toronto :
https://twitter.com/pierzen/status/1177976517902684160

https://twitter.com/pierzen/status/1177978125377884160
On voit bien qu'il ne suffit pas de valider si les angles sont droits.Ces exemples montrent bien comment le tracé peut variersignificativement vs la réalité au sol. Et tout comme les humains, lestechniques de AI ont de la difficulté à identifier les bâtimentsindividuels.
cordialement


--
Sent from Postbox <https://www.postbox-inc.com>

_______________________________________________
Talk-ca mailing list
Talk-ca@openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk-ca

Re: [Talk-ca] Importing buildings in Canada

Reply via email to