Re: [Talk-ca] FW: Re: Importing buildings in Canada

Nate Wessel Thu, 16 Jan 2020 14:24:41 -0800

Daniel,

The section you cite from the import guidelines seems to support myposition. Perhaps that means I haven't explained my thoughts well? It says:

"If you are importing data where there is already some data in OSM, thenyou need to combine this data in an appropriate way or suppress theimport of features with overlap with existing data. Only where you haveexplicit support, which is highly unusual, should you replace currentdata with your imported data."

OSM is not all that dynamic - if it was, we wouldn't be talking about animport. We should do our best to align the data we're thinking ofbringing in with gaps in the existing data. That's what conflation isall about. We should not systematically change existing data. That'scalled an automated edit and is governed by different policies than animport.

I'm not saying any plan is perfect - there will be gaps and overlaps, ofcourse. But that doesn't mean we don't try at all.

Toronto has hundreds of thousands of buildings in OSM. They are notgoing to be compared manually, one by one if there is an import.

--

And again, I'm not saying not to do explicit building conflation andcheck for updated geometries - I'm just saying that it needs to be aseparate process if it happens.


Best,

Nate Wessel, PhD
Planner, Cartographer, Transport Nerd
NateWessel.com <https://www.natewessel.com>

On 2020-01-16 2:56 p.m., Daniel @jfd553 wrote:

Hello Nate,
I understand that you don’t like to see an import process that bothbring in new objects and overwrite existing ones. You also suggestremoving "overlapped" building from ODB prior to import it. Suchpre-processing, that would ensure there will be no buildings"overwrite" during the import, is not realistic (i.e. you will need tooverwrite some buildings anyway). Here are two reasons why it would bedifficult...
1-OSM is a dynamic project and, unless you can "clean" the data on thefly, you will end up with overlaps since some contributors will haveadded buildings in the meantime.
2-One cannot assume that an OSM building, and its ODB counterpart,will be found at the same location (look at DX and DY columns in ODBinventory tables). These are averages, which means there are largeroffsets between both datasets (i.e. you won’t get a match betweenbuildings, or get a match with the wrong ones).
The only realistic option is then to manually delete the ODB buildingsif they overlap OSM ones. Here is what the import guideline suggests [1]…
"If you are importing data where there is already some data in OSM,then you need to combine this data in an appropriate way or suppressthe import of features with overlap with existing data."
Therefore, importing data and using a conflation process is notunusual. Again, I understand that in case of an overlap, you go forthe last option (suppress the import building). I am rather inclinedtoward using conflation when necessary, which means...
-Importing an ODB building when there is no corresponding one in OSM;
-Conflating both ODB and OSM buildings when it significantly improvesexisting OSM content;
-Not importing an ODB building when the corresponding one in OSM isadequate.
What do the others on the list think?

Daniel
[1]https://wiki.openstreetmap.org/wiki/Import/Guidelines#Don.27t_put_data_on_top_of_data
*From:*Nate Wessel [mailto:bike...@gmail.com]
*Sent:* Thursday, January 16, 2020 10:38
*To:* Daniel @jfd553; talk-ca@openstreetmap.org
*Subject:* Re: [Talk-ca] FW: Re: Importing buildings in Canada

Responding to point C below,
I would strongly suggest that we not confuse the process of importingnew data with that of updating/modifying existing data in the OSMdatabase. One of the things I really disliked about the initialbuilding import was that it overwrote existing data at the same timethat new data was brought in. These are really two separate importprocesses and require very different considerations.
We can certainly consider using this dataset to improve/updateexisting building geometries, but I think that is a separate processfrom the import we are discussing here. To keep things simple for thisimport, I would suggest removing any building from the import datasetthat intersects with an existing building in the OSM database. Thatis, let's not worry about conflation for now, and come back and dothat work later if we still feel there is a strong need for it.
I see the main point of this effort as getting more complete coverage- it we want to use the dataset to do quality assurance on existingdata, that is a whole other discussion.
Best,

Nate Wessel, PhD
Planner, Cartographer, Transport Nerd
NateWessel.com <https://www.natewessel.com>

On 2020-01-15 12:55 p.m., Daniel @jfd553 wrote:

    Thanks for the quick replies!

    Now, about...

    *a) Data hosting:*

    Thank you James, I really appreciate your offer (and that of
    others). So yes, I think hosting pre-processed data in the task
    manager, for approved regions, is an attractive offer. When we
    agree on a municipality for pre-processing, I will contact you to
    make the data available.

    BTW, I thought ODB data in OSM format was hosted with the
    OSMCanada task manager. I understand that ODB data are currently
    converted on the fly when requested?

    *b) Task manager work units for import:*

    I agree with Nate, ~ 200 buildings or ~ 1,500 nodes would be
    suitable. I was thinking at the same importation rate, but for an
    hour of work. It seems best to target 20-minute tasks.

    *c) Task manager work units for checking already imported data*

    According to Nate, it is definitely not faster than actively
    importing. We should then keep the above setup (b).

    However, what if I add a new tag to pre-processed data indicating
    if a building was altered or not by the orthogonalization (and
    simplification) process? For instance, /building:altered=no/,
    would identify buildings that were not changed by the process and
    that could be left unchanged in OSM (i.e. not imported);
    /building:altered=yes/ for those who were changed by the process
    and that should be imported again. The same pre-processed datasets
    could then be made available for all cases. Thoughts?

    *d) Finding local mappers:*

    I agree with Nate’s suggestion to try contacting the top 10
    mappers in an area. Using the "main activity center" would work
    for most of the contributors but selecting other overlays (.e.g.
    an activity center over last 6 months) could also work great. As
    long as we identify who might be interested in knowing there is an
    import coming.

    Comments are welcome, particularly about the proposal on c)

    Daniel

_______________________________________________
Talk-ca mailing list
Talk-ca@openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk-ca

Re: [Talk-ca] FW: Re: Importing buildings in Canada

Reply via email to