Brad,

I've done some work on the topic myself on the linked open data side [1]. My conclusions are similar to yours: there are three different types of information: The Thing, the Name of the Thing and the Location of the Thing and all three are not the same Thing.

It is probably in our interest not to make too many changes to the OSM XML specs; perhaps we can achieve everything by simply adding start_date/end_date to features, ways, nodes and tags? I can't think of a case where change could not be encoded using this approach.

There is the question of imprecise dates (which really aren't all that different from having a precise date with no time). I'd suggest that this be handled by the renderer: most people visualize a year or era, finding features is a query based on date overlap. My only addition would be the before or after tags that would account for "I know it used to be call Signal Hill but I don't remember when".

You might mean probability in your example instead of confidence factor, but my concern with both is that they mean essentially nothing unless you know exactly what process generated the numbers; I'm not sure what I would do with a tag that said "There is a 45% chance that this is the grave of XYZ".

Perhaps your Signal Hill example is not the best for confidence factor in that toponymy is primarily a cultural artifact; I can think of several features that appear under different names depending on the institution that produced the map. The solution to that is the creative use of the source tag, which in the end is no different that the language tag for this particular case.

I am working on another project where documenting a confidence factor would be useful in that location is estimated statistically from the reported sinking locations of ships. The wreck location is predicted to be within a polygon with a .95 confidence interval, which is much more useful that a single node when trying to figure out whether an anchor is likely to foul at that location. Still for most people, or a renderer, the statistical information is unlikely to be useful without a reference to a description of how it was achieved and documenting that seems complex.

Does OSM have a tag to record Feature A is part of Feature B?

best,
rhw
[1] http://rdf.muninn-project.org/ontologies/graves.html

On 21-Jan-13, at 8:33 PM, [email protected] wrote:

Message: 4
Date: Mon, 21 Jan 2013 17:32:44 -0800
From: Brad Thompson <[email protected]>
Subject: Re: [Historic] Temporal Tagging


The issue of how to handle changes in names, locations, and shapes (for territories, building footprints, etc) has been a favorite topic of mine in a few discussions to date. The problems can quickly become complex, but I think a simple schema could encompass all possible historical scenarios. I'm happy to share the observations / conclusions, which have generally
been the following:

1) Unlimited Changes to All Attributes: The schema should allow the
recording of an *unlimited* number of changes to *any* attribute of any entity. No attribute can be used as a unique identifier, like 'name' or 'address' or even location-based attributes, because throughout history, all of these things can change. Therefore, a truly unique identifier must be assigned to each entity. Additionally, buildings are moved, streets are renamed, rerouted, and renumbered. Tim, as you've pointed out, this has
happened many times to some entities. Therefore, a schema that simply
allows for a single 'old-name' isn't flexible enough. All changes to
attributes as described above must each have a time associated with them.
2) Confidence Factors: Because historical data inherently entail
uncertainty, there should be a method of assigning a confidence factor to any attribute. (This feature has no purpose in realtime mapping, because all data can be verified against actual conditions.) This confidence factor would be applicable both to attributes like names as well as times. For names, for example, "we think the name of this hill was Telegraph Hill, but there are conflicting reports that claim it was called Signal Hill, so we
assign a 60% confidence factor to Telegraph Hill and a 40% confidence
factor to Signal Hill". The renderer could then decide how to display the name(s). For times, for example, "we know this hill changed name from Loma Alta to Telegraph HIll sometime between 1848 and 1852, but we don't know for certain when, so we assign the date of the change as January 1, 1850
and give it a confidence factor of 4 years (creating a buffer with a
temporal diameter of 4 years around that date). This idea is critical,
because it allows conflicting reports and developing research to be
displayed alongside well-established facts.
3) Spatial and Non-Spatial Entities: Because shapes (nodes, etc.) cannot be used as unique identifiers the way they can for realtime mapping, there
exists a need to create a distinction between spatial entities and
non-spatial entities. This way, each spatial permutation (or version) of an entity, like a building or a road or a territorial boundary, can have a
distinct shape that is still linked to the nonspatial entity that
represents the concept of its agreed-upon identity. For example, 'United States of America' would be a nonspatial entity with a start date of 1776
and no end date. But linked to that entity would be dozens of spatial
entities, because the boundaries of the United States have changed dozens
of times, therefore changing the shape, through small border edits or
territorial acquisitions. Each of those shapes would have its own start and end time, and the map would display the correct shape as determined by the
time being viewed.

Obviously, we're talking about a dramatically different way of recording place data, but in my view, these levels of detail are critical to making a
viable historical mapping platform where multiple types of data can be
shared and displayed. Looking forward to hearing everyone else's thoughts
on this.

Brad Thompson
Pastmapper


_______________________________________________
Historic mailing list
[email protected]
http://lists.openstreetmap.org/listinfo/historic

Reply via email to