Hi sandor:

to long - did not read it.

keep it simple please,

regards
walter


Am 10.04.2017 um 10:04 schrieb Sandor Seres:

Three weeks ago I posted some multipolygon related notes. This mail is, in a way, an addition to that former mail.

My first note was triggered by some user worries about poorer maps if they use data from the osm2ogsql preparation. Dropping “broken multipolygons” will result in many and large empty/white places with long reparation period. Strengthening the preparation on the subject might be a better option in my opinion (I know, I was there). However, at the end, how this subject will be handled is perfectly up to the authors of the osm2pgsql application. Users starting from the OSM source data will not be affected whatever strategy will be selected.

The second note was related to the mass/programmatic correction of the source data. This could have dangerous/damaging impact on many OSM users. Fortunately, the replays say that programmatic correction is not a strategy in the “fixing multypolygons” actions. I have mentioned the “self-crossings” issue which is not an error for many users (depending on what notion interpretations and tools one uses). To clean up the confusion, this note needs some additional words. Assume someone would correct all polygon self-crossings in the source data. Assume, the selected fixing model is the popular dividing model (the polygon is divided into new polygons between self-crossings). The “fix” will be correct but the consequences damaging. Namely, in scaling and rendering the new small areas quickly reach ignorable/collapsing size causing brakes. Here, it is worth noting, that the self-crossing issue is a topic in the modern vector based digital mapping even if all self-crossings are somehow resolved in the source data. Namely, while scaling and doing edge-smoothing in data generalisation, self-crossings on thin area sections (like fiords, peninsulas, rivers and so on) are unavoidable and dividing produces many tiny areas. High fragmentation of the source data and freedom of tag selection (river sections tagged as lakes) make the issue even worse. Just look at the Amazonas river-system rendering from a popular vector map-maker her http://goo.gl/bT1Bu9

(the screen dump is from yesterday, from a demo system, in roughly 1:6.7 mill scale). There are really many and large unacceptable breaks. However, from the same data source, using topology geometry as suggested in my former mail, it is possible to create a compact minimal coverage for the same river system like this https://goo.gl/pNQwDm . Note that the river system her is one simple area (one outer and many inner borders never touching each other) from Peru to the Atlantic. To be on the fair side the last image should be rendered from a zoom/scale level that corresponds to the 1:6.7 mill scale. This is done here https://goo.gl/eaAWNy and the zoom level contains approximately 250 times less nodes than the level used for the previous image. The area connectivity is still perfectly preserved and the image is much cleaner in this scale extract. Finally, if a user is still insist on fixing the polygon self-crossings, exchanging and reversing the poly-lines between two consecutive self-crossings (eventually just reversing the end loop after a self-crossing) should be a much better strategy.

However, the third, the last note was my major point. Just to remind. There is a large set of area related anomalies caused by relations between objects from different classes (between seas, forests, lakes, rivers…). The extent and complexity of this set is far beyond the “broken polygons” issue and should be more in the development focus. Even if the areas/multipolygons within a class are in perfect conformity with the strongest OSM and OGC rules, still these anomalies are there, though sometimes hardly visible in maps. Therefor many map-makers tolerate them but in GIS systems they appear as strong limitations and should not be tolerated. In the former mail I have presented many examples and some hints how these anomalies could be resolved. Unfortunately, the discussion went in a wrong direction, about the Scandinavian forests, while the region selection is irrelevant for the subject. To avoid much repetition I will present further examples without details in procedures. The illustrations are from the area of Japan (one of the best mapped areas) and the source is the standard OSM dump from some week ago.

Honestly, I am not sure what a forest is. More precisely, if you ask me – I know, if you ask me to tell what it is – I do not know. However, among the many interpretations, I am closest to accept the topology interpretation of the notion. The green area in the front page map (or in other OSM based maps) usually covering the areas tagged as forest and/or wood. In Japan, as everywhere, forests are uploaded highly fragmented, they overlap in the most strange combinations, the same with river and lake area objects. The most common case is when borders of neighbouring objects run in and out of each other. The fragmentation itself is causing lots of problems even in rendering. Just look at these examples (the well-known light/dark stripes) here http://osm.org/go/7WCEND?layers=H or here http://osm.org/go/7WCzACu--?layers=C or here https://goo.gl/JVI1E7 or here https://goo.gl/Xhv1nq . Extending the areas within the object classes may help in rendering but still the fragmentation is there.

Assume, we have managed to remove all redundancy, repair most of the “broken polygons” and perform full defragmentation within area classes: forests, lakes, rivers and land masses. Besides, we managed to recognize and replace missing river sections, missing islands in lakes and rivers. So, within any of these object classes we have the best data presentation that is potentially possible from the source data. Yet, we quickly discover that there are forests overwritten by lakes, rivers running over forests, borders of lakes running in and out of forests and so on (the inter class anomalies). While these anomalies are not show stoppers in rendering, they limit the corresponding GIS’s quality, statistics, quantitative analyses and forecasts (number of trees in forests, CO2 consumption per year, oxygen production per year and so on). Let us assume, we have managed to repair all these anomalies by using the topology geometry/calculus as hinted in my previous mail. Then some of the results are like these:

The country’s land area created from the coastline data is here http://goo.gl/O1L60r , the border polygons are disjunctive and there are no holes at all. Subtracting all inland water areas and adding the islands within these, we get the land-masses illustrated here http://goo.gl/OM2dqn <http://goo.gl/OM2dqn>. The yellow areas represent a minimal simple/compact land-masses coverage. The inland waters make only about 0.5% of the land area.

The countries forest coverage is pretty high https://goo.gl/HU63M7 . The forests cover around 63.6% of the land-masses, though there are still some forests to be mapped (see the Kyushu island). The largest compact/simple forest area, here https://goo.gl/4yzeyC, by size equals to 24% of all forests. It consists of one outer/container and 25831 inner/excluding polygons. All polygons are disjunctive and from any point A to any point B in this area one can go walking exclusively through the forest (hm, the shortest way?). However, the holes of this largest simple area contain additional 2892 new (small) “forests”. An extract from this complete, largest reginal forest is presented here https://goo.gl/mzgDRg . The light green is the largest simple forest area while the dark green represents the smaller forests in holes. One can see that there are even holes in these small forests and new forests in their holes and so on. Similar inclusions sometimes go up to 6 levels. The ten largest simple areas make 70.2% of all forests in the country.

Finally, extending the case to other object types and/or larger areas like continents or the Planet, one can feel the huge potential of OSM, especially in the future with growing content. Simply, it is difficult not to be an enthusiast of it.

Regards, Sandor



_______________________________________________
talk mailing list
talk@openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk

_______________________________________________
talk mailing list
talk@openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk

Reply via email to