On 10.03.2015 at 3:52, Alex Barth wrote:
Casting the net a little wider:
What do you think are the big topics and challenges for OpenStreetMap
as we're about to go into the second decade? What does this mean for
State of the Map?
I'm not sure what this means for SotM, but here are my thoughts on
strategic topics as I see them, being an active OSM community member
from Poland. They are probably not very outstanding or visionary, but
IMHO they are important, and it can easily take a few years - up to a
decade - to achieve these goals nevertheless... =}
TL;DR summary:
1. More synergy between (sub)projects
2. Managing data overload
2a. Semi-automated task execution
2b. Continuous sanity check tools
2c. Big data analysis
3. Tools for personalizing OSM data presentation
4. Redesigning some key tagging schemes
5. Cooperation with external projects (especially open, like Wikidata)
***
And now that "TL" part alone:
1. More synergy between (sub)projects
OSM acts in a highly decentralized way. I think this is healthy in
general, but because the project is now on the road to being "The Map"
(much like Wikipedia is "The Encyclopedia" now), this results in
growing inefficiency and inertia. Of course we can't avoid having some
departments, simply because OSM is getting bigger and people tend to
focus on their favorite activity, but nobody seems to care about the OSM
output as a whole. For example, you can discuss to death every detail of
tagging some esoteric feature while completely ignoring how it will
affect rendering, routing or usability for mappers (for example, how
to tell the difference between tags A and B).
While this is a general issue and can't be resolved once and for all, I
think connecting more dots inside the project is a very important factor
in reaching "The Map" level.
2. Managing data overload
We have so much data these days! It's a blessing and a curse. There are
still many remote places where nothing is mapped - sure! But now we know
how to do it, we have the tools to do it and we can help local
volunteers to start (one of our most active members is helping to
develop mapping in Nepal, another is fond of the Kyrgyzstan mountains).
But what to do with big, dense cities, where we have the landuses, the
buildings, the streets and all the other "micromapping" things - plus
the 3D layers, indoor mapping, underground facilities, etc. squeezed
together? It will be increasingly hard to work with them - and new users
will suffer the most, because it's easier to damage something while
trying to add a nice shiny object than to really extend the map.
Maybe we should split the data into "sets" or just make iD and JOSM
more layer/theme-centric tools - I don't know yet, but the problem is
here to stay.
2a. Semi-automated task execution
One of the things we should really start to practice is relying more on
automation. Let me quote:
"Try not to let humans do what machines could do instead. As a rule of
thumb, automating a common task is worth at least ten times the effort a
developer would spend doing that task manually one time. For very
frequent or very complex tasks, that ratio could easily go up to twenty
or even higher."
[ http://www.producingoss.com/en/managing-volunteers.html#automation ]
As an example - we are just starting to use a semi-automated script for
updating public transport routes in Warsaw. There are over 300 lines
here, many hundreds of bus/tram stops - and they are constantly
changing, of course, one by one. When I got interested in it, we had a
dedicated wiki project, but it was on hiatus by then and I quickly gave
up too, because tracking so many objects in the wiki was a tedious and
not very useful task. Once we learned that the local public transport
operator (called ZTM) had started giving away their precious raw data
with coordinates (!), me and a few other mappers started to add all the
stops, focusing on "stop_position" tagging. Now, after almost a year of
work, we have all of them and a C++ script which creates an updated
route network in about 15 minutes. Two important things happened lately
regarding the transport system in my hometown - one of the bridges was
destroyed by fire and a new subway line was opened. If we did everything
by hand, we would be dead by now, because too many things would change
too fast - but the script, rough as it is, has no problem even if all
the lines change at once. We only have to feed it with new stops (when
they are created) and we're done!
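For the curious, the core idea can be sketched in a few lines. This is a
toy Python sketch, not our actual C++ tool - the input format and field
names are invented: take the operator's raw stop list and emit an .osm
file that a human can review in JOSM before uploading.

```python
import xml.etree.ElementTree as ET

def stops_to_osm(stops):
    """Build a JOSM-loadable .osm document from (name, lat, lon) stops."""
    root = ET.Element("osm", version="0.6", generator="stop-sketch")
    # New objects get negative ids so the editor treats them as
    # not-yet-uploaded and the server assigns real ids later.
    for i, (name, lat, lon) in enumerate(stops, start=1):
        node = ET.SubElement(root, "node", id=str(-i),
                             lat=f"{lat:.7f}", lon=f"{lon:.7f}")
        ET.SubElement(node, "tag", k="public_transport", v="stop_position")
        ET.SubElement(node, "tag", k="name", v=name)
    return ET.tostring(root, encoding="unicode")

xml_out = stops_to_osm([("Centrum 01", 52.2297, 21.0122)])
```

The point of emitting a reviewable file instead of uploading directly is
exactly the "semi-" in semi-automated.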
However - beware of "botocracy"! If there's only one person able to use
the tool, it's not a sustainable model. It's important that the script
only produces an .osm file as its output, so we can easily handle
injecting it into the database once a day - or at any other rate. We
have a chance to make manual fixes here and there if needed. Too much
automation is not the answer - just let us get rid of the boring
details!
2b. Continuous sanity check tools
Tools such as this public transportation updating script need to be run
continuously - be it once per day or whatever. And we need some more
tools to monitor the whole service and data. Sure, we can't predict
everything (see this message:
https://lists.openstreetmap.org/pipermail/talk/2015-March/072273.html ),
but we can have a "control center" for managing all these bots. The more
data we have and the bigger our project gets, the more we will need this
kind of automation too. And it should also be open to the public to
avoid "hit by a bus" kinds of situations.
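To make "continuous sanity check" a bit more concrete, here is a minimal
Python sketch of one check such a control center could run daily: flag
public transport route relations that look broken. The input shape (a
list of dicts) is invented - a real bot would read relations from the
API or a replication diff.

```python
def check_routes(relations):
    """Return (relation_id, problem) pairs for routes that look broken."""
    problems = []
    for rel in relations:
        if not rel.get("members"):
            problems.append((rel["id"], "route has no members"))
        if "ref" not in rel.get("tags", {}):
            problems.append((rel["id"], "route is missing ref=*"))
    return problems

report = check_routes([
    {"id": 1, "tags": {"ref": "22"}, "members": ["n1", "n2"]},
    {"id": 2, "tags": {}, "members": []},
])
# relation 2 is flagged twice: no members, and no ref=*
```

The output of a run like this would be the daily report the "control
center" publishes for everybody to see.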
2c. Big data analysis
We should not only collect more data, but also analyze and aggregate
some of it. Some data analysis software probably already exists which
can be used with our database more or less directly, but right now it
looks like we only care about editing and rendering maps (and sometimes
printing them too) - let's imagine some other outputs within the scope
of OSM!
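As a tiny illustration of "aggregate, not just edit": counting amenity
values over a batch of elements. A real analysis would stream a planet
extract with proper tooling; the element dicts below are invented for
the example.

```python
from collections import Counter

def amenity_histogram(elements):
    """Count how often each amenity=* value appears in the elements."""
    return Counter(e["tags"]["amenity"]
                   for e in elements if "amenity" in e.get("tags", {}))

hist = amenity_histogram([
    {"tags": {"amenity": "school"}},
    {"tags": {"amenity": "school"}},
    {"tags": {"highway": "residential"}},
    {"tags": {"amenity": "bench"}},
])
# hist counts schools twice, benches once; the highway is ignored
```

Even such a trivial aggregation, run over a whole country, is already an
"other output" than a rendered map.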
3. Tools for personalizing OSM data presentation
What we have now is 5 different map styles to choose from on the main
page, but people will need more and more personal styles (AKA "data
skins"). Rendering all of them on OSM servers is probably impossible,
but we can develop (just-in-time?) client-side rendering interfaces for
our database. It has to be easy to use, like switching visual layers on
and off and letting people choose the crazy colors, icons and
interesting areas in their browser just by clicking.
We can still serve some basic CSS skins/presets via our repository -
like a "map for developers" (I would like to see every POI, street lamp
and pipeline too) or an "interactive contour map for a quiz/web page"
(two of my friends were asking me about exactly this for their work!) -
but the rendering should be on their side. They can share some elements
to avoid duplication - P2P in the background can be of some use here,
or even "CloudMapping". We should use people's machines much more. I
still miss the Tiles@Home project, but that was still the old static
one-size-fits-all concept. In the next decade users can form a dynamic
cloud sharing common resources, while dynamically creating and
compiling personal "forks".
4. Redesigning some key tagging schemes
I think this will be one of the hardest things to change, but while tag
crafting is mostly a grassroots process, we need to rethink some schemes
in a more systematic way.
For example, amenity=school should really be landuse=school (if not
used just for the building), a landcover namespace should arise (so on
landuse=park we see green space only where there's actually grass, not
across the whole area), and maybe some nature/man_made tagging should
be replaced by a terrain namespace... What exactly should be
(re)designed from top to bottom is not the important part this time -
the point is that once you have the needed level of expertise, you can
make a new implementation better instead of just patching the original
one.
We also have a lot of detailed objects which are not always clearly
defined, and we should try a more "cascading" approach, like
"amenity=fast_food" => "amenity=food + amenity_food_type=fast_food" (or
something alike). That way we can have a "Here is food!" label without
forcing a mapper to make a distinction he's not really sure about.
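A data consumer could resolve such cascading tags with a simple
fallback: prefer the specific subtype's label, fall back to the generic
class. This is a Python sketch only, and the amenity_food_type key is
just the hypothetical tag from the example above.

```python
LABELS = {
    ("food", "fast_food"): "Fast food",
    ("food", None): "Here is food!",  # generic fallback label
}

def label_for(tags):
    """Pick the most specific label available for an element's tags."""
    group = tags.get("amenity")
    subtype = tags.get("amenity_food_type")
    # Try the exact (group, subtype) pair first, then the generic
    # (group, None) entry for elements tagged only with the class.
    return LABELS.get((group, subtype)) or LABELS.get((group, None))

specific = label_for({"amenity": "food", "amenity_food_type": "fast_food"})
generic = label_for({"amenity": "food"})
```

The nice property is that an element tagged only with the generic class
still gets a useful label instead of disappearing from the map.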
I expect there will be a strong reaction against using a "top-down
committee" methodology, but some well-known problems with our ontology
architecture will never go away if we try to change them tag-for-tag. Of
course that is true only for this class of problems - most new schemes
will still be best created ad hoc and then adopted by more and more
mappers.
5. Cooperation with external projects (especially open, like Wikidata)
I remember when Wikipedia was afraid of using maps from OSM, because it
was an external project. But if the license terms are not prohibitive
and you know the other community works with the same principles in mind
as your own, it would be a shame not to use their resources just because
we don't control them fully. Recently I also heard some voices about
using Wikidata in OSM, so for me it's the same story retold from the
other side. =}
OSM is a mapping/GIS community and is defined by this. It should not try
to reinvent everything, so as not to lose focus. If somebody wants to
try something different, she's free to join those "other" projects as
well! And all those projects can cooperate to share what can be shared.
***
Well, that is what comes to my mind when thinking about strategic
visions for the next few years. Most of the time I just try to scratch
my own little itch day by day, but after a few years in the project I
also have some long-term expectations and ideas. They may not be
accurate or - heaven forbid! - the best ones, but remember: it was YOU
who asked me about "big topics and challenges for OpenStreetMap"! ;-}}}
--
Piaseczno Miasto Wąskotorowe
_______________________________________________
talk mailing list
talk@openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk