Imre,
It is very good and surprising idea.
I discovered on the page "Error categories" a tool
https://www.keepright.at/ with the help of which I found already dozens
obviously misspelled tags. It functions quite intuitively, just select
"misspelled tags" check box and move the map to an area of interest.
Best regards,
Oleksiy
On 18.01.2018 15:48, Imre Samu wrote:
>What can I as a map editor do to keep these data files to a
reasonable size without compromising data quality?
According to the "Lean thinking" (
https://en.wikipedia.org/wiki/Lean_thinking ) we should focus on
" eliminating waste"
Waste is:
* Any polygon or tagging errors ( because we can't use this
information, and need lot of space or processing resources ) or any
from this: https://wiki.openstreetmap.org/wiki/Error_categories ; etc
* or any mapping errors ( bad street names ; routing problems: waste
for users )
* or any not "UpToDate" data/information ( old phone numbers - it
is useless, so it is waste )
some examples:
http://area.jochentopf.com/stats/
- Errors: Intersections
- Errors: Duplicate nodes
- Errors: Duplicate segments (*~160.000*)
- Errors: Open rings ( ~9.000)
- Errors: Inner rings with same tags as outer rings
- Errors: Wrong role ( *~ 700.000 *)
some key problems: ( unused/bad keys is a waste )
- https://taginfo.openstreetmap.org/reports/characters_in_keys#problem
( Keys with possibly problematic characters )
- https://taginfo.openstreetmap.org/reports/characters_in_keys#space
( Keys with whitespace )
- or my favorite:
--- https://taginfo.openstreetmap.org/keys/latitude#values
--- https://taginfo.openstreetmap.org/keys/LAT#values
And we have lot of low quality imports we should fix.
>What can I as a map editor
imho:
Any quality assurance work helps a lot:
https://wiki.openstreetmap.org/wiki/Quality_assurance
so fixing data problems in your area helps "eliminating waste" and
less waste is good for data size
Imre
2018-01-18 6:14 GMT+01:00 Oleksiy Muzalyev
<oleksiy.muzal...@bluewin.ch <mailto:oleksiy.muzal...@bluewin.ch>>:
Good morning,
I started to experiment with the OSM data [1] on a local computer,
and I begin to realize how big these data files are. It takes
quite a while to load into the local database just the data for
one country.
What can I as a map editor do to keep these data files to a
reasonable size without compromising data quality? I mean in the
sense, - take care of the pennies and the pounds will take care of
themselves?
I could think of the following three approaches so far:
- using as short an URL as possible,
website=http://somewebsite.com instead of
website=http://www.somewebsite.com <http://www.somewebsite.com> ,
three characters less; [2]
- correct phone number ISO format, phone=+12 345 678 90 12 instead
of phone=+12 (345) 678 90 12 , two characters less; [3]
- deleting unnecessary nodes from a way (Shift-Y in JOSM) with
consequent verification of its geometry;
What else, if anything, could be done?
[1] https://wiki.openstreetmap.org/wiki/Downloading_data
<https://wiki.openstreetmap.org/wiki/Downloading_data>
[2] https://wiki.openstreetmap.org/wiki/Key:website
<https://wiki.openstreetmap.org/wiki/Key:website>
[3] https://wiki.openstreetmap.org/wiki/Key:phone
<https://wiki.openstreetmap.org/wiki/Key:phone>
With best regards,
Oleksiy
osm: Alex-7
_______________________________________________
talk mailing list
talk@openstreetmap.org <mailto:talk@openstreetmap.org>
https://lists.openstreetmap.org/listinfo/talk
<https://lists.openstreetmap.org/listinfo/talk>
_______________________________________________
talk mailing list
talk@openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk