Re: [OSM-talk] How can we measure the completeness of the map statistically

2008-06-28 Thread Mark Williams
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

wer-ist-roger wrote:
> Am Donnerstag 26 Juni 2008 schrieb Frederik Ramm:
>>> * Another hypothesis is that more complete areas of OSM will  
>>> have a higher level of edit activity.
>> Then there are those who say that an area that's complete doesn't  
>> require any edit activity...
> 
> That is right that "complete" areas might not gany much edits (or maybe no 
> edits at all) but you can see that on the edit history you will get a 
> submit-curve that has a lot of edits within a timeperiod until you get at 
> thet point it's getting less end less till you have almost no edits at all.
> 
> So a calculation should allways include the history of submitted changes.
> 

Indeed, that curve of much editing -> less editing should be a factor in
the completeness measure - an area with much activity in the past + some
more recent would be good, an area with no edits at all for 2 years is
ripe for a check-over, an area with no history is clearly not good.
Landuse=field is perhaps an example of the exception that proves the rule?

I still think it would be useful for an overlay to exist for mappers to
mark up where they consider done, perhaps that & an auto-layer should
combine to make a more visible 3rd layer of completeness where they
agree? We could start by filling this public layer with coastline data &
mark the seas as done, at least - unless we want to get full marine data
in as well?

Mark
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFIZmF7JfMmcSPNh94RAj8AAJwKi2OWzcWKzHoKkBDSszfF73hrMQCcCtm6
oYevRaoNA00yrTvL7Qqnk3M=
=P35M
-END PGP SIGNATURE-


___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/talk


Re: [OSM-talk] How can we measure the completeness of the map statistically

2008-06-27 Thread Freek
On Friday 27 June 2008, Edward Johnson wrote:
> I'm very intrigued by your work Freek. 

Thanks :-)

> I'll get working on trying to reproduce something like this for the UK

If you can wait for a few days, I can make some for the UK too (busy finishing 
my exams a.t.m.).

> and then try to use the map 
> generated to compare with OSM. See if the hypothesis does hold that more
> complete areas are edited more. Because of course as you say a fully
> complete area should not be edited at all.

Difficult... In the Netherlands it might be easier because we have the AND 
data as a "background" to compare to: the number of edits in different areas 
can probably be compared by dividing them by the amount of AND data for the 
areas, which is again approximately what I'm doing for the blue-green-red 
maps. For other areas you really need another external source to compare to. 
Some ideas:
- VMAP0 has some city boundaries I remember (and cities also have a special 
colour on landsat imagery); data density in cities will be higher than in 
rural areas, on average.
- Some governments publish road-length statistics, someone used these to 
generate a top-10 of most complete municipalities before the AND import:
http://wiki.openstreetmap.org/index.php/WikiProject_Netherlands/Statistieken
Interestingly, these statistics generated some healthy competition among Dutch 
participants at the time.

I think we should just work out some of the different methods (from fully 
automated based on different sources to manual tagging) and see which work 
best. Probably manual and automated methods will just coexist in the future.

-- 
Freek

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/talk


Re: [OSM-talk] How can we measure the completeness of the map statistically

2008-06-27 Thread wer-ist-roger
Am Donnerstag 26 Juni 2008 schrieb Frederik Ramm:
> >     * Another hypothesis is that more complete areas of OSM will  
> > have a higher level of edit activity.
>
> Then there are those who say that an area that's complete doesn't  
> require any edit activity...

That is right that "complete" areas might not gany much edits (or maybe no 
edits at all) but you can see that on the edit history you will get a 
submit-curve that has a lot of edits within a timeperiod until you get at 
thet point it's getting less end less till you have almost no edits at all.

So a calculation should allways include the history of submitted changes.

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/talk


Re: [OSM-talk] How can we measure the completeness of the map statistically

2008-06-27 Thread Edward Johnson
> * Another hypothesis is that more complete areas of OSM will have a
> higher level of edit activity. If no-one has ever edited an area then it
> may be unlikely that the map is complete there, obviously however there may
> just be nothing there, so this test could be used in conjunction with the
> Yahoo! Imagery test stated before. If we could produce some sort of heat
> map showing which areas are edited most frequently and monitor it over time
> this could certainly show us some interesting trends.

I have done approximately this some time ago. See my post in an earlier thread 
on completeness measures:
http://lists.openstreetmap.org/pipermail/talk/2008-May/026284.html

In the meantime I created the images for the Netherlands weekly and put them 
in an animated GIF file:
http://www.vanwal.nl/osm/density/nl_500_080425-080619.gif

On the Dutch tile server we also added a layer showing recent changes, 
overlayed over aerial imagery, see
http://tile.openstreetmap.nl/~rubke/fietskaart/?zoom=9&lat=52.00&lon=5.07&layers=0B000FFTT
(Only zoom level 7-10; the placenames can be turned off in the layer menu.)

Although the amount of recent changes is not a measure for completeness, I 
think it shows nicely what we can do using automated image generation, and 
the method can easily be adapted to show different kinds of statistics.

-- 
Freek



I'm very intrigued by your work Freek. I'll get working on trying to reproduce 
something like this for the UK and then try to use the map generated to compare 
with OSM. See if the hypothesis does hold that more complete areas are edited 
more. Because of course as you say a fully complete area should not be edited 
at all.

Ed
http://edwardmjohnson.wordpress.com/

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/talk


Re: [OSM-talk] How can we measure the completeness of the map statistically

2008-06-26 Thread Frederik Ramm
Hi,

> * Another hypothesis is that more complete areas of OSM will  
> have a higher level of edit activity.

Then there are those who say that an area that's complete doesn't  
require any edit activity...

Bye
Frederik

-- 
Frederik Ramm  ##  eMail [EMAIL PROTECTED]  ##  N49°00'09" E008°23'33"




___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/talk


Re: [OSM-talk] How can we measure the completeness of the map statistically

2008-06-26 Thread Freek
On Thursday 26 June 2008, Edward Johnson wrote:
> * Another hypothesis is that more complete areas of OSM will have a
> higher level of edit activity. If no-one has ever edited an area then it
> may be unlikely that the map is complete there, obviously however there may
> just be nothing there, so this test could be used in conjunction with the
> Yahoo! Imagery test stated before. If we could produce some sort of heat
> map showing which areas are edited most frequently and monitor it over time
> this could certainly show us some interesting trends.

I have done approximately this some time ago. See my post in an earlier thread 
on completeness measures:
http://lists.openstreetmap.org/pipermail/talk/2008-May/026284.html

In the meantime I created the images for the Netherlands weekly and put them 
in an animated GIF file:
http://www.vanwal.nl/osm/density/nl_500_080425-080619.gif

On the Dutch tile server we also added a layer showing recent changes, 
overlayed over aerial imagery, see
http://tile.openstreetmap.nl/~rubke/fietskaart/?zoom=9&lat=52.00&lon=5.07&layers=0B000FFTT
(Only zoom level 7-10; the placenames can be turned off in the layer menu.)

Although the amount of recent changes is not a measure for completeness, I 
think it shows nicely what we can do using automated image generation, and 
the method can easily be adapted to show different kinds of statistics.

-- 
Freek

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/talk


[OSM-talk] How can we measure the completeness of the map statistically

2008-06-26 Thread Edward Johnson
I would like a way that we can say accurately how complete any area is, without 
relying on other users to have said its complete. Ive come up with a few 
hypotheses and posted them on my blog (http://edwardmjohnson.wordpress.com) of 
ways which we can start to do this. 

* A complete map will have all of the roads in an area, but we cannot 
obtain stats on the amount of road in every area we may wish to test in 
OpenStreetMap. So an accurate way to predict the length of road in any area 
would be useful. My hypothesis is that the length of road in a given area is 
dependent upon population density. That is to say I expect that urban areas 
will have less road per person than rural areas.

* For areas where OSM has aerial imagery I would like to compare the 
complexity (ie file size) of the Yahoo! jpeg and the corresponding OSM tiles. 
The hypothesis is that areas with very small aerial jpeg files (because they 
are simply one colour like the sea or vast expanses of desert) will have few if 
any entries on OSM, whereas areas with large file sizes (cities) will have a 
large density of nodes and ways in OSM and therefore large tile size. I do not 
have the technical knowledge to test this so any help would be great.

* Another hypothesis is that more complete areas of OSM will have a higher 
level of edit activity. If no-one has ever edited an area then it may be 
unlikely that the map is complete there, obviously however there may just be 
nothing there, so this test could be used in conjunction with the Yahoo! 
Imagery test stated before. If we could produce some sort of heat map showing 
which areas are edited most frequently and monitor it over time this could 
certainly show us some interesting trends.

* This is an attempt to solve the problem of missing roads. I would think 
it unlikely that there would a road which is completely cut off from others, or 
that there would be an entire settlement of roads not connected to rest of the 
country’s road network (as in the Madiera example shown below again). The 
hypothesis is that every road is connected to at least one other road of equal 
or higher classification. So if in OSM there are roads that are not, then maybe 
there are missing roads. This testing may require a lot of calculations and may 
not return that many missing roads. If some one can think of a way to do it 
simpy then I would very much like that input.

Any other ideas of how we can measure completeness statistically?

Ed
http://edwardmjohnson.wordpress.com

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/talk