Re: [OSM-talk] Multiple nodes for one country

2009-08-03 Thread Jon Burgess
On Mon, 2009-08-03 at 20:49 +0200, Peter Körner wrote:
> Peter Körner schrieb:
> > andrzej zaborowski schrieb:
> >> Hi Peter,
> >> I don't think anybody has a reason to object to merging them.  At
> >> least me and User:Mala have been merging some of these nodes last
> week
> >> and we got no blackmail so far :)  I believe we went through all
> the
> >> country nodes which didn't have a name:pl= or name:it= assigned yet
> so
> >> out of your list at most 15 or so countries should still remain
> >> duplicated.
> >>
> >> Cheers
> > 
> > The main problem is that I'm unable to produce an up-to-date list
> from 
> > database since I don't have the resources to import an up-to-date
> dump.
> > 
> > I'll try to process an up-to-date planet.osm tomorrow to generate
> an 
> > up-to-date list from it. It's a pity that cassini is not updated via
> the 
> > diffs right now.
> > 
> > At the next step I'll generate a list for each wikimedia-language 
> > containing all countries and their names in this language, so people
> can 
> > correct the locale names more easy. It seems you've already done
> this 
> > for pl and it -- I'm about to do it for de.
> > 
> > Peter
> > 
> 
> I threw an nearly up-to-date planet.osm against a simple 
> sax-parser-script in php and after it ran about 10 hours (such a 
> planet.osm is a really big thing) it produced this table:
> 
> http://toolserver.org/~mazder/duplicate-countries/from-planet.osm/
> 
> Looks much better! Still 26 countries are duplicated, but this could
> be 
> fixed manually, so I'll do that now.
> 
> On some time in the near future, when cassini holds an regularly
> updated 
> gis-database we'll be able to track such duplications at 
> http://toolserver.org/~mazder/duplicate-countries/

I obtained a list of the duplicate country nodes IDs from the Mapnik
rendering DB[1] and have downloaded & fixed all the duplicates[2].


Jon


1: SQL: gis=> select name,osm_id from planet_osm_point where
place='country' and name in (select name from planet_osm_point where
place='country' group by name having count(*) > 1) order by name,osm_id;

2: Two changesets. The first removes those automatically fixed by the
JOSM validator. The second picks up the few remaining ones which needed
merging.
http://www.openstreetmap.org/browse/changeset/2028815
http://www.openstreetmap.org/browse/changeset/2029180




___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Multiple nodes for one country

2009-08-03 Thread Peter Körner
Peter Körner schrieb:
> andrzej zaborowski schrieb:
>> Hi Peter,
>> I don't think anybody has a reason to object to merging them.  At
>> least me and User:Mala have been merging some of these nodes last week
>> and we got no blackmail so far :)  I believe we went through all the
>> country nodes which didn't have a name:pl= or name:it= assigned yet so
>> out of your list at most 15 or so countries should still remain
>> duplicated.
>>
>> Cheers
> 
> The main problem is that I'm unable to produce an up-to-date list from 
> database since I don't have the resources to import an up-to-date dump.
> 
> I'll try to process an up-to-date planet.osm tomorrow to generate an 
> up-to-date list from it. It's a pity that cassini is not updated via the 
> diffs right now.
> 
> At the next step I'll generate a list for each wikimedia-language 
> containing all countries and their names in this language, so people can 
> correct the locale names more easy. It seems you've already done this 
> for pl and it -- I'm about to do it for de.
> 
> Peter
> 

I threw an nearly up-to-date planet.osm against a simple 
sax-parser-script in php and after it ran about 10 hours (such a 
planet.osm is a really big thing) it produced this table:

http://toolserver.org/~mazder/duplicate-countries/from-planet.osm/

Looks much better! Still 26 countries are duplicated, but this could be 
fixed manually, so I'll do that now.

On some time in the near future, when cassini holds an regularly updated 
gis-database we'll be able to track such duplications at 
http://toolserver.org/~mazder/duplicate-countries/

Peter

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Multiple nodes for one country

2009-08-02 Thread andrzej zaborowski
On 02/08/2009, Peter Koerner  wrote:
> andrzej zaborowski schrieb:
>> Hi Peter,
>> I don't think anybody has a reason to object to merging them.  At
>> least me and User:Mala have been merging some of these nodes last week
>> and we got no blackmail so far :)  I believe we went through all the
>> country nodes which didn't have a name:pl= or name:it= assigned yet so
>> out of your list at most 15 or so countries should still remain
>> duplicated.
>>
>> Cheers
>
> The main problem is that I'm unable to produce an up-to-date list from
> database since I don't have the resources to import an up-to-date dump.
>
> I'll try to process an up-to-date planet.osm tomorrow to generate an
> up-to-date list from it. It's a pity that cassini is not updated via the
> diffs right now.

Oh, now that I think of it, it might be possible to just load the last
month's list of nodes as received from XAPI into JOSM and tell it to
"Update data" and it might be able to make a couple of group requests
for the current versions of these nodes.  If this doesn't work then it
might work if you add  elements at the top of the xml file for
each country node.

Cheers

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Multiple nodes for one country

2009-08-02 Thread Lennard
andrzej zaborowski wrote:

> out of your list at most 15 or so countries should still remain
> duplicated.

Just fixed The Netherlands, and clicked on some other countries, which 
were all fixed already. I didn't have the heart to go through the whole 
list, since a bot could easily do that as well.

-- 
Lennard

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Multiple nodes for one country

2009-08-02 Thread Peter Körner
andrzej zaborowski schrieb:
> Hi Peter,
> I don't think anybody has a reason to object to merging them.  At
> least me and User:Mala have been merging some of these nodes last week
> and we got no blackmail so far :)  I believe we went through all the
> country nodes which didn't have a name:pl= or name:it= assigned yet so
> out of your list at most 15 or so countries should still remain
> duplicated.
> 
> Cheers

The main problem is that I'm unable to produce an up-to-date list from 
database since I don't have the resources to import an up-to-date dump.

I'll try to process an up-to-date planet.osm tomorrow to generate an 
up-to-date list from it. It's a pity that cassini is not updated via the 
diffs right now.

At the next step I'll generate a list for each wikimedia-language 
containing all countries and their names in this language, so people can 
correct the locale names more easy. It seems you've already done this 
for pl and it -- I'm about to do it for de.

Peter



___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Multiple nodes for one country

2009-08-02 Thread andrzej zaborowski
Hi Peter,
I don't think anybody has a reason to object to merging them.  At
least me and User:Mala have been merging some of these nodes last week
and we got no blackmail so far :)  I believe we went through all the
country nodes which didn't have a name:pl= or name:it= assigned yet so
out of your list at most 15 or so countries should still remain
duplicated.

Cheers

On 02/08/2009, Peter Körner  wrote:
> Pieren schrieb:
>> On Sun, Aug 2, 2009 at 12:10 PM, Peter Körner wrote:
>>> I noticed that for some countries there seems to be more than one node.
>>> E.g. for Slovakia there are 5:
>>> http://www.openstreetmap.org/browse/node/424313572
>>> http://www.openstreetmap.org/browse/node/432425079
>>> http://www.openstreetmap.org/browse/node/424315420
>>> http://www.openstreetmap.org/browse/node/243851695
>>> http://www.openstreetmap.org/browse/node/424310798
>>>
>>> all at the same coordinate, most of them with the same names. Is this
>>> intended or just a mistake? Would it be ok to build a bot that correct
>>> this?
>>>
>>> Peter
>>
>> Read this:
>> http://lists.openstreetmap.org/pipermail/talk/2009-July/038931.html
>>
>> It seems that one person is not able to revert all the crap he created.
>>
>> Pieren
>
> Ok, so here is an index of all duplicated countries:
> http://toolserver.org/~mazder/duplicate-countries/
>
> This is *not* the current state of osm, it's of 2009-07-05 which is the
> age of the postgis-database on cassini, the wikimedia osm-toolserver.
>
> It would be easy to extend the script generating this list, so that it
> merges those nodes into a single one. I just want to ask if that would
> be ok to everybody. If there is no contradiction until Monday, 10th
> August, I'll do so.
>
> Peter
>
> ___
> talk mailing list
> talk@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/talk
>

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Multiple nodes for one country

2009-08-02 Thread Peter Körner
Pieren schrieb:
> On Sun, Aug 2, 2009 at 12:10 PM, Peter Körner wrote:
>> I noticed that for some countries there seems to be more than one node.
>> E.g. for Slovakia there are 5:
>> http://www.openstreetmap.org/browse/node/424313572
>> http://www.openstreetmap.org/browse/node/432425079
>> http://www.openstreetmap.org/browse/node/424315420
>> http://www.openstreetmap.org/browse/node/243851695
>> http://www.openstreetmap.org/browse/node/424310798
>>
>> all at the same coordinate, most of them with the same names. Is this
>> intended or just a mistake? Would it be ok to build a bot that correct this?
>>
>> Peter
> 
> Read this: http://lists.openstreetmap.org/pipermail/talk/2009-July/038931.html
> 
> It seems that one person is not able to revert all the crap he created.
> 
> Pieren

Ok, so here is an index of all duplicated countries:
http://toolserver.org/~mazder/duplicate-countries/

This is *not* the current state of osm, it's of 2009-07-05 which is the 
age of the postgis-database on cassini, the wikimedia osm-toolserver.

It would be easy to extend the script generating this list, so that it 
merges those nodes into a single one. I just want to ask if that would 
be ok to everybody. If there is no contradiction until Monday, 10th 
August, I'll do so.

Peter

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Multiple nodes for one country

2009-08-02 Thread Pieren
On Sun, Aug 2, 2009 at 12:10 PM, Peter Körner wrote:
> I noticed that for some countries there seems to be more than one node.
> E.g. for Slovakia there are 5:
> http://www.openstreetmap.org/browse/node/424313572
> http://www.openstreetmap.org/browse/node/432425079
> http://www.openstreetmap.org/browse/node/424315420
> http://www.openstreetmap.org/browse/node/243851695
> http://www.openstreetmap.org/browse/node/424310798
>
> all at the same coordinate, most of them with the same names. Is this
> intended or just a mistake? Would it be ok to build a bot that correct this?
>
> Peter

Read this: http://lists.openstreetmap.org/pipermail/talk/2009-July/038931.html

It seems that one person is not able to revert all the crap he created.

Pieren

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk