[Tagging] Mechanical Edit: fix japanese train stations wikipedia/names fields

2012-10-27 Thread Fabien SK
Hi,

Some time ago, I talked about a proposal to complete the wikipedia tag
for the japanese train stations.
I described it on this OSM wiki page:

http://wiki.openstreetmap.org/wiki/Mechanical_Edits/JaTrainWikipedia

I get feedback from the local mappers and they seems pleased about it.
My Python scripts seem to work, but before I perform any modification, I
would like some technical advises about committing the data. There will
be 8000 modified nodes, and I wonder how many I should put in an single
edit. All of them? One hundred?

I put sample OSM files with an arbitrary number of 70 stations per file
there:

http://fabsk.eu/osm/

There is also a log file giving the reason for updating or not updating
the stations.

Cheers,
Fabien

___
Tagging mailing list
Tagging@openstreetmap.org
http://lists.openstreetmap.org/listinfo/tagging


[Tagging] Mechanical Edit: fix japanese train stations wikipedia/names fields

2012-10-14 Thread Fabien SK
Hi everyone,

I intend to write a script to complete the information on the japanese
train stations nodes. It would
- add the «wikipedia» tag if it does not exist
- fix the «wikipedia» tags with outdated format (for example:
wikipedia:ja = http://...)
- complete the names tags from the existing values
So any comment would be welcome.

So I think that I would do it like that:
- I create an OSM account for this task
- I get a recent dump for Japan
- Using osmosis, I extract all the nodes having railway=station (about
9000 nodes)
- My script will filter the nodes having incomplete information
- It will process the filtered nodes list by batches of X nodes (where X
is a reasonable number for a commit). I could try to create batches
containing nodes in the same area
- it will retrieve the latest version of each node (by id) using the API
- If the Wikipedia link is missing, it will download the jp WP page
(easy, the page name is [station name] + «eki»). If it cannot be
retrieved, or if I cannot be sure that the page is a not disambiguation
page, I give up
- it could check if the coordinates in the WP page (if present) are
about the same than the node. If not, wrong station, I give up
- if the wikipedia tag is wrong, it can fix it (if the value is an URL,
it can set it to the page name)
- it will complete the names tags if needed: «name:en» and «name:ja_rm»
have the same value. They can also be deduced from «ja_kana» (but it's
not always perfect), and vice versa. «name» can be set with the value
«name:ja (name:ja_rm)» if these two tags are present.
- the modified nodes of a batch will be put in an XML. I could review
them in JOSM before submitting them.

I have no experience when it comes to mechanical edit in OSM, so any
comment is welcome to make it safe, for the both the servers and the
existing data.
I don't know if it is technically necessary to split the commit in
batches. But I thought that it would be nice for manual reviews.
My japanese is too bad to write on the jp mailing list :-)

Cheers,
Fabien

___
Tagging mailing list
Tagging@openstreetmap.org
http://lists.openstreetmap.org/listinfo/tagging


Re: [Tagging] Mechanical Edit: fix japanese train stations wikipedia/names fields

2012-10-14 Thread Chris Hill

On 14/10/12 17:09, Fabien SK wrote:

[snip]

My japanese is too bad to write on the jp mailing list :-)



If you can't ask the local mappers what they think about your mechanical 
edit then I would suggest that you don't do it.


--
Cheers, Chris
user: chillly


___
Tagging mailing list
Tagging@openstreetmap.org
http://lists.openstreetmap.org/listinfo/tagging


Re: [Tagging] Mechanical Edit: fix japanese train stations wikipedia/names fields

2012-10-14 Thread Fabien SK
Le 14/10/2012 18:16, Chris Hill a écrit :
 On 14/10/12 17:09, Fabien SK wrote:

 [snip]
 My japanese is too bad to write on the jp mailing list :-)


 If you can't ask the local mappers what they think about your
 mechanical edit then I would suggest that you don't do it.

Satoshi, a japanese mapper, proposed me to forward the discussion to the
Japanese list, where people could help me even if I write in english.
If there is a consensus is reached there, I will come back on this list
for some technical assistance :-)


___
Tagging mailing list
Tagging@openstreetmap.org
http://lists.openstreetmap.org/listinfo/tagging


Re: [Tagging] Mechanical Edit: fix japanese train stations wikipedia/names fields

2012-10-14 Thread Michael Kugelmann

Am 14.10.2012 18:09, schrieb Fabien SK:

- I create an OSM account for this task

highly appreciated.
And discussing it with the Japanese community as already mentioned is a 
very good idea.



Best regards,
Michael.

___
Tagging mailing list
Tagging@openstreetmap.org
http://lists.openstreetmap.org/listinfo/tagging