Re: [OSM-talk] Changing capitalization (Lima)

2012-06-02 Thread john whelan
Igor added a export-tag command to Maperitive that creates a local CSV file
that can be brought into a spreadsheet.  You need a local copy of the .OSM
file or possibly .pbf version should work as well.  It's very useful for
spotting tags that are misspelt.  Once there you can concat red tape to
create xml commands to get JOSM to upload the changes.  I've a couple of
sample VB programs that run done a local file and make changes, a sort of
off line bot if you like but I wouldn't like them to escape into general
use there are too many bots out there at the moment.

Cheerio John

On 1 June 2012 22:05, Jaakko Helleranta.com  wrote:

> (Added HOT to the list as the issue here is probably of interest In the
> areas where HOT is active.)
>
> Andrzej,
> What was your process / what tools did you use?
>
> In general:
> I've dreamed of solutions to fix not only capitalization errors / fuckups
> but also various other typing errors / mistakes in names to use especvially
> in Haiti where I've spent countless hours trying to combat things that
> various helping hands are creating on a steady basis.
>
> My dream solution would be a simple(?) export-import tool that would make
> it possible to:
> 1) Export desired tag values (1 at a time) to csv /
> your_favorite_spreadsheet_format
> 2) Fix the typos/etc errors in that give data
> 3) Import/Upload the fixed data back to OSM.
>
> As Alex originally questioned/thought_aloud fixing of names _could_ happen
> simply by editing the xml file manually -- but you'd need to always
> remember to add the "edited" tag into the right place for the edit to work
> / get registered in the data upload.
> .. And such data manipulation is simply _so_ much easier in a spreadsheet
> application -- and can be outsourced to people who understand nothing about
> geo/tech tools but who know how to fix CapiTAliZaTi0n errors / typos.
>
> The additional reason I'd _love_ to have such tool is that it could make
> it possible to create a simple list of eg all schools, hospitals,
> restaurants, what_not within a given osm file.
>
> .. Especially convenient would be if the developer of such tool would make
> it possible to export / list more than just one field into the
> csv/spreadsheet file; say, name, address, phone, email, website,
> description, etc.
>
> Adding and updating such attribute data would imo be _so_ much easier /
> more efficient in a spreadsheet than one-by-one in JOSM/Potlacth/etc.
>
> If someone would have a Kickstarter/etc project to do this (or can point
> me to an existing such solution) I'd be very much willing to contribute,
> say, $100-200 depending on the richness of features of such tool.
>
> Cheers,
> -Jaakko
>
> Sent from my BlackBerry® device from Digicel
> --
> Mobile: +509-37-26 91 54, Skype/GoogleTalk: jhelleranta
>
> -Original Message-
> From: andrzej zaborowski 
> Date: Fri, 1 Jun 2012 16:34:21
> To: Alex Barth
> Cc: openstreetmap; 
> Subject: Re: [OSM-talk] Changing capitalization (Lima)
>
> On 31 May 2012 17:39, Alex Barth  wrote:
> > We're currently working with Ruben (user Rub21) on fixing street name
> capitalization in Lima - a lot of the street names are ALL CAPS where they
> should be properly capitalized. We're doing this work manually right now
> and are well under way. It's quite time intensive though - any examples of
> where such a cleanup process has been automated on OSM before?
> >
>
> I ran such a process on the POI names in Girona that were imported
> just before the SOTM'10.  Accents were correct already and python
> dealt with them correctly.  The only special cases were some
> prepositions that are written in lower case and the Catalan use of
> apostrophe.
>
> http://www.openstreetmap.org/browse/changeset/5073672 is one of the
> changesets.
>
> Cheers
>
> ___
> talk mailing list
> talk@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/talk
> ___
> talk mailing list
> talk@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/talk
>
___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-06-01 Thread Jaakko Helleranta.com
(Added HOT to the list as the issue here is probably of interest In the areas 
where HOT is active.)

Andrzej,
What was your process / what tools did you use?

In general:
I've dreamed of solutions to fix not only capitalization errors / fuckups but 
also various other typing errors / mistakes in names to use especvially in 
Haiti where I've spent countless hours trying to combat things that various 
helping hands are creating on a steady basis.

My dream solution would be a simple(?) export-import tool that would make it 
possible to:
1) Export desired tag values (1 at a time) to csv / 
your_favorite_spreadsheet_format
2) Fix the typos/etc errors in that give data
3) Import/Upload the fixed data back to OSM.

As Alex originally questioned/thought_aloud fixing of names _could_ happen 
simply by editing the xml file manually -- but you'd need to always remember to 
add the "edited" tag into the right place for the edit to work / get registered 
in the data upload.
.. And such data manipulation is simply _so_ much easier in a spreadsheet 
application -- and can be outsourced to people who understand nothing about 
geo/tech tools but who know how to fix CapiTAliZaTi0n errors / typos.

The additional reason I'd _love_ to have such tool is that it could make it 
possible to create a simple list of eg all schools, hospitals, restaurants, 
what_not within a given osm file. 

.. Especially convenient would be if the developer of such tool would make it 
possible to export / list more than just one field into the csv/spreadsheet 
file; say, name, address, phone, email, website, description, etc.

Adding and updating such attribute data would imo be _so_ much easier / more 
efficient in a spreadsheet than one-by-one in JOSM/Potlacth/etc.

If someone would have a Kickstarter/etc project to do this (or can point me to 
an existing such solution) I'd be very much willing to contribute, say, 
$100-200 depending on the richness of features of such tool.

Cheers,
-Jaakko

Sent from my BlackBerry® device from Digicel
--
Mobile: +509-37-26 91 54, Skype/GoogleTalk: jhelleranta

-Original Message-
From: andrzej zaborowski 
Date: Fri, 1 Jun 2012 16:34:21 
To: Alex Barth
Cc: openstreetmap; 
Subject: Re: [OSM-talk] Changing capitalization (Lima)

On 31 May 2012 17:39, Alex Barth  wrote:
> We're currently working with Ruben (user Rub21) on fixing street name 
> capitalization in Lima - a lot of the street names are ALL CAPS where they 
> should be properly capitalized. We're doing this work manually right now and 
> are well under way. It's quite time intensive though - any examples of where 
> such a cleanup process has been automated on OSM before?
>

I ran such a process on the POI names in Girona that were imported
just before the SOTM'10.  Accents were correct already and python
dealt with them correctly.  The only special cases were some
prepositions that are written in lower case and the Catalan use of
apostrophe.

http://www.openstreetmap.org/browse/changeset/5073672 is one of the changesets.

Cheers

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk
___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-06-01 Thread Alex Barth
Wow, thanks so much - I'm running out of time right now and can't look at the 
file. How could I prep such a file myself?

On May 31, 2012, at 3:13 PM, Worst Fixer wrote:

> Hello.
> 
> 2012/5/31 Alex Barth :
>> We're currently working with Ruben (user Rub21) on fixing street name 
>> capitalization in Lima - a lot of the street names are ALL CAPS where they 
>> should be properly capitalized. We're doing this work manually right now and 
>> are well under way. It's quite time intensive though - any examples of where 
>> such a cleanup process has been automated on OSM before?
> 
> I appreciate your fight with bad tags and bad imports.
> 
> I prepared a file with edits for highways in Lima area.
> 
> http://worstfixer.000a.biz/y2-lima-hw/edits-to-upload.osm.gz
> 
> Overview:
> http://worstfixer.000a.biz/y2-lima-hw/overview-short.html
> http://worstfixer.000a.biz/y2-lima-hw/overview-full.html
> 
> Ich also deleted source= if it is the only tag on nodes, and removed
> single addr:street instance when it was equal to name tag.
> 
> If you notice some big flaw in my case change algoritm, mail me
> privately. Ich will re-make file for you.
> 
> Please review it and upload. If you want to upload it without complete
> review, please follow Mechanical Edits Policy:
> 
> http://wiki.openstreetmap.org/wiki/Mechanical_Edit_Policy
> 
>> tel (202) 250-3633
> 
> This ist world wide international list. Write phone numbers in
> international format.
> 
> -- 
> WorstFixer, twitter: http://twitter.com/WorstFixer

Alex Barth
http://twitter.com/lxbarth
tel (202) 250-3633




___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-06-01 Thread andrzej zaborowski
On 31 May 2012 17:39, Alex Barth  wrote:
> We're currently working with Ruben (user Rub21) on fixing street name 
> capitalization in Lima - a lot of the street names are ALL CAPS where they 
> should be properly capitalized. We're doing this work manually right now and 
> are well under way. It's quite time intensive though - any examples of where 
> such a cleanup process has been automated on OSM before?
>

I ran such a process on the POI names in Girona that were imported
just before the SOTM'10.  Accents were correct already and python
dealt with them correctly.  The only special cases were some
prepositions that are written in lower case and the Catalan use of
apostrophe.

http://www.openstreetmap.org/browse/changeset/5073672 is one of the changesets.

Cheers

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-06-01 Thread Phil Endecott

Steve Doerr wrote:

On 31/05/2012 20:13, Worst Fixer wrote:


If you notice some big flaw in my case change algoritm, mail me privately.


I'd only comment that a really intelligent de-capitalization algorithm 
would attempt to supply the accents that are missing from the 
capitalized forms.


I had to do this for a (non-OSM) Canadian dataset where the English
placenames were in mixed-case but the French (Quebecois) were in ALL
CAPS - presumably done deliberately to avoid the issue of accents.

The best approach is probably to use a dictionary generated from other
placenames.  In the case of OSM we already have a great source of such data.

I could probably do this for you, or at least provide a mapping table
that you could combine with your current conversion script.  But not
until next week.  Let me know if you're interested.


Regards,  Phil.








___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-05-31 Thread Steve Doerr

On 31/05/2012 20:13, Worst Fixer wrote:


If you notice some big flaw in my case change algoritm, mail me privately.


I'd only comment that a really intelligent de-capitalization algorithm 
would attempt to supply the accents that are missing from the 
capitalized forms. Some languages that use accents tolerate omitting 
them from capital letters but not from lower-case letters. Maybe a 
better algorithm could be developed by a Spanish native-speaker. 
Admittedly, accents could be supplied at a later date.


--
Steve

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-05-31 Thread Worst Fixer
Hello.

2012/5/31 Alex Barth :
> We're currently working with Ruben (user Rub21) on fixing street name 
> capitalization in Lima - a lot of the street names are ALL CAPS where they 
> should be properly capitalized. We're doing this work manually right now and 
> are well under way. It's quite time intensive though - any examples of where 
> such a cleanup process has been automated on OSM before?

I appreciate your fight with bad tags and bad imports.

I prepared a file with edits for highways in Lima area.

http://worstfixer.000a.biz/y2-lima-hw/edits-to-upload.osm.gz

Overview:
http://worstfixer.000a.biz/y2-lima-hw/overview-short.html
http://worstfixer.000a.biz/y2-lima-hw/overview-full.html

Ich also deleted source= if it is the only tag on nodes, and removed
single addr:street instance when it was equal to name tag.

If you notice some big flaw in my case change algoritm, mail me
privately. Ich will re-make file for you.

Please review it and upload. If you want to upload it without complete
review, please follow Mechanical Edits Policy:

http://wiki.openstreetmap.org/wiki/Mechanical_Edit_Policy

> tel (202) 250-3633

This ist world wide international list. Write phone numbers in
international format.

-- 
WorstFixer, twitter: http://twitter.com/WorstFixer

___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-05-31 Thread Alex Barth

Maarten -

What editor (plugin?) do you suggest for this procedure? Or would you just edit 
the OSM file with a text editor? If I do that I'm missing change set entries, 
no?

On May 31, 2012, at 12:03 PM, Maarten Deen wrote:

> On 31-5-2012 17:39, Alex Barth wrote:
>> We're currently working with Ruben (user Rub21) on fixing street name 
>> capitalization in Lima - a lot of the street names are ALL CAPS where they 
>> should be properly capitalized. We're doing this work manually right now and 
>> are well under way. It's quite time intensive though - any examples of where 
>> such a cleanup process has been automated on OSM before?
> 
> How difficult is it to download an area to an OSM file, check for name= CAPS>, change these to proper caps and upload the file?
> 
> I'd say no more than an hours work.
> 
> Regards,
> Maarten
> 
> 
> ___
> talk mailing list
> talk@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/talk

Alex Barth
http://twitter.com/lxbarth
tel (202) 250-3633




___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-05-31 Thread Alex Barth

Thanks, Andy -

We're in the process of fixing some of these issues, too. There's been a pretty 
botched import in Lima a while ago.

On May 31, 2012, at 12:10 PM, SomeoneElse wrote:

> Alex Barth wrote:
>> We're currently working with Ruben (user Rub21) on fixing street name 
>> capitalization in Lima - a lot of the street names are ALL CAPS where they 
>> should be properly capitalized.
> 
> Ye gods.
> 
> Looking at that patch of Lima it appears that name capitalisation is the 
> least of your worries.  Perhaps a starting point might be a few NE-SW and 
> SE-NW GPS traces (to help align the imagery), and a quick on-the-ground 
> survey of some of the obviously problematical areas (e.g. way 111912131, 
> which is unconnected to anything, and on the Bing imagery is a football 
> field).
> 
> That should give you an idea of the positional validity of what's there - 
> there's no point in wasting time manually correcting "ROAD A" to "Road A" if 
> Road A doesn't actually exist any more.
> 
> Obviously the quick survey to get GPS traces available could pinpoint some 
> key POIs too.
> 
> Cheers,
> Andy
> 
> 
> ___
> talk mailing list
> talk@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/talk

Alex Barth
http://twitter.com/lxbarth
tel (202) 250-3633




___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-05-31 Thread SomeoneElse

Alex Barth wrote:

We're currently working with Ruben (user Rub21) on fixing street name 
capitalization in Lima - a lot of the street names are ALL CAPS where they 
should be properly capitalized.


Ye gods.

Looking at that patch of Lima it appears that name capitalisation is the 
least of your worries.  Perhaps a starting point might be a few NE-SW 
and SE-NW GPS traces (to help align the imagery), and a quick 
on-the-ground survey of some of the obviously problematical areas (e.g. 
way 111912131, which is unconnected to anything, and on the Bing imagery 
is a football field).


That should give you an idea of the positional validity of what's there 
- there's no point in wasting time manually correcting "ROAD A" to "Road 
A" if Road A doesn't actually exist any more.


Obviously the quick survey to get GPS traces available could pinpoint 
some key POIs too.


Cheers,
Andy


___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


Re: [OSM-talk] Changing capitalization (Lima)

2012-05-31 Thread Maarten Deen

On 31-5-2012 17:39, Alex Barth wrote:

We're currently working with Ruben (user Rub21) on fixing street name 
capitalization in Lima - a lot of the street names are ALL CAPS where they 
should be properly capitalized. We're doing this work manually right now and 
are well under way. It's quite time intensive though - any examples of where 
such a cleanup process has been automated on OSM before?


How difficult is it to download an area to an OSM file, check for 
name=, change these to proper caps and upload the file?


I'd say no more than an hours work.

Regards,
Maarten


___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk


[OSM-talk] Changing capitalization (Lima)

2012-05-31 Thread Alex Barth
We're currently working with Ruben (user Rub21) on fixing street name 
capitalization in Lima - a lot of the street names are ALL CAPS where they 
should be properly capitalized. We're doing this work manually right now and 
are well under way. It's quite time intensive though - any examples of where 
such a cleanup process has been automated on OSM before?

Example all caps in Lima:
http://osm.org/go/NNcgaXMQ
https://skitch.com/alexbarth/874nu/openstreetmap

Alex Barth
http://twitter.com/lxbarth
tel (202) 250-3633




___
talk mailing list
talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk