Re: [Talk-us] MassGIS Building Import - process

2012-12-16 Thread Serge Wroclawski
On Thu, Dec 13, 2012 at 11:12 PM, Jason Remillard
remillard.ja...@gmail.com wrote:
 Hi,

 I update the wiki

 http://wiki.openstreetmap.org/wiki/MassGIS_Buildings_Import

Jason,

While I think it's great you want to help OSM in this way, I have a
few concerns.

I think it's great that you've documented the process, but even after
following a lot of links that point to other links that point to
Google Docs, I can't find the scripts you use to do the conversion. If
those script are available at all, they're hard to find and review.

And the same goes for the timeframe. It seems you documented this on
Friday and then ran it on Sunday? I don't think I'm the only one who
sometimes takes email vacations on the weekend, and I didn't feel I
had time to look over any of the process. I suspect others may feel
the same way.

- Serge

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-16 Thread Jason Remillard
Hi,

The script has been getting move around a bit as switched over to
having all of the OSM files in one zip file. I put the script in the
google docs directory, outside of the big zip.

Thanks
Jason.

On Sun, Dec 16, 2012 at 7:41 PM, Serge Wroclawski emac...@gmail.com wrote:
 On Thu, Dec 13, 2012 at 11:12 PM, Jason Remillard
 remillard.ja...@gmail.com wrote:
 Hi,

 I update the wiki

 http://wiki.openstreetmap.org/wiki/MassGIS_Buildings_Import

 Jason,

 While I think it's great you want to help OSM in this way, I have a
 few concerns.

 I think it's great that you've documented the process, but even after
 following a lot of links that point to other links that point to
 Google Docs, I can't find the scripts you use to do the conversion. If
 those script are available at all, they're hard to find and review.

 And the same goes for the timeframe. It seems you documented this on
 Friday and then ran it on Sunday? I don't think I'm the only one who
 sometimes takes email vacations on the weekend, and I didn't feel I
 had time to look over any of the process. I suspect others may feel
 the same way.

 - Serge

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread nicholas ingalls
If anyone has a problem relating to this I don't mind fixing it. The way to
do it is simply to download all the data within the bounding box, then use
either the search or the authors dialogue to select all the data uploaded
by that particular user and remove. Then the data can be re-uploaded :)

Cheers,
ingalls


On Sat, Dec 15, 2012 at 1:18 AM, Paul Norman penor...@mac.com wrote:

 I notice that several of you are using uploads that involve 50k nodes per
 changeset.

 ** **

 What is your revert plan if one of these long extended uploads gets
 interrupted? The JOSM reverter plugin will not handle this correctly.

 ** **

 *From:* Jason Remillard [mailto:remillard.ja...@gmail.com]
 *Sent:* Thursday, December 13, 2012 8:12 PM
 *To:* talk-us@openstreetmap.org; impo...@openstreetmap.org
 *Subject:* [Talk-us] MassGIS Building Import - process

 ** **

 Hi,

  

 I update the wiki 

  

 http://wiki.openstreetmap.org/wiki/MassGIS_Buildings_Import

  

 It includes all of the emails from tonight. 

  

 - The import account will be restricted to directly uploading the data.
 Any post fixes should be done with the normal account. 

 - There will be a new upload tomorrow with simplified data, and the full
 massgis data osm replaced with an overlapping osm. This will be saved away
 for later incase anybody wants to do merges after this project is done. I
 will do Groton for sure

  

 Only two remaining issue 

  

 - If we should restrict manual importing to a small group of people. I
 have gotten several feedback messages that letting everybody do it has ***
 *

 turned out not to be a great idea on other imports. We are up to 5 people
 now, all of them have zillion changes to the map. This is not a problem
 yet. Honestly, I am not sure if people will be beating down the doors to
 help or not...

  

 - How hard do we work on the manual imports. Do we stop doing them if the
 data looks really good, and just run the script, or do we keep doing the
 manual imports until we tire of it.  Running the script sooner, gets the
 map better quicker, and lighter OSM users can probably handle cleaning up
 the small 1/500 errors.

  

 After this, I think we are ready to roll this up and get to work.

  

 Thanks

 Jason.

 ___
 Talk-us mailing list
 Talk-us@openstreetmap.org
 http://lists.openstreetmap.org/listinfo/talk-us


___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread Frederik Ramm

Hi,

On 14.12.2012 05:12, Jason Remillard wrote:

- How hard do we work on the manual imports. Do we stop doing them if
the data looks really good, and just run the script, or do we keep doing
the manual imports until we tire of it.  Running the script sooner, gets
the map better quicker, and lighter OSM users can probably handle
cleaning up the small 1/500 errors.


My usual lament:

1. The map is not better just because more is on it.

2. If you cannot be bothered to fix problems then why should others be? 
What is your plan for growing the community to a point where it can 
maintain the data you plan to dump onto OSM?


It is a common scenario to see importers aim for a quick win (look how 
much better we've made the map) and then let others clean up the mess. 
In programming, this is called technical debt - in order to be able to 
present nice results quickly you add some problems to your code which 
have to be cleaned out later. Sometimes there's a good reason for 
incurring such debt (e.g. because a deadline forces you to presents 
results quickly no matter how rotten your code is on the inside).


What is your external deadline that would prevent you from doing things 
right?


Bye
Frederik

--
Frederik Ramm  ##  eMail frede...@remote.org  ##  N49°00'09 E008°23'33

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread Jeff Meyer
Paul - I've added a few comments and questions about changeset size and
revert policies on the Import Guidelines  Plan Outline wiki pages.

Are there any recommended changeset size limits and/or revert plan
practices?

- Jeff


On Sat, Dec 15, 2012 at 6:43 AM, nicholas ingalls 
nicholas.inga...@gmail.com wrote:

 If anyone has a problem relating to this I don't mind fixing it. The way
 to do it is simply to download all the data within the bounding box, then
 use either the search or the authors dialogue to select all the data
 uploaded by that particular user and remove. Then the data can be
 re-uploaded :)

 Cheers,
 ingalls


 On Sat, Dec 15, 2012 at 1:18 AM, Paul Norman penor...@mac.com wrote:

 I notice that several of you are using uploads that involve 50k nodes per
 changeset.

 ** **

 What is your revert plan if one of these long extended uploads gets
 interrupted? The JOSM reverter plugin will not handle this correctly.

 ** **

 *From:* Jason Remillard [mailto:remillard.ja...@gmail.com]
 *Sent:* Thursday, December 13, 2012 8:12 PM
 *To:* talk-us@openstreetmap.org; impo...@openstreetmap.org
 *Subject:* [Talk-us] MassGIS Building Import - process

 ** **

 Hi,

  

 I update the wiki 

  

 http://wiki.openstreetmap.org/wiki/MassGIS_Buildings_Import

  

 It includes all of the emails from tonight. 

  

 - The import account will be restricted to directly uploading the data.
 Any post fixes should be done with the normal account. 

 - There will be a new upload tomorrow with simplified data, and the full
 massgis data osm replaced with an overlapping osm. This will be saved away
 for later incase anybody wants to do merges after this project is done. I
 will do Groton for sure

  

 Only two remaining issue 

  

 - If we should restrict manual importing to a small group of people. I
 have gotten several feedback messages that letting everybody do it has **
 **

 turned out not to be a great idea on other imports. We are up to 5 people
 now, all of them have zillion changes to the map. This is not a problem
 yet. Honestly, I am not sure if people will be beating down the doors to
 help or not...

  

 - How hard do we work on the manual imports. Do we stop doing them if the
 data looks really good, and just run the script, or do we keep doing the
 manual imports until we tire of it.  Running the script sooner, gets the
 map better quicker, and lighter OSM users can probably handle cleaning up
 the small 1/500 errors.

  

 After this, I think we are ready to roll this up and get to work.

  

 Thanks

 Jason.

 ___
 Talk-us mailing list
 Talk-us@openstreetmap.org
 http://lists.openstreetmap.org/listinfo/talk-us



 ___
 Talk-us mailing list
 Talk-us@openstreetmap.org
 http://lists.openstreetmap.org/listinfo/talk-us




-- 
Jeff Meyer
Global World History Atlas
www.gwhat.org
j...@gwhat.org
206-676-2347
___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread Jeff Meyer
On Sat, Dec 15, 2012 at 7:50 AM, Frederik Ramm frede...@remote.org wrote:

 2. If you cannot be bothered to fix problems then why should others be?
 What is your plan for growing the community to a point where it can
 maintain the data you plan to dump onto OSM?


What's an acceptable error rate? 0?

What's the error rate for manual entry? I'd suggest that if Jason hits 1
error in 500 whatevers, that that's lower than manual entry.

What are some specific examples of costs expected to be incurred by
importing the building outlines the Massachusetts OSM community has
identified?

Could concerns about errors be addressed by a manual, building-by-building
QA process that might address any error issues? What can he *do* to address
your concerns?

Rather than addressing the imports=good vs. imports=bad discussion, what
are the steps Jason hasn't addressed that should be included in order to
make sure that we are minimizing the potential for introducing problems
into the OSM database.

- Jeff
___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread andrzej zaborowski
On 15 December 2012 20:09, Jeff Meyer j...@gwhat.org wrote:
 Paul - I've added a few comments and questions about changeset size and
 revert policies on the Import Guidelines  Plan Outline wiki pages.

 Are there any recommended changeset size limits and/or revert plan
 practices?

One good practice is not to revert data that is not known to be wrong.
 If a big changeset fails halfway through it's possible to fix the
remaining part to use the nodes that have been uploaded and continue,
rather than delete the 1000s of nodes just to create new ones in the
same places.

You can probably now do that in JOSM by downloading the changeset
containing the orphaned nodes, opening in JOSM together with the data
being uploaded and telling the validator to fix all duplicate nodes.
 Myself I've been using the python scripts at
http://wiki.openstreetmap.org/wiki/Upload.py in such situations,
although the api is much more stable now than it was a couple years
ago.

The other good practice, but possibly not usable with JOSM alone, is
not to let the program upload the naked nodes in bulk and then the
buildings in bulk.  You can sort the elements in such a way that every
50k element changeset contains say 45k nodes and 5k ways.  The scripts
let you limit the number of elements in a chunk which the next chunk
depends on to the minimum (optimally 0), this way there's no risk of a
passer by spotting orphan nodes and deleting some causing you
conflicts in your next chunk.

Cheers

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread andrzej zaborowski
On 15 December 2012 22:45, Paul Norman penor...@mac.com wrote:
 From: andrzej zaborowski [mailto:balr...@gmail.com]
 Subject: Re: [Talk-us] MassGIS Building Import - process

 On 15 December 2012 20:09, Jeff Meyer j...@gwhat.org wrote:
  Paul - I've added a few comments and questions about changeset size
  and revert policies on the Import Guidelines  Plan Outline wiki
 pages.
 
  Are there any recommended changeset size limits and/or revert plan
  practices?

 One good practice is not to revert data that is not known to be wrong.
  If a big changeset fails halfway through it's possible to fix the
 remaining part to use the nodes that have been uploaded and continue,
 rather than delete the 1000s of nodes just to create new ones in the
 same places.

 You can probably now do that in JOSM by downloading the changeset
 containing the orphaned nodes, opening in JOSM together with the data
 being uploaded and telling the validator to fix all duplicate nodes.

 This hasn't come up for me on an import but I've tried it with normal
 mapping. I don't believe you can do a fix all on duplicate nodes and instead
 have to resolve them all individually

With a current JOSM it seems you can select all the errors and click
fix, or you can select the Other duplicate nodes category and
click fix, I just checked.  But JOSM will add each pair of merged
nodes as an individual operation in undo history.

I noticed it also merges the dupe nodes when you're merging two layers
rather than copy from one layer and paste onto another.



  Myself I've been using the python scripts at
 http://wiki.openstreetmap.org/wiki/Upload.py in such situations,
 although the api is much more stable now than it was a couple years ago.

 What's your workflow? Do changes in JOSM, save and then pass to the scripts?

Yes.  The osm2change script understands the JOSM format and produces
and .osc.  With the TIGER name expansion the bot produced .osc files
directly which were reviewed in a text editor.


 The other good practice, but possibly not usable with JOSM alone, is not
 to let the program upload the naked nodes in bulk and then the
 buildings in bulk.  You can sort the elements in such a way that every
 50k element changeset contains say 45k nodes and 5k ways.  The scripts
 let you limit the number of elements in a chunk which the next chunk
 depends on to the minimum (optimally 0), this way there's no risk of a
 passer by spotting orphan nodes and deleting some causing you conflicts
 in your next chunk.

 The best way to do this in JOSM alone is to only merge in 1-5k nodes+ways at
 a time, review them, then upload. This also avoids most of the problems
 above.

 I would only ever do a 50k object changeset in very limited circumstances
 where I am confident that it is safe to do so. Even then I'd try to keep it
 under 25k normally. For normal imports I'd suggest 10k as a soft limit.


Yes, good points.

Cheers

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread Greg Troxel

Jeff Meyer j...@gwhat.org writes:

 On Sat, Dec 15, 2012 at 7:50 AM, Frederik Ramm frede...@remote.org wrote:

 2. If you cannot be bothered to fix problems then why should others be?
 What is your plan for growing the community to a point where it can
 maintain the data you plan to dump onto OSM?


 What's an acceptable error rate? 0?

 What's the error rate for manual entry? I'd suggest that if Jason hits 1
 error in 500 whatevers, that that's lower than manual entry.

Well put.  Before the import, a vast number of buildings that actually
exist were not shown on the map.  To me, that is a greater problem than
a tiny number of non-buildings rendered as buildings.

One can argue about the weight of actual buildings not shown vs
non-buildings shown in computing a total map quality function, but by
all accounts of people who have looked at the data the quality is very
high.  And the non-buildings shown are things are typically thing that
are structure-like but not really structures, which is exactly the kind
of thing that I've added by hand as builings when maybe I shouldn't.

The community in Mass is actually growing, and I haven't heard from
anyone local who is opposed to what's happening.  This process is
causing us to know each other better.  So I think random complaining at
Jason for you are importing so you must be harming the community is
uncalled for, unsubstantiated, and harmful to building community.  In my
view, community is about whether people want to belong and want to work
with each other, and I see the current situation as a net plus.


pgpSKUvwmGOiS.pgp
Description: PGP signature
___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread Greg Troxel

Jeff Meyer j...@gwhat.org writes:

 Paul - I've added a few comments and questions about changeset size and
 revert policies on the Import Guidelines  Plan Outline wiki pages.

 Are there any recommended changeset size limits and/or revert plan
 practices?

Tools support for reverting is a fair point.  But, I'd argue that it's a
bug in JOSM if one can upload a single changeset that the reverter can't
undo, and if so (I may be confused) this should be fixed in JOSM rather
than be a special manual requirement.


pgpphR8cNggpV.pgp
Description: PGP signature
___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] MassGIS Building Import - process

2012-12-15 Thread Paul Norman
 From: Greg Troxel [mailto:g...@ir.bbn.com]
 Subject: Re: [Talk-us] MassGIS Building Import - process
 
 
 Jeff Meyer j...@gwhat.org writes:
 
  Paul - I've added a few comments and questions about changeset size
  and revert policies on the Import Guidelines  Plan Outline wiki
 pages.
 
  Are there any recommended changeset size limits and/or revert plan
  practices?
 
 Tools support for reverting is a fair point.  But, I'd argue that it's a
 bug in JOSM if one can upload a single changeset that the reverter can't
 undo, and if so (I may be confused) this should be fixed in JOSM rather
 than be a special manual requirement.

It's not a bug in JOSM. JOSM gives you lots of way to shoot yourself in the
foot, this is due to an interaction between large changesets, the rails
port, database/serialization times on the changeset /download call and
imports.

Because it's pretty much import-specific it's not too serious an issue if
people are following the guidelines because they require you to have the
knowledge of how to fix your mistakes. The checklist suggests developing a
revert plan and the guidelines specifically state If you don't know how to
revert an import, don't do the import in the first place.

All of this being said, an item on my to-do list is to propose reducing the
maximum changeset size. Right now it is too easy to make a changeset that
only specialized ill-documented tricky to use tools can download.


___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


[Talk-us] MassGIS Building Import - process

2012-12-13 Thread Jason Remillard
Hi,

I update the wiki

http://wiki.openstreetmap.org/wiki/MassGIS_Buildings_Import

It includes all of the emails from tonight.

- The import account will be restricted to directly uploading the data. Any
post fixes should be done with the normal account.
- There will be a new upload tomorrow with simplified data, and the full
massgis data osm replaced with an overlapping osm. This will be saved away
for later incase anybody wants to do merges after this project is done. I
will do Groton for sure

Only two remaining issue

- If we should restrict manual importing to a small group of people. I have
gotten several feedback messages that letting everybody do it has
turned out not to be a great idea on other imports. We are up to 5 people
now, all of them have zillion changes to the map. This is not a problem
yet. Honestly, I am not sure if people will be beating down the doors to
help or not...

- How hard do we work on the manual imports. Do we stop doing them if the
data looks really good, and just run the script, or do we keep doing the
manual imports until we tire of it.  Running the script sooner, gets the
map better quicker, and lighter OSM users can probably handle cleaning up
the small 1/500 errors.

After this, I think we are ready to roll this up and get to work.

Thanks
Jason.
___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us