[OSM-dev] New feature: OAuthed rate limits!

2016-10-12 Thread Matt Amos
Hi!

A version of CGImap [1] has been deployed which supports "OAuthed rate
limits". What this means is that requests with a valid OAuth signature
will be rate-limited by user account rather than source IP address
[2].

This probably makes no difference for most of us, most of the time.
But it potentially could make a huge difference to people at mapping
parties or any other situation where a large number of people are
trying to map from behind a NAT router or firewall.

At the moment, I don't think any editors send OAuth-signed requests
unless authorisation is required for that particular API call, but I
hope that it will be easy to integrate this feature (perhaps
optionally after receiving "509 Bandwidth Limit Exceeded"). Please let
me know or file an issue [3] if you are getting unexpected errors with
OAuth-signed requests, or if there's anything I can help with adding
this feature to your editor.

Another feature released along with this is CGImap support for the
/changesets/#{id} call, including discussions. Again - please let me
know or file an issue [3] if you notice anything amiss with this call.

There is on-going work to support historical versions of elements [4],
which I hope will lead to support for history calls as well as
changeset/#{id}/download (osmChange) API calls. And eventually, the
whole API [5]. If you'd like to help, please take a look at the source
code and get in touch!

Cheers,

Matt

[1] https://github.com/zerebubuth/openstreetmap-cgimap/releases/tag/v0.5.5
[2] https://github.com/openstreetmap/operations/issues/36
[3] https://github.com/zerebubuth/openstreetmap-cgimap/issues
[4] https://github.com/zerebubuth/openstreetmap-cgimap/tree/history
[5] http://www.asklater.com/matt/blog/2015/11/18/the-road-to-api-07/

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


[OSM-dev] London development event, Aug 8-9

2015-07-01 Thread Matt Amos
The OpenStreetMap London group would like to invite you to our next
development event. If you'd be interested in a weekend of collaborative
development of OpenStreetMap and open geodata technologies, please sign up
at http://www.eventbrite.com/e/osm-london-hack-weekend-tickets-17511175397
Sadly, space is limited and signing up for a ticket is necessary.

The theme for this event is mobile development and we would especially like
to welcome anyone with an interest or experience in that area. If mobile
isn't your thing, that's cool too; everyone's welcome to come and hack on
OSM and open geodata projects! This isn't just an event for developers - we
need designers, UX experts, testers, users, mappers and all kinds of people
to help us make the best possible result!

Over the course of the weekend, we'll figure out a plan and then try to
implement it. Note that this isn't one of those competitive hacking
competitions; there might be different teams working on different things,
but we're all working together. The planning will take place on Saturday
morning, so please take that into account in your plans.

Our thanks to our hosts for the weekend, Geovation Hub, for letting us use
their office, which is fully accessible.

For more info and tickets, please see
http://www.eventbrite.com/e/osm-london-hack-weekend-tickets-17511175397
___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


[OSM-dev] Changeset discussions: dumps and replication

2015-02-25 Thread Matt Amos
at the recent Karlsruhe hack weekend [1] (thanks to Geofabrik for
hosting), Paul Norman and i made some changes to changeset replication
[2] and planet dumps [3] to support discussions.

the first of the dumps is now available [4] as, for the moment, a
separate file and the plan is to merge that back into the "main"
changesets dump in about 6 months time - if you consume changeset
dumps, please test your code against the discussions dump!

the discussions dump is largely the same as the changesets dump, but
includes the discussions like this:

 
  
   
From humble beginnings…
   
  
 

which is the same format as the API (with comments included) [5].
these are also present in the same format in the changeset replication
feed [6], which allows you to keep a dump or replicated database up to
date with the latest comments & changesets.

i hope this will give everyone the data they need to build interesting
new ways of visualising and interacting with the changeset comment
data, for example; better UIs to display discussions near me or on the
map, analysis tools to check for spam or other bad comments, and
entirely new and unexpected cool things.

cheers & happy hacking,

matt

[1] http://wiki.openstreetmap.org/wiki/Karlsruhe_Hack_Weekend_February_2015
[2] https://github.com/openstreetmap/chef/pull/18
[3] https://github.com/zerebubuth/planet-dump-ng/pull/10
[4] http://planet.openstreetmap.org/planet/2015/discussions-150223.osm.bz2
[5] http://api.openstreetmap.org/api/0.6/changeset/1?include_discussion=true
[6] http://planet.openstreetmap.org/replication/changesets/

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OSM software repositories -- git and svn

2014-05-26 Thread Matt Amos
On Fri, May 23, 2014 at 8:44 PM, Jochen Topf  wrote:
> On Fri, May 23, 2014 at 03:19:07PM +0200, Frederik Ramm wrote:
>> 2. discoverability
>
> Lots of little repositories and a 101st with repo with list of repo URLs
> sound good to me. This would also allow different ownership/rights for
> all of the little repos. Why a one-size-fits-all solution when some repos
> could be free-for-all and some more managed. Just allow everybody to add
> any repository to your 101st "list repo".
>
> If you want to, you could ask people to add some standardized meta.json
> file or so that you can then crawl to build some kind of index to make
> it even easier to find repos by keyword or whatever.

i started working on a site for discovery, learning and code
documentation, at http://code4osm.org/ but haven't got very far with
it yet.

the site is just some scripts for processing other repositories [1].
it would be awesome to get some PRs to add other projects, improve
documentation or improve the site.

cheers,

matt

[1] https://github.com/zerebubuth/code4osm.org

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


[OSM-dev] London Hack Weekend - Mar 8 & 9

2014-02-11 Thread Matt Amos
we're having a hack weekend on the 8th and 9th March, graciously
hosted by AOL / MapQuest at their offices in central London. it would
be great to see you there!

full details are available on the wiki page [1], and there's also an
event on Lanyrd [2]. please sign up to at least one of these if you
are planning to attend, both so that we're able to judge the amount of
power sockets we'll need, and for fire safety / security.

in addition, we'll be socialising in the pub on the Friday and
Saturday evenings - if hacking isn't your thing, we'd be very happy to
see you there. venues TBA at the moment, and suggestions welcome.

cheers,

matt

[1] http://wiki.openstreetmap.org/wiki/London/London_Hack_Weekend_Mar_2014
[2] http://lanyrd.com/2014/osmhack/

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


[OSM-dev] London hack weekend

2014-02-05 Thread Matt Amos
after the last hack weekend, i wanted to have another around the
beginning of March, and that's rapidly approaching. here's a doodle
poll for the weekends and if you're interested in coming, please
indicate which weekends you'd be available.

http://www.doodle.com/hh4vnx2p8kzrnypv#table

thanks,

matt

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Experimental history files too

2014-02-03 Thread Matt Amos
On Tue, Jan 28, 2014 at 11:11 AM, Peter Körner  wrote:
> Am 28.01.2014 03:18, schrieb Matt Amos:
>> On Fri, Jan 24, 2014 at 2:56 PM, Peter Körner  
>> wrote:
>>> What do you think about adopting the osmium-naming-scheme for history files?
>>
>> personally, i think it's misleading
>> [...]
>> in the case of .osm files, they're all potentially history files, and
>> the file format does not change depending on whether multiple versions
>> are present for a single ID or not.
>
> History-Files carry an extra attribute (bool visible) that distinguishes
> them from regular osm files.

all .osm files can carry the visible attribute, and they're present in
all responses from the API. they are only elided from the "current"
planet dump because they all have the value "true", so would simply be
a waste of space. to be clear: the presence of the visible attribute
does not mean the file contains multiple versions with the same ID.

> Also it's not only about the parser. [...]
> So from a data-user point of view it does not matter if the format is
> actually-quite-similar, as long as there are separate programs and tools
> required to handle the two types of file, they are actually different.

the "current" planet is a strict subset of the "history" planet -
anything which can load and process a "history" planet file should be
able to process the "current" planet. also, it is trivial to turn a
"history" planet into a "current" planet by discarding old versions
and deleted elements.

> I'd compare it more to tiff/aiff. While they are actually quite similar
> from a file-format point of view, no one would argue that audio-files
> and image-files should be handled as if there were no difference.

i would agree that audio files and image files are completely different :-)

but that is not the case with "history" and "current" planet files -
as i've said above, the former is a strict superset of the latter, so
the comparison to tiff/aiff is misleading. i can't think of a good
comparison - the best one i've thought of so far is one version of
HTML compared with a previous version, and programs which understand
only previous versions will ignore some elements of a later version
file.

>> whether something is a "history"
>> osm file or a "current" osm file is a matter of the content - so
>> wanting a different extension is a bit like wanting .png for
>> truecolour images and .pgr for greyscale images (in the same PNG
>> format).
> The main difference here is that all applications capable of reading png
> MUST be capable of reading pgr as well. That's not true for osm/osh
> applications.
>
> Also the tasks you can perform on both file formats are the same
> (display, crop, combine, ...). This is also not true for osm/osh files.
> One can import both into a database but the database-format required and
> the actions possible on those databases are quite different.

indeed, the analogy with image files was not good.

however, there are tasks that one can perform on osm files regardless
of whether they contain multiple versions with the same ID - one can
pull out elements matching certain tags & attributes, perform
transformations on the basis of tags & attributes, reproject nodes or
extract those within certain areas, etc...

>> having said that, it would seem reasonable to add a flag to
>> the document element to indicate whether the .osm file is a special
>> case, having a single version for each ID, as many programs seem to
>> rely on this assumption and it would be better to be able to check it.
> Isn't that already implemented via the required_features header of
> pbf-files?
>
> See
> <https://github.com/joto/osmium/blob/master/include/osmium/output/pbf.hpp#L880>
> for a reference.

no, actually, that flag has a different meaning. setting
"HistoricalInformation" in the PBF header simply means that some
elements may be deleted (have visible="false"). one can create a
conforming file, with all elements having visible="true", and yet
still have multiple versions for the same ID.

to correct this, PBF should probably deprecate
"HistoricalInformation"[1] and have a header field
"CurrentInformationOnly", which indicates that *both* there are no
visible="false" elements and no two elements of the same type share an
ID. and our XML should probably have the same attribute on the header.

cheers,

matt

[1]: one could simply change what "HistoricalInformation" meant, but
that runs the risk of already-written (and previously conforming)
files becoming non-conforming, which will be confusing.

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Experimental history files too

2014-01-27 Thread Matt Amos
On Fri, Jan 24, 2014 at 2:56 PM, Peter Körner  wrote:
> Am 23.01.2014 18:28, schrieb Matt Amos:
>> i encourage everyone to take a look and report back any problems you
>> find. my thanks to Peter Körner, who seems to already be doing this -
>> with no problems?
>
> No problems yet. Have run two splits of those files already
> (http://osm.personalwerk.de/full-history-extracts/)
>
> What do you think about adopting the osmium-naming-scheme for history files?
>  .osm.[bz2|gz|pbf] -> regular osm files
>  .osh.[bz2|gz|pbf] -> history files
>  .osc.[bz2|gz]-> changeset-files
>
> That would make detecting the kind of file at first glance more easy and
> it also fits nicely int othe .osc-file nameing convention.

personally, i think it's misleading. osmchange is a related, but
different, format from osm xml and a parser which works for one will
not necessarily work for the other. therefore, having a different
extension seems reasonable.

in the case of .osm files, they're all potentially history files, and
the file format does not change depending on whether multiple versions
are present for a single ID or not. whether something is a "history"
osm file or a "current" osm file is a matter of the content - so
wanting a different extension is a bit like wanting .png for
truecolour images and .pgr for greyscale images (in the same PNG
format). having said that, it would seem reasonable to add a flag to
the document element to indicate whether the .osm file is a special
case, having a single version for each ID, as many programs seem to
rely on this assumption and it would be better to be able to check it.

> I'm going to implement a regular run that generates fresh extracts every
> week from the available file. Is there any note on which weekday the
> full-history-dumps are generated, so I can loosely sync my split-script
> to that rhythm?

great! :-)

the generation is synced to the backup database dumps, so the clock
starts running early Tuesday, when Monday's backup is complete. they
seem to be fairly reliably finished by Wednesday morning, so it's
probably safe to start looking for them then - although they'll be
named for Monday's date.

> I xml-writing takes only half as long as xml-reading I'd double-think
> about supplying xml-based files. nobody really has fun reading such huge
> files with expat. And if it's really neccessary, there's always
> osmium_convert which will generate xmls from pbf-dumps or -extracts locally.

this is a discussion which could probably continue forever. my opinion
is that it's worthwhile distributing files which are sort-of
human-readable, in a well-known format/markup for which many libraries
exist in many languages, compressed with standard tools, and in the
same format as the API.

this way, it's possible for people to develop tools which work against
small map call downloads, then scale them to extracts and even the
whole planet. of course, it's a widely-held belief that xml sucks
irretrievably and, while it's certainly true that pbf is smaller and
parses faster, distributing only pbf would mean someone would have to
learn those extra tools/commands to start using the data.

xml, despite its many flaws, at least has myriad libraries, bindings
and tools which make it easier to experiment with processing and
transforming osm data. these experimental planet/history files are
also line-oriented, which means one can even do quick-and-dirty
grep/sed/awk work for ad-hoc analysis.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] London Hack Weekend - Nov 30 / Dec 1

2013-10-26 Thread Matt Amos
On Sat, Oct 26, 2013 at 3:57 PM, Matt Amos  wrote:
> full details are available on the wiki page [1], and there's also an
> event on Lanyrd [2].

of course, it would have been more helpful if i'd actually put the links in...

[1] http://wiki.openstreetmap.org/wiki/London/London_Hack_Weekend_Nov_2013
[2] http://lanyrd.com/crxqm

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


[OSM-dev] London Hack Weekend - Nov 30 / Dec 1

2013-10-26 Thread Matt Amos
we're having a hack weekend on the 30th November and 1st December,
graciously hosted by AOL / MapQuest at their offices in central
London. it would be great to see you there!

full details are available on the wiki page [1], and there's also an
event on Lanyrd [2]. please sign up to at least one of these if you
are planning to attend, both so that we're able to judge the amount of
power sockets we'll need, and for fire safety / security.

in addition, we'll be socialising in the pub on the Friday and
Saturday evenings - if hacking isn't your thing, we'd be very happy to
see you there. venues TBA at the moment, and suggestions welcome.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


[OSM-dev] Fun documentation questionnaire!

2013-10-07 Thread Matt Amos
we at EWG are interested in which OSM-related projects would benefit
from some help improving their documentation for developers. that is;
the documentation which you'd look at if you wanted to start
contributing to that project, not any documentation for using it.

to help collect some data to inform any discussion, we would be very
grateful if you could spend a few minutes filling out this short (and
fun) questionnaire:

http://bit.ly/17OxvKa

if you see a project which you've not heard of before, or haven't seen
the developer documentation, it would be extremely helpful if you
could spend a short time looking for it. it may not even exist - but
it's good data!

the list of projects is not supposed to be exhaustive, and there are
many worthy projects missing from the list. if there is something
which we've missed, please use this thread to discuss it and its
developer documentation situation.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Fwd: Changeset metadata replication stream not working

2013-04-12 Thread Matt Amos
On Fri, Apr 12, 2013 at 7:18 PM, Paweł Paprota  wrote:
> Please either fix it or take it down so I know what to do in my code.

apologies, should be working again now.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] Latest full-history planet

2013-02-08 Thread Matt Amos
now available from:

http://planet.openstreetmap.org/planet/full-history/

get it while it's hot :-) and please let me know if you find any errors with it.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Status of the Mapnik stylesheets

2012-11-16 Thread Matt Amos
On Wed, Nov 14, 2012 at 12:23 PM, Richard Fairhurst wrote:

> Matt Amos wrote:
> > i'd sound a note of caution about having separate "clean" and
> > "detailed" styles. we sort-of did that before with mapnik and
> > osmarender respectively and... well, we don't have
> > osmarender any more.
>
> That was a technology failure, though, rather than anything wrong with the
> concept itself.
>

the greater resources required to render osmarender worldwide may have
played a part, but that part of the problem was the one getting the most
attention with tiles@home and so forth. my take on it was that many,
perhaps most, people felt the extremely "detailed" style that
osmarender/tiles@home was producing wasn't as pleasant as the mapnik style,
so it got second billing, less re-use, and that's what led to its decline.

In principle, there are clear, identifiable, distinct needs for a "showcase
> style" and a "debugging view". We want the great unwashed to look at OSM
> and
> say "wow, that's a complete, accurate map"; and we want our mappers to
> enjoy
> the gratification of seeing their changes rendered, because that's a
> powerful incentive to keep contributing.
>
> (I use the word "view" rather than "style" because it's conceivable that
> the
> latter could be provided some other way than a traditional Mapnik
> stylesheet, perhaps something along the lines of Kothic-JS.)
>

i agree, but would go further and suggest that the debugging view might be
better constructed on top of the "showcase style", rather than being able
to (d)evolve independently of it. i would further suggest that the
gratification of seeing changes rendered is strongly reliant on those
changes being visible directly on osm.org, which (i think) leads to the
conclusion that these two "views" cannot be separate, and must be
integrated somehow.


> (For the showcase style, personally I think XML vs CSS is more of a problem
> than svn vs git, but that's an implementation detail.)
>

i'd like to applaud Andy Allan's recent work in this area [1]. this is
excellent stuff, and many thanks to Andy for starting it :-)

cheers,

matt

[1] https://github.com/gravitystorm/openstreetmap-carto
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Status of the Mapnik stylesheets

2012-11-13 Thread Matt Amos
On Tue, Nov 13, 2012 at 10:36 PM, Tom MacWright  wrote:

> There is no way to 'just do it' until the style ... has active
> maintainers. Until then we're just talking.


seems like this might be the major hurdle - there do seem to be people
willing to contribute on this thread, but uncertainty about who is / will
be the maintainer and take on the long-term responsibility of choosing
which contributions to take.

i'd sound a note of caution about having separate "clean" and "detailed"
styles. we sort-of did that before with mapnik and osmarender respectively
and... well, we don't have osmarender any more.

cheers,

matt
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Minutely changeset feed

2012-11-07 Thread Matt Amos
On Wed, Nov 7, 2012 at 5:22 PM, Toby Murray  wrote:

> On Wed, Nov 7, 2012 at 10:15 AM, Matt Amos  wrote:
> > On Fri, Nov 2, 2012 at 6:34 AM, Toby Murray 
> wrote:
> >>
> >> It looks like changesets don't show up until they are closed. This
> >> makes sense since then you don't have to worry about information
> >> changing.
> >
> >
> > hmm... this shouldn't be the case - changesets should show up when
> they're
> > opened and again when they're closed. if they're opened and closed in the
> > same period, it will only show the closed state. this seems to be what
> > happens with most changesets - they open and close within a few seconds.
>
> I looked at 5 consecutive diffs just now and there was not a single
> open changeset. Other random ones I've looked at in the past also
> don't have any open ones. I find this highly unlikely since it is
> actually kind of hard to manually close a changeset in P2. So I
> suspect you have a bug. Possibly the same bug that affected the weekly
> planet dump until a few weeks ago when I noticed it and Tom fixed it:
>

ah... just noticed a massive bug - i was assigning both created_at and
closed_at from the database's created_at! no wonder it thought everything
was already closed. right, i'll go fix that.


>
> http://git.openstreetmap.org/planetdump.git/commitdiff/e406711139b9eb8db9d32fa4094325ec3841082c?hp=beb331bcdf6e5565b02fe61be397c150bc03
>
> The problem was that all changesets have the closed_at field set to
> one hour in the future whenever the changeset is touched. This is
> stored in UTC. If one hour has passed since the last change, meaning
> that closed_at is in the past, the changeset is considered closed. But
> the SQL that determines the changeset's openness uses the now()
> function which by default returns server local time. During daylight
> savings time in London this means that, the "closed_at > now()" check
> always returned false so all changesets were marked as being closed in
> the weekly dump.
>
> But then again, DST is no longer in effect... So it may be something else.


yeah, i thought i handled that - all calls in the code are UTC, although i
see i've used now() in one SQL statement, but it's just to get a starting
list and is filtered against the correct time later in the code.

i will have to do some investigation...


> >>
> >> However, what will it do in the edge case where a changeset is closed
> >> but then reopened? Not possible you say? I once had an upload take
> >> over an hour to process. It did eventually succeed but the changeset
> >> was marked as closed in my changeset list for several minutes before
> >> the upload finished and then it was suddenly open again. I am assuming
> >> in this case the changeset would show up twice in the minutely diffs
> >> and cause the INSERT queries of all consumers to explode violently :)
> >
> >
> > yeah... i don't think the code is able to handle this at the moment...
> maybe
> > it would see the re-opening, but probably not. any specific examples of
> this
> > would be great, btw, to help me start building up some test cases.
>
> Ok well if the changeset normally shows up twice in the feed anyway
> then tools will need to handle this so it shouldn't be a problem on
> the consumer side. But yes, the code generating the diffs does need to
> take this into account. I know the osmosis diffs are based on database
> transactions so they pick up any row that has changed since the last
> diff. Applied to changesets, this would actually produce an entry in
> the stream every time a chunk was uploaded (in JOSM chunked upload
> mode)
>

yeah, i wanted to avoid the changeset being in the feed too many times
(e.g: chunked upload). ideally only when it is opened and closed, although
there are situations where it probably prints out more often.


> By examples, do you mean a changeset ID? Since this is all time based
> that won't do much good since now the changeset will look like any
> other changeset that I held open for 2 hours.


i'll have to find more inventive ways to test the code, then ;-)

cheers,

matt
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Minutely changeset feed

2012-11-07 Thread Matt Amos
On Fri, Nov 2, 2012 at 6:34 AM, Toby Murray  wrote:

> Since I have been playing with changeset metadata[1] a bit, Paul
> Norman recently pointed out to me that there are now minutely diffs of
> changesets available from planet.osm [2]. Would anyone care to share
> some juicy details like where it came from and how it works?
> Apparently it was mentioned in an EWG meeting. I should really check
> one of those out...
>
> Anyway, some observations/questions:
>
> I see the state file is in a different format than all the other
> replication state files. Probably not a big deal but is there value in
> consistency?
>

perhaps. ultimately, i'd like this to be rolled into the osmosis diff
production processes, but doing that now might cause some breakage. in the
interim, i thought a YAML file is probably standard enough that not
re-using the non-standard osmosis parser would be easy enough.


> Are there any existing tools to use this stream?
>

i don't know of any, but i'd be very happy to learn of new tools / support
in existing tools :-)


> It looks like changesets don't show up until they are closed. This
> makes sense since then you don't have to worry about information
> changing.
>

hmm... this shouldn't be the case - changesets should show up when they're
opened and again when they're closed. if they're opened and closed in the
same period, it will only show the closed state. this seems to be what
happens with most changesets - they open and close within a few seconds.


> However, what will it do in the edge case where a changeset is closed
> but then reopened? Not possible you say? I once had an upload take
> over an hour to process. It did eventually succeed but the changeset
> was marked as closed in my changeset list for several minutes before
> the upload finished and then it was suddenly open again. I am assuming
> in this case the changeset would show up twice in the minutely diffs
> and cause the INSERT queries of all consumers to explode violently :)
>

yeah... i don't think the code is able to handle this at the moment...
maybe it would see the re-opening, but probably not. any specific examples
of this would be great, btw, to help me start building up some test cases.

cheers,

matt



> [1] https://github.com/ToeBee/ChangesetMD
> [2] http://planet.openstreetmap.org/replication/changesets/
>
> Toby
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] ODbL full history planet

2012-10-19 Thread Matt Amos
now available from:

http://planet.openstreetmap.org/planet/full-history/

only limited testing has been done on this file (checking for XML
well-formed-ness and various spot-checks on elements), so please let me
know if you find any errors with it.

cheers,

matt



___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Why are so many changeset so large?

2012-10-17 Thread Matt Amos
On Wed, 2012-10-17 at 00:28 +0100, Tom Hughes wrote:
> On 17/10/12 00:04, Alex Barth wrote: 
> > - Are there technical reasons why changesets should tend to be 
> > large? Are they expensive on some level?
> 
> I believe it's entirely because we've got so many people doing 
> mechanical or semi-mechanical edits.
> 
> That includes bots but also things like people using xapi or overpass to 
> download all objects matching some set of tags, then change those tags 
> and reupload.

the historical answer to this is that when changesets were added to the
OSM API there were two different intentions for their use which got
conflated: first, that changesets were structures for grouping edits
sharing common attributes. and second, that changesets were VCS-style
'commits' which would be uploaded in a single request and applied
atomically.

effectively, the first use-case was for users, and tried to make
changesets as open-ended as possible. from this, we get tags on
changesets for comments, editor, bot-ness, etc... and the ability to
keep uploading into an open changeset.

the second use-case was a technical thing - the sheer number of API
calls to individual elements, even from normal-sized editing sessions,
could cause problems. and, for small calls, HTTP headers and round-trip
latencies would dominate the cost of an upload. further, editors had to
cope with the situation where an upload failed half-way through and to
re-try the failed calls. from this, we get a single changeset/#id/upload
call which applies atomically.

at the time, this seemed like a good way to satisfy both use-cases. and,
while it does what it set out to, i think we should consider splitting
these in the next API version; explicitly reifying uploads at which
bboxes / coverage sets and change counts can be stored. changesets can
then simply be collections of uploads.

getting to the point: this might to some extent mitigate the "large
changesets" issue, as it would allow bboxes to be collected at a smaller
granularity. however, it wouldn't be a full solution and we'd probably
still need something like OWL to break down the geographic footprint of
changesets further.

cheers,

matt



___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Older version of OWL alive?

2012-10-16 Thread Matt Amos
On Tue, 2012-10-16 at 09:32 -0400, Alex Barth wrote:
> Matt (or others who might know) -
> 
> Is there a staging (or some other kind of) version of OWL alive that I could 
> look at? 

sadly not. the version on dev (which was crushing it) died under its own
weight a while ago. the situation back in May [1] is pretty much the
same situation we're in now, despite great work by Ian Dees on trying to
get the front-end working.

cheers,

matt

[1] http://lists.openstreetmap.org/pipermail/talk/2012-May/062959.html



___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OSM Wishlist

2012-10-15 Thread Matt Amos
On Mon, 2012-10-15 at 18:13 +0200, Paweł Paprota wrote:
> > "top ten tasks" is  "These are the Top Ten Tasks that the OSM System 
> > Administrators"
> > What about the community ? This only is a todo list by the admins, for
> > the 
> > admins coded by the admins. So far so good, but that's not a wishlist,
> > or, at 
> > least, not a wishlist of the community.

the page says: "These are the Top Ten Tasks that the OSM System
Administrators would really like your development help on." so i think
it's unfair to say it's a list by the admins, for the admins. the
important part of the sentence is "... would really like your
development help on."

we (EWG) formulated this list as a curated selection of tasks that we
thought were generally important so that people who were looking for
something to do would have some idea of what "the most important"[1]
items were, and where their efforts would be most appreciated.

> Disclaimer: I have just started working on OSM a few weeks ago so I may
> be wrong with my impression about how things work.
> 
> Generally I would say that OSM works like a typical open source project
> - people who do the actual programming work choose what they want to
> work on. That's OK since this characteristic is the main attraction to
> open source for programmers around the world - they can work on what
> they like, instead of working on what their boss or a customer order
> them to work on.

exactly - there are no restrictions on what you should work on. the data
is open, the software is open, and you can work on whatever is
interesting to you.

and if, surrounded by this huge number of different things to work on,
you want to work on something that some people who are knowledgeable
about OSM think is important enough to have short-listed; that's the Top
Ten Tasks[1].

> I think every open source project, including the big ones has some
> challenges with "user voice being heard" or at least that's the
> impression. If you propose to change it by creating a community-driven
> (instead of "admin"-driven as you put it) wishlist, by any means - do
> it. The operative word being "do".

one problem (which probably started this thread) is that a wishlist is
potentially infinite. and, even treating it as an ideas-gathering forum,
the value becomes diluted with the quantity of ideas presented. the
hardest part of making the process useful is curating the list of wishes
and doing the work of turning it into a list of achievable and practical
items. and, as you rightly point out, the operative word is "do".

cheers,

matt

[1] this is not to say that other things aren't important. when the Top
Ten Tasks were written, our crystal ball was out of order so we sadly
couldn't predict the future. things change, and ten is much too small a
number to get everything that's important onto the list, so the TTTs are
a just a suggestion - they're not gospel.



___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Fwd: Streaming Replication

2012-10-15 Thread Matt Amos
On Sun, 2012-10-14 at 22:39 +1100, Brett Henderson wrote:
> The timestamp columns in the database are set to "timestamp without
> time zone" which presumably means the timezones of dates aren't
> automatically converted to the correct timezone upon querying.  I'm a
> bit confused though because I believe PostgreSQL itself is running in
> the BST timezone.  I'd like to investigate further but I don't have
> time at the moment.

bare timestamps are stored in the database in UTC always [1]. as far as
i know, there's no way to decorate the column with this information, so
postgres doesn't have it to do any conversions. it seems likely that
somewhere along the stack of software doing the query something is
adding timezone information by defaulting to the current timezone, and
it shouldn't be doing that.

cheers,

matt

[1]
https://github.com/openstreetmap/openstreetmap-website/blob/master/app/models/node.rb#L290



___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OSM Wishlist

2012-10-12 Thread Matt Amos
On Fri, Oct 12, 2012 at 11:50 PM, Iván Sánchez Ortega
 wrote:
> A diff/patch tool for imported data.

opengeo presented at SOTM about their plans for "geogit" [1]. from
what i understood one of their aims is interoperability between
separately-maintained OGC model datasets and OSM. i think i'm right in
saying that there's a fair way to go before it's a finished product,
but i'm sure they'd welcome any support.

cheers,

matt

[1] http://blog.opengeo.org/tag/geogit/

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Hello World

2012-10-12 Thread Matt Amos
On Fri, Oct 12, 2012 at 6:14 PM, Michal Migurski  wrote:
> * Improved relation + data model support in a editors, either a new one or a 
> retrofit onto Potlatch. Things like route relations or addressing schemas are 
> important, for example, and should see the same kind of first-class support 
> in an editor that the primary/secondary/etc. road schema does. It's time for 
> these higher-order features to come out of the lab.

another item, if i may add to your wishlist, is turn restrictions;
there's already some special-purpose support for this in JOSM and
Potlatch 2, and i think it's something where a little more work could
really help reduce the complexity of adding and maintaining these
features. they tend to be found around complex junctions and there
have been many ideas in the past for what a UI specifically for
junction editing might look like (for example [1]).

> * Better time-awareness in planet dumps. I want to see a "days since last 
> edit" or similar attribute on all entities in the planet file, which will 
> make it possible to create downstream displays and styles that address hot, 
> contentious information. We're starting to see two tiers of OSM data 
> consumers, those who want the latest-newness and those who want to treat OSM 
> like a slower-moving data source.

please could you explain a little more? i think perhaps i've
misunderstood what you're asking for. all nodes, ways and relations in
the planet file have a timestamp, and the planet has a date header
that would allow the calculation of "days since last edited". would
you like this information on the sub-elements (tags, nds, members) as
well?

> Stepping back, the story of OSM *is* that things happen in other tools, so 
> I'm in agreement with Paweł Paprota that Mapbox should "build something that 
> needs JSON support, filtering and tag API" before those features get fed into 
> the core infrastructure. The minute diffs support the ability to develop 
> real, working systems in an offboard manner, and I think we should take 
> advantage of that.

and systems developed in this way are loosely coupled to the rails
code and main database, making it easier to admin and scale these
services later without adding complexity to the rails code. in my
opinion, it's a huge advantage.

cheers,

matt

[1] http://www.slideshare.net/yutik/mapzen-junction-editor

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Will the real OpenStreetBugs stand up?

2012-10-09 Thread Matt Amos
On Tue, Oct 9, 2012 at 9:48 PM, Mikel Maron  wrote:
>> What part of "I will take an action to get something pushed out before the
>> next meeting" did you fail to understand yesterday?
>
> This sounds promising, but I fail to understand it fully. Is there someone
> else from the EWG meeting (not TomH, who I don't want to bother) who would
> like to fill us in on these developments with integrating bug tracking into
> OSM.org?

i've just finished putting the minutes up [1] and the relevant extract is:

* ACTION: TomH to push current OSB/notes branch public (which he has
been improving on a local branch)
* (OSB/notes) there are concerns about the UI implementation of the
current branch, but it may be possible to merge the API separately so
that UI development can proceed independently

cheers,

matt

[1] http://www.osmfoundation.org/wiki/Working_Group_Minutes/EWG_2012-10-08

> -Mikel
>
> * Mikel Maron * +14152835207 @mikel s:mikelmaron
>
> 
> From: Tom Hughes 
> To: Tom MacWright 
> Cc: dev@openstreetmap.org
> Sent: Tuesday, October 9, 2012 4:31 PM
> Subject: Re: [OSM-dev] Will the real OpenStreetBugs stand up?
>
> On 09/10/12 21:24, Tom MacWright wrote:
>
>>All those are independent third party sites created by individuals
>>and are not directly related to core site.
>>
>> Aren't they using the same database somehow?
>
> No idea.
>
>>What we were talking about in the EWG meeting was adding a "bug"
>>reporting system to the main site that records things in the main
>>database and is integrated with the API etc.
>>
>>It is not directly related to any of the sites you mention.
>>
>> Okay, then what is it? :) Is it not open-source at all? I thought that
>> you were working on a branch of the 'official' OSB project and just
>> needed to merge/publish that?
>
> What part of "I will take an action to get something pushed out before the
> next meeting" did you fail to understand yesterday?
>
> The story is that Kai created something that was literally based on taking
> one of the existing OSB systems and bolting that javascript onto the rails
> code but it didn't produce a something that was very coherent with the rest
> of the site and API so I have been reworking it.
>
> There is a branch out there that you may stumble across but it bears no
> resemblence to the current code.
>
> Now if you want me to get what I have cleaned up and published I should
> probably stop writing emails about it and actually work on it instead...
>
> Tom
>
> -- Tom Hughes (t...@compton.nu)
> http://compton.nu/
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>
>
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OWL + OSM Activity Server

2012-10-09 Thread Matt Amos
On Mon, Oct 8, 2012 at 9:33 PM, Paweł Paprota  wrote:
> Hi Ian,
>
> Thanks for the response. What you wrote about partitioning the data etc.
> is exactly what concerns me with the Changeset Activity Publisher
> implementation - it will get too big with time.

what Ian says is exactly right - the current status is that work needs
to be done to either denormalise the data or figure out a clever way
to handle non-spatial queries over this spatial data structure. the
least horrific option seems to be the denormalisation, but i've also
been playing around with trying to write an sp-gist index and toying
with the question of whether some other form of database might be a
better fit.

> On the other hand, for "social" purposes I guess there is no need to
> hold on to every single change since forever... Does anyone read
> Facebook posts from few months ago? Guess not...

depending on how fast the generation process is, could it potentially
be done on-the-fly and cached?

> Slightly different use cases I think but still it would be good to learn
> from OWL devs knowledge about scaling something like this.

i can definitely tell you what doesn't work: sticking it all in one
table and hoping the database will magically sort it out ;-)

as yet i haven't seen a solution which doesn't incur significant drawbacks.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] Re-booting EWG

2012-10-01 Thread Matt Amos
in the shadow of the big, bad license change, i'm afraid EWG rather
fell by the wayside. apologies about that, and for taking so long to
re-start it. in keeping with the spirit of continuity, the next
meeting will be on #osm-ewg in a week's time: monday 8th october at
1800 GMT (see [1] for times in various time zones).

on the agenda for this meeting: review of the top ten tasks [2] and
planning upcoming events.

cheers,

matt

[1] http://www.timeanddate.com/worldclock/fixedtime.html?iso=20121008T19&p1=136
[2] http://wiki.openstreetmap.org/wiki/Top_Ten_Tasks

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] [Building Lane Model for OSM]How can I get multiple gpx from osm for repeated roads?

2011-11-17 Thread Matt Amos
On Thu, Nov 17, 2011 at 4:47 PM, donglai  wrote:
> Hi all:
>
> I'm thinking to build the lane model for OSM using machine learning
> techniques. How can I get multiple gpx files from osm database for certain
> roads?

the GPX tracks in the OSM database are not explicitly related to the
roads - it is a completely different dataset in that regard. to
download tracks for an area of interest, there is the "trackpoints"
API call [1]. however, for bulk downloading large areas there is no
good method (yet). perhaps using the trackpoints API call near some
major intersections would yield the data you need?

cheers,

matt

[1] http://wiki.openstreetmap.org/wiki/API_v0.6#Retrieving_GPS_points

> Thanks in advance for any advice or feedback.
> Best
> D.
>
> Below are my half-baked ideas:
>
> 1) Motivation:
>
> a) According to Sebastian Thrun, in order to build the autonomous driving
> car, they push google map to 15cm accuracy with lane models !!! (I guess
> they do it in an expensive way, maybe only for part of CA to test their car)
> http://www.youtube.com/watch?v=bp9KBrH8H04
>
> b) Also, several OSM members have already proposed the lane tag for the map.
> However, it'll be a pain for our dear brave volunteers to track and mark the
> lane they were in.
>
>
> 2) One possible way out:
> a) the bottleneck is that gpx files uploaded by users are often too noisy to
> figure out the lane assignment. But we have the "turn constraint" (change to
> inner/outer lane before turning left/right) and relative geometry shape of
> the route.
> b) Given multiple gpx files for one road segment, we can have a robust
> estimation of the lane model using machine learning techniques.
> c) Also, such technique may be useful for map refinement instead of asking
> users to correct by their own where the accuracy is in meters.
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Where is the controler for requests?

2011-11-16 Thread Matt Amos
the OSM rails server follows a pretty standard layout for the
controllers, models, views and routes and so forth as used by any Ruby
on Rails project. the Rails tutorial has an overview of the directory
structure of a rails app [1] and there is more detail in the rails
official guides [2].

cheers,

matt

[1] 
http://ruby.railstutorial.org/ruby-on-rails-tutorial-book#sec:the_first_application
[2] http://guides.rubyonrails.org/

On Wed, Nov 16, 2011 at 9:21 AM, Alexandru  wrote:
> I want to add some static html pages to my server, can someone pls tell me
> where the controller is so that i can add my paths?
> Thanks in advance.
> PS: a reference for osm's general file structure would be very appreciated,
> too!
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Segmentation fault in osm

2011-11-11 Thread Matt Amos
On Fri, Nov 11, 2011 at 1:29 PM, Alexandru  wrote:
> /usr/lib/ruby/gems/1.8/gems/i18n-0.6.0/lib/i18n/core_ext/hash.rb:13: [BUG]
> Segmentation fault
> ruby 1.8.7 (2011-06-30 patchlevel 352) [x86_64-linux]
>
> Aborted

this would seem to indicate a pretty serious problem with your ruby
installation. what distro / OS are you running? and is it possible
there are some rogue out-of-date or ABI-incompatible libraries
somewhere on your disk?

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] EWG & London Hack Weekend

2011-10-23 Thread Matt Amos
based on the doodle poll dates, it looks like the weekend of the 26th
& 27th november is most convenient. i look forward to a great weekend
of hacking, discussion, education and maybe a tiny little bit of
drinking, and i hope to see you there. if you're interested in coming,
please sign up on the wiki:

http://wiki.openstreetmap.org/wiki/London/London_Hack_Weekend_Nov_2011

and a reminder that the EWG meeting is tomorrow at 5pm GMT.

cheers,

matt

On Sun, Oct 2, 2011 at 5:22 PM, Matt Amos  wrote:
> first, a quick reminder that the Engineering Working Group (EWG) [1]
> is having meetings on #osm-ewg Mondays at 5pm GMT. if you'd like to
> get involved in OSM-related development, or have ideas about how to
> support it, then please come along. this week, we're going to be
> mostly looking at "what's the biggest single thing stopping you
> developing with OSM data?" and probably a bunch more stuff :-)
>
> for a more hands-on event, i'm looking to start planning the next
> London hack weekend (previous [2]) some time in late November / early
> December. judging by previous events, it'll be a smorgasbord of
> hard-core development, tutorials and hanging around in pubs! if you're
> interested in coming along please enter your availability at this
> doodle poll [3] so that we can find out when has maximal utility [4].
>
> cheers,
>
> matt
>
> [1] http://www.osmfoundation.org/wiki/Engineering_Working_Group
> [2] http://wiki.openstreetmap.org/wiki/London_Hack_Weekend_May_2011
> [3] http://www.doodle.com/z5brf6c5wa9biwix
> [4] http://en.wikipedia.org/wiki/Utility
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] EWG & London Hack Weekend

2011-10-02 Thread Matt Amos
first, a quick reminder that the Engineering Working Group (EWG) [1]
is having meetings on #osm-ewg Mondays at 5pm GMT. if you'd like to
get involved in OSM-related development, or have ideas about how to
support it, then please come along. this week, we're going to be
mostly looking at "what's the biggest single thing stopping you
developing with OSM data?" and probably a bunch more stuff :-)

for a more hands-on event, i'm looking to start planning the next
London hack weekend (previous [2]) some time in late November / early
December. judging by previous events, it'll be a smorgasbord of
hard-core development, tutorials and hanging around in pubs! if you're
interested in coming along please enter your availability at this
doodle poll [3] so that we can find out when has maximal utility [4].

cheers,

matt

[1] http://www.osmfoundation.org/wiki/Engineering_Working_Group
[2] http://wiki.openstreetmap.org/wiki/London_Hack_Weekend_May_2011
[3] http://www.doodle.com/z5brf6c5wa9biwix
[4] http://en.wikipedia.org/wiki/Utility

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] 'tile' - what it is used for in some tables in the schema?

2011-10-01 Thread Matt Amos
On Fri, Sep 30, 2011 at 10:00 AM, Zsombor Szabó  wrote:
> Hi,
>
> studying the OSM database schema over at
> http://wiki.openstreetmap.org/wiki/Database_schema I was wondering for
> what the 'tile' attribute is used for in the current_nodes, nodes,
> gps_points tables.
>
> If you know then please share.

it's a geometric index using a B-tree of the (truncated) morton code
of the lat/lon of those elements. for the gory details, see the
sql_for_area function in the source code [1]. there's also some
general discussion on the wiki [2], info about the general method on
wikipedia [3] and in the mailing list archives [4].

cheers,

matt

[1] http://git.openstreetmap.org/rails.git/blob/HEAD:/lib/quad_tile.rb#l63
[2] 
http://wiki.openstreetmap.org/wiki/Quadtile#Indexing_single-point_geo-locations
[3] http://en.wikipedia.org/wiki/Z-order_curve
[4] http://lists.openstreetmap.org/pipermail/dev/2008-September/011694.html

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OSMF Engineering Working Group inauguration

2011-08-23 Thread Matt Amos
ern with
uservoice and similar systems was that it's not a good forum for
bi-directional discussion about things like feasibility and
compromise. but the difficulty of the task should not dissuade us ;-)

> But overall, it seems like it was a reasonable  productive first meet for
> the EWG.

thanks to everyone who came. hopefully we'll have another productive
meeting next week.

cheers,

matt

> Kai
>
>
> On 01/-10/-28163 12:59 PM, Kate Chapman wrote:
>>
>> Hi Matt,
>>
>> Just wanted to mention I'm interested in helping.  Sadly I am
>> unavailable today, though normally I would be at the time selected.
>>
>> Couple thoughts I have about the rough agenda:
>>
>> * Barriers for entry for getting started:
>>
>> People seem to have issues getting the rails port working.  Perhaps
>> creation of VM with development environment?  That would allow those
>> who maybe can just do JavaScript and CSS still to contribute
>> something.
>>
>> Documentation (this of course depends on the specific project).  Are
>> there holes in the documentation that could be improved?
>>
>> * Small Projects that are mentored to get developers started:
>>
>> I don't have any great ideas about this.  I don't actively contribute
>> to any of the main OSM projects by actually coding, though I'm willing
>> to help mentor if there is a way that makes sense.  Are there
>> individuals who come in and want to contribute code and don't map that
>> need basic OSM mentoring?
>>
>>
>> Thanks!
>>
>> -Kate
>>
>>
>>
>>
>> On Mon, Aug 15, 2011 at 8:09 PM, Matt Amos  wrote:
>>>
>>> the "Engineering Working Group"[1] is a new group set up with the
>>> purpose of helping stimulate the development community surrounding
>>> OSM. you don't need to be a developer to join - we're interested in
>>> hearing from everyone and ideas are welcome. software enables us to do
>>> the mapping we love, and to make use of the data we collect, so making
>>> our software better is part of making OSM better.
>>>
>>> although IRC isn't a perfect communication medium it's easy to set up
>>> and it's interactive, so i've set one up over at #osm-ewg on OFTC.
>>> just to kick off, i suggest a meeting next week on:
>>>
>>>  Monday 22nd at 5pm GMT
>>>
>>> (5pm CEST, 6pm BST, 1pm east-US, 10am pacific-US)[2] to start the
>>> conversation.
>>>
>>> a very rough agenda for the meeting, including but certainly not limited
>>> to:
>>>  * what are the barriers-to-entry for new developers?
>>>  * are there small projects (perhaps mentored?) which new developers
>>> can take on?
>>>  * what can be done to improve the usefulness of communication between
>>> developers and users?
>>>
>>> hope to see you there,
>>>
>>> matt
>>>
>>> [1] it's not necessarily the best or most descriptive name, but it'll
>>> do for now. other suggestions include "Development Support Group" and
>>> "Software Working Group".
>>> [2] yeah, i know, but no matter what time is chosen it'll be bad for
>>> someone. i suggest we start with this and i'm open to suggestions
>>> about rotating the time or setting up other communication channels
>>> which are less time-zone sensitive.
>>>
>>> ___
>>> dev mailing list
>>> dev@openstreetmap.org
>>> http://lists.openstreetmap.org/listinfo/dev
>>>
>>
>>
>
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] OSMF Engineering Working Group inauguration

2011-08-15 Thread Matt Amos
the "Engineering Working Group"[1] is a new group set up with the
purpose of helping stimulate the development community surrounding
OSM. you don't need to be a developer to join - we're interested in
hearing from everyone and ideas are welcome. software enables us to do
the mapping we love, and to make use of the data we collect, so making
our software better is part of making OSM better.

although IRC isn't a perfect communication medium it's easy to set up
and it's interactive, so i've set one up over at #osm-ewg on OFTC.
just to kick off, i suggest a meeting next week on:

  Monday 22nd at 5pm GMT

(5pm CEST, 6pm BST, 1pm east-US, 10am pacific-US)[2] to start the conversation.

a very rough agenda for the meeting, including but certainly not limited to:
 * what are the barriers-to-entry for new developers?
 * are there small projects (perhaps mentored?) which new developers
can take on?
 * what can be done to improve the usefulness of communication between
developers and users?

hope to see you there,

matt

[1] it's not necessarily the best or most descriptive name, but it'll
do for now. other suggestions include "Development Support Group" and
"Software Working Group".
[2] yeah, i know, but no matter what time is chosen it'll be bad for
someone. i suggest we start with this and i'm open to suggestions
about rotating the time or setting up other communication channels
which are less time-zone sensitive.

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] JSON-output for xapi

2011-06-26 Thread Matt Amos
On Sun, Jun 26, 2011 at 6:17 PM, Ian Dees  wrote:
> I'm not sure it's specified anywhere, but it is customary (and many tools
> expect) to receive a sorted OSM file: nodes first, followed by ways,
> followed by relations. This makes constructing an in-memory model of the
> data easier on the client side.

indeed. the rails port generates XML with the nodes, ways and
relations in order and cgimap is designed to be compatible with that,
in both XML and JSON outputs. i figure it also makes parsing the data
easier, which is probably more important for JSON as the format is
often imported directly into working data structures in the browser.

have you considered using temporary tables in the database to order
the data so that it can be streamed? this is how cgimap (and, iirc,
jxapi?) solve the issue of building up search results in a performant
manner that can be streamed while also being correctly ordered.

cheers,

matt

> On Sun, Jun 26, 2011 at 12:05 PM, Dario Brandes  wrote:
>>
>> hi matt,
>>
>> the main difference between our solutions is that you sort the data in
>> category's (nodes, ways, relations). We have only one list, in
>> which this elements are unsorted. This asynchronism is necessary to
>> keep our good server performance, because the output-module does not
>> have to await for all the data. It can start the output stream before.
>> What's your point of view? Do you really need this type of sorting?
>>
>> I hope we can find a solution, to make all happy :)
>>
>> Regards Dario
>>
>>
>> On Thu, 16 Jun 2011 19:22:54 +0100
>> Matt Amos  wrote:
>>
>> > hi dario,
>> >
>> > there's experimental support in cgimap for JSON format, which will
>> > hopefully get enabled at 0.7 time (whenever that is). i've attached an
>> > example of the output in both JSON and XML for comparison. it looks
>> > extremely similar to the examples you've included below, so hopefully
>> > we can work together on this. :-)
>> >
>> > cheers,
>> >
>> > matt
>> >
>> > On Thu, Jun 16, 2011 at 5:04 PM, Dario Brandes 
>> > wrote:
>> > > Hello everyone,
>> > >
>> > > we from the xappy.js Team want talk with you about a JSON Standard
>> > > for osm. The GeoJSON Format is inapt to safe relations and so we
>> > > created something new.
>> > > Here the link to our wikipage:
>> > > https://github.com/slomo/osm-spline-xapi/wiki/JSON-Output
>> > >
>> > > We would like to hear your comments about our proposal.
>> > >
>> > > Regards Dario
>> > >
>> > >
>> > >
>> > > # JSON output format definition by example.
>> > >
>> > > **Note:** for simple reading we included whitespaces which are of
>> > > course omitted in the actual implementation.
>> > >
>> > > ## skeleton
>> > >    {
>> > >        "version": 0.6,
>> > >        "generator": "xappy.js",
>> > >        "xapi": {
>> > >             "uri": "XXX",
>> > >             "planetDate": 201106161601,
>> > >             "copyright": "XXX",
>> > >             "instance": "XXX"
>> > >         },
>> > >        "elements": [
>> > >            ...
>> > >            ...
>> > >            ...
>> > >        ]
>> > >    }
>> > >
>> > > where the `elements` array obviously contains all the elements.
>> > >
>> > > ## node
>> > >    {
>> > >        "type": "node",
>> > >        "id": 3596186,
>> > >        "lat": 53.4633699598014,
>> > >        "lon": -2.22667910006381,
>> > >        "timestamp": "2007-06-21T17:10:58+01:00",
>> > >        "version": 2,
>> > >        "changeset": 2213,
>> > >        "tags": [
>> > >            "amenity": "hospital",
>> > >            "name": "Manchester Royal Infirmary"
>> > >        ]
>> > >    }
>> > >
>> > > ## way
>> > >    {
>> > >         "type": "way",
>> > >         "id": 4958218,
>> > >         "version": 3,
&g

Re: [OSM-dev] openid

2011-06-18 Thread Matt Amos
On Sat, Jun 18, 2011 at 3:57 PM, Manuel Reimer
 wrote:
> Another question about openid: Is it possible to use it to, for example, to
> let an editor get access to OSM or is there still a "regular" OSM profile
> needed?

no, an OSM account is needed, although of course the user can choose
to sign up with OpenID if they want. the editor can then use OAuth to
access the OSM API.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] JSON-output for xapi

2011-06-16 Thread Matt Amos
hi dario,

there's experimental support in cgimap for JSON format, which will
hopefully get enabled at 0.7 time (whenever that is). i've attached an
example of the output in both JSON and XML for comparison. it looks
extremely similar to the examples you've included below, so hopefully
we can work together on this. :-)

cheers,

matt

On Thu, Jun 16, 2011 at 5:04 PM, Dario Brandes  wrote:
> Hello everyone,
>
> we from the xappy.js Team want talk with you about a JSON Standard for
> osm. The GeoJSON Format is inapt to safe relations and so we created
> something new.
> Here the link to our wikipage:
> https://github.com/slomo/osm-spline-xapi/wiki/JSON-Output
>
> We would like to hear your comments about our proposal.
>
> Regards Dario
>
>
>
> # JSON output format definition by example.
>
> **Note:** for simple reading we included whitespaces which are of
> course omitted in the actual implementation.
>
> ## skeleton
>    {
>        "version": 0.6,
>        "generator": "xappy.js",
>        "xapi": {
>             "uri": "XXX",
>             "planetDate": 201106161601,
>             "copyright": "XXX",
>             "instance": "XXX"
>         },
>        "elements": [
>            ...
>            ...
>            ...
>        ]
>    }
>
> where the `elements` array obviously contains all the elements.
>
> ## node
>    {
>        "type": "node",
>        "id": 3596186,
>        "lat": 53.4633699598014,
>        "lon": -2.22667910006381,
>        "timestamp": "2007-06-21T17:10:58+01:00",
>        "version": 2,
>        "changeset": 2213,
>        "tags": [
>            "amenity": "hospital",
>            "name": "Manchester Royal Infirmary"
>        ]
>    }
>
> ## way
>    {
>         "type": "way",
>         "id": 4958218,
>         "version": 3,
>         "timestamp": "2007-07-25T01:55:35+01:00",
>         "changeset": 2211,
>         "nodes": [
>             218963,
>             331193
>         ],
>         "tags":[
>             "landuse": "residential",
>             "source": "landsat"
>         ]
>     }
>
> ## relation
>    {
>        "type": "relation",
>        "id": 2670,
>        "timestamp": "2007-10-25T03:05:34Z",
>        "version": 32,
>        "changeset": 2211,
>        "members": [
>            {
>                "type":"way",
>                "ref":3992472,
>                "role": ""
>            },
>            {
>                "type":"way",
>                "ref":3992524,
>                "role": ""
>            }
>        ...
>      ],
>        "tags":[
>            "name": "Fonnereau Way",
>            "network": "Ipswich foothpaths",
>            "type": "route"
>        ]
>    }
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>
>


foo.osm.json
Description: application/json


 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
 
 
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
  
 
 
  
  
  
  
  
  
 

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] rails_port seg. fault

2011-05-21 Thread Matt Amos
On Tue, May 17, 2011 at 9:05 PM, Jaroslaw Wozny
 wrote:
> Hi,
>
>> interesting. i've just done a test with the latest git code (e120e59)
>> and got no errors. unfortunately, my setup is different from yours:
>
> I have also the latest (head version e120e59).
>
>>> 187 tests, 2413 assertions, 3 failures, 0 errors
>
>> what's your log/test.log? are you using the postgres extensions in
>> db/functions/?
>
> Ok. I have installed extra functions for PG and no fails on tests. But I
> have still problem with seg fault. Problem is repeated by two independent
> people on Ubuntu 11.04 (two different machines). Import is performed by
> bulk_upload.py. :(
>
> Maybe something is specific in bulk_upload.py that shows problem during
> import? I don't know.

yes, it appears to be. i was able to reproduce both the segfault you
reported and the long-looked-for infinite loop bug, both with
bulk_upload.py! unfortunately, the segfault seems to be happening
because the libxml data structure is being reclaimed before the ruby
object wrapping it is dead, and it's very difficult to debug stuff
across that ruby/libxml interface.

however, there is good news: i've updated rails_port to use
libxml-ruby 2.0.5 and it seems much more robust - i haven't been able
to get it to segfault. it'll be in git master soon, but if you'd like
to try it out then a patch is attached. i'd like to know how you get
on with it and whether it has any problems.

cheers,

matt
From 5472095287a651d9915805ab7bd3deadfe8d6c3d Mon Sep 17 00:00:00 2001
From: Matt Amos 
Date: Sat, 21 May 2011 15:15:49 +0100
Subject: [PATCH] Updated to libxml-ruby 2.0.5 and fixed code accordingly.

I've tested this through the unit tests and by hitting it with
bulk_upload.py and neither fail or cause the server to crash or
go into an infinite loop. Both of these things happened randomly
with 1.1.3/4 due to an apparent early deregistration of the
expanded nodes.
---
 config/environment.rb |2 +-
 lib/diff_reader.rb|   19 ++-
 2 files changed, 19 insertions(+), 2 deletions(-)

diff --git a/config/environment.rb b/config/environment.rb
index 58d9432..d5cee81 100644
--- a/config/environment.rb
+++ b/config/environment.rb
@@ -18,7 +18,7 @@ Rails::Initializer.run do |config|
   unless  STATUS == :database_offline
 config.gem 'composite_primary_keys', :version => '2.2.2'
   end
-  config.gem 'libxml-ruby', :version => '~> 1.1.1', :lib => 'libxml'
+  config.gem 'libxml-ruby', :version => '>= 2.0.5', :lib => 'libxml'
   config.gem 'rmagick', :lib => 'RMagick'
   config.gem 'oauth', :version => '>= 0.4.3'
   config.gem 'oauth-plugin', :version => '0.3.14'
diff --git a/lib/diff_reader.rb b/lib/diff_reader.rb
index 0f0492f..de2da3c 100644
--- a/lib/diff_reader.rb
+++ b/lib/diff_reader.rb
@@ -20,6 +20,10 @@ class DiffReader
   def initialize(data, changeset)
 @reader = XML::Reader.string(data)
 @changeset = changeset
+# document that's (re-)used to handle elements expanded out of the
+# diff processing stream.
+@doc = XML::Document.new
+@doc.root = XML::Node.new("osm")
   end
 
   ##
@@ -85,8 +89,21 @@ class DiffReader
   model = MODELS[model_name]
   raise OSM::APIBadUserInput.new("Unexpected element type #{model_name}, " +
  "expected node, way or relation.") if model.nil?
-  yield model, @reader.expand
+  # new in libxml-ruby >= 2, expand returns an element not associated 
+  # with a document. this means that there's no encoding parameter,
+  # which means basically nothing works.
+  expanded = @reader.expand
+
+  # create a new, empty document to hold this expanded node
+  new_node = @doc.import(expanded)
+  @doc.root << new_node
+
+  yield model, new_node
   @reader.next
+
+  # remove element from doc - it will be garbage collected and the
+  # rest of the document is re-used in the next iteration.
+  @doc.root.child.remove!
 end
   end
 
-- 
1.7.4.1

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] rails_port seg. fault

2011-05-16 Thread Matt Amos
On Mon, May 16, 2011 at 9:49 PM, Jaroslaw Wozny
 wrote:
>
>> libxml-ruby (1.1.4) should be 1.1.3
>
> Hi,
>
> Reinstalled and I have got similar problem.
>
> Error:
> 2011-05-16 22:39:13.659425 #13825]
> ERROROAuth::Signature::UnknownSignatureMethod
> [2011-05-16 22:39:13.661038 #13825] Filter chain halted as [:authorize]
> rendered_or_redirected.
> [2011-05-16 22:39:13.661322 #13825] Completed in 39ms (View: 1, DB: 1) | 401
> Unauthorized [http://127.0.0.1/api/0.6/changeset/1/upload]
> /home/jarekw/rubygems-1.3.7/rails/app/models/node.rb:82: [BUG] Segmentation
> fault
> ruby 1.8.7 (2010-08-16 patchlevel 302) [x86_64-linux]
>
> My list of gems:
> *** LOCAL GEMS ***
>
> actionmailer (2.3.8)
> actionpack (2.3.8)
> activerecord (2.3.8)
> activeresource (2.3.8)
> activesupport (2.3.8)
> httpclient (2.2.0.2)
> i18n (0.5.0)
> libxml-ruby (1.1.3)
> nokogiri (1.4.4)
> oauth (0.4.4)
> oauth-plugin (0.3.14)
> pg (0.11.0)
> rack (1.1.2)
> rails (2.3.8)
> rake (0.8.7)
> rmagick (2.13.1)
> sanitize (2.0.1)
> SystemTimer (1.2.3)
> timecop (0.3.5)

interesting. i've just done a test with the latest git code (e120e59)
and got no errors. unfortunately, my setup is different from yours:

$ ruby -v
ruby 1.8.7 (2011-02-18 patchlevel 334) [x86_64-linux]

$ gem list

*** LOCAL GEMS ***

actionmailer (2.3.8)
actionpack (2.3.8)
activerecord (2.3.8)
activeresource (2.3.8)
activesupport (2.3.8)
gash (0.1.3)
gemcutter (0.7.0)
grancher (0.1.5)
haml (2.2.24)
hanna (0.1.12)
httpclient (2.2.0.2)
i18n (0.5.0)
json (1.5.1)
libxml-ruby (2.0.3, 1.1.3)
nokogiri (1.4.4)
oauth (0.4.4)
oauth-plugin (0.3.14)
open4 (0.9.6)
pg (0.11.0)
rack (1.1.2)
rails (2.3.8)
rake (0.8.7)
rake-compiler (0.7.8)
rake-rubygems (0.2.0)
rake-tasks (0.2)
rdoc (2.3.0)
rmagick (2.13.1)
sanitize (2.0.1)
SystemTimer (1.2.3)
test-unit (2.3.0)
test-unit-ext (0.5.0)
timecop (0.3.5)
wirble (0.1.3)

and i'm (still) running ubuntu 10.10.

> Note:
> I executed tests and it shows 3 problems:
> 1) Failure:
> test_changes_simple(ApiControllerTest)
> [/test/functional/api_controller_test.rb:207]:
> Expected response to be a <:success>, but was <500>
> <"">
>
>  2) Failure:
> test_changes_zoom_valid(ApiControllerTest)
>   [/test/functional/api_controller_test.rb:231:in `test_changes_zoom_valid'
>    /test/functional/api_controller_test.rb:229:in `upto'
>    /test/functional/api_controller_test.rb:229:in
> `test_changes_zoom_valid']:
> Expected response to be a <:success>, but was <500>
> <"">
>
>  3) Failure:
> test_hours_valid(ApiControllerTest)
>   [/test/functional/api_controller_test.rb:260:in `test_hours_valid'
>    /test/functional/api_controller_test.rb:258:in `upto'
>    /test/functional/api_controller_test.rb:258:in `test_hours_valid']:
> Expected response to be a <:success>, but was <500>
> <"">
>
> 187 tests, 2413 assertions, 3 failures, 0 errors

what's your log/test.log? are you using the postgres extensions in
db/functions/?

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Filename extensions for OSM files

2011-05-11 Thread Matt Amos
On Wed, May 11, 2011 at 12:23 PM, Peter Körner  wrote:
> Am 11.05.2011 13:17, schrieb Matt Amos:
>>>
>>> We should probably introduce an attribute on the  tag that says that
>>> this is a history file. Just as we are currently discussing for the PBF
>>> version.
>>
>> while we're at it, let's have a flag to indicate if the elements are
>> sorted by ID, how many prime-numbered elements there are, whether
>> there was a full moon when it was generated and what the colour of the
>> file is. i'm up for all of that as long as it's blue.
>
> history files have similar syntax as normal osm files, but very different
> semantics. A program processing a history file needs to be aware or it
>  - having multiple objects with the same id
>  - having objects that stop being existent in some point in time
>  - that have an extra information attached
>
> that way it is useful for a program to know, if a file fed into it via stdin
> will be processed fine right from the start. nobody likes processes that
> fail after 10 hours because of a missing --this-is-an-history-file flag.

it'll be far less than 10 hours before the first element with
visible="false" or multiple versions comes through. in fact, i bet
it'd happen within the first 10 lines.

all OSM files are potentially history files (e.g:
http://osm.org/api/0.6/node/1/history) - they all have the visible and
version attributes on all elements, despite that not being needed for
dealing with a snapshot. some programs may make assumptions about the
contents of OSM files, but that's for the program in question to
assert, and for the user to be aware of.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Filename extensions for OSM files

2011-05-11 Thread Matt Amos
On Wed, May 11, 2011 at 11:26 AM, Jochen Topf  wrote:
> On Wed, May 11, 2011 at 11:13:37AM +0200, Peter Körner wrote:
>> Am 10.05.2011 22:31, schrieb David Paleino:
>> >On Tue, 10 May 2011 20:58:40 +0200, Peter Körner wrote:
>> >
>> >>Frederik mentioned mime-types, I'd suggest (not knowing if I'd break any
>> >>convention using multiple + symbols):
>> >>
>> >>application/x-osm+xml for .osm
>> >>[..]
>> >
>> >I'd suggest we should also implement some magic(5) lines for file(1).
>>
>> Well, that would only work if you could differentiate between osm,
>> osh and osc files by looking at the first lines.
>>
>> osm-files have an -tag, osc-files start with > ../> but osh-files are, currently, only osm-files with a
>> visible-attribute on each entity - so no way to distinguish them
>> from osm-files.
>
> We should probably introduce an attribute on the  tag that says that
> this is a history file. Just as we are currently discussing for the PBF
> version.

while we're at it, let's have a flag to indicate if the elements are
sorted by ID, how many prime-numbered elements there are, whether
there was a full moon when it was generated and what the colour of the
file is. i'm up for all of that as long as it's blue.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] how long does it take to generate the full-history dump?

2011-05-04 Thread Matt Amos
the last one took about 60 hours  - 2011-04-18 00:00 is the time
according to the file name, 2011-04-20 12:11 is the file timestamp.
that's not completely accurate, but is probably to within an hour or
two.

note that the full planet dump is not optimised for speed - nor would
we want it to be, as the more resources are devoted to the dump, the
fewer are available to editors - the availability of minutely / hourly
replication dumps makes it kinda pointless anyway.

cheers,

matt

On Tue, May 3, 2011 at 10:06 PM, Manchun Yao  wrote:
> Hi Peter,
>
> I am not using the tool. I just want to get a general idea about how long it 
> take to produce the full-history dump.
>
> -Manchun
> -Original Message-
> From: Peter Körner [mailto:osm-li...@mazdermind.de]
> Sent: Tuesday, May 03, 2011 1:27 PM
> To: Manchun Yao
> Cc: dev@openstreetmap.org
> Subject: Re: [OSM-dev] how long does it take to generate the full-history 
> dump?
>
> Am Dienstag, den 03.05.2011, 20:14 + schrieb Manchun Yao:
>> How many hours does it take to generate the full-history dump using
>> the historydump tool?
>
> Can't you look into the result-file using the tools tail or less? So you can 
> check how much of the dump has already been written.
>
> Peter
>
>
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Using osmosis to generate a full-planet.osm file

2011-04-10 Thread Matt Amos
On Thu, Apr 7, 2011 at 8:30 AM, Frederik Ramm  wrote:
> Scott,
>
> On Wed, 6 Apr 2011 22:06:57 -0500
>> I believe the osmosis database representation for a planet is not the
>> same as what is used by osm servers. OSM servers use a special dump
>> program for generating planets.
>
> I think Eric was talking about the history planets,
> not the normal planet files. Neither of these is created with Osmosis
> as far as I know but I'm not sure what software is used for the full
> history dumps.

the full history planets are created using lars francke's historydump [1] tool.

cheers,

matt

[1] https://bitbucket.org/lfrancke/historydump/overview

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] next London hack day

2011-04-04 Thread Matt Amos
thanks to everyone who came to the hack day last weekend in London [1]
- i had a lot of fun and i hope everyone else did too. :-)

i'd like to keep the ball rolling by asking if anyone's interested in
having another of these events in the summer, possibly around the end
of May or the start of June. i've set up a wiki page to collect
everyone's thoughts and availability [2] and hopefully we'll have the
same great mix of talks, development and social events.

happy hacking,

matt

[1] http://wiki.openstreetmap.org/wiki/London_Hack_weekend_April_2011
[2] http://wiki.openstreetmap.org/wiki/London_Hack_Weekend_Summer_2011

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OpenID for OpenStreetMap?

2011-02-10 Thread Matt Amos
On Thu, Feb 10, 2011 at 8:09 PM, Serge Wroclawski  wrote:
> On Thu, Feb 10, 2011 at 2:24 PM, Matt Amos  wrote:
>
>> i might be missing the point here, but are you suggesting that OSM
>> would be an openID provider, consumer or both?
>
> I'm suggesting that instead of taking on a technology, we take on a
> set of requirements, or needs, and address them. And from there I'm
> saying that OpenID itself is a side issue.

ok - so what's the problem, or "requirement" in corporate-speak, that
we're trying to address here?

>>> The second issue is OpenID support in general.
>>>
>>> MHO on the issue is that while it might seem that OpenID support is
>>> great and widely useful, I think there are a number of implementation
>>> issues and concerns which are really hard to overcome.
>>>
>>> For example, how does one handle OpenID with an API? This has been
>>> discussed in other contexts, and unfortunately there's no good single
>>> answer.
>>
>> my understanding is that openID would be usable on the website to
>> generate an OAuth token, which would then be used to access the API.
>> this seems like a good enough solution to me, but i could be missing
>> something.
>
> That sounds like it makes non-web based editors hard to develop.

nope. the user has to log into the website to authorize an OAuth token
- what does it matter whether they log in via the usual username and
password or OpenID? certainly it doesn't matter to the editor, even if
it's doing some sort of ugly hack like JOSM's "fully automatic" mode.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OpenID for OpenStreetMap?

2011-02-10 Thread Matt Amos
On Thu, Feb 10, 2011 at 7:11 PM, Serge Wroclawski  wrote:
> I think the conversation is really two different questions.
>
> First is the original question about OpenID in the context of OSM and
> the other is about OpenID support generally.
>
> I think part of the feeling for a need for OpenID in OSM is about the
> fact that for several of our services, authentication isn't unified.
> That is the wiki credentials don't map to the OSM API credentials.
> help.osm does though. It's confusing.

i might be missing the point here, but are you suggesting that OSM
would be an openID provider, consumer or both?

i had thought that the discussion was limited to being an openID
consumer - i'm not sure we'd want to be a provider, with all that it
entails.

> The second issue is OpenID support in general.
>
> MHO on the issue is that while it might seem that OpenID support is
> great and widely useful, I think there are a number of implementation
> issues and concerns which are really hard to overcome.
>
> For example, how does one handle OpenID with an API? This has been
> discussed in other contexts, and unfortunately there's no good single
> answer.

my understanding is that openID would be usable on the website to
generate an OAuth token, which would then be used to access the API.
this seems like a good enough solution to me, but i could be missing
something.

> I personally believe OpenID is a great idea, but as Tom and others
> point out, the devil's in the details.
>
> OTOH the former issue, of a unified authentication mechanism- that's
> not technically hard (for the most part)- that's just a lot of work
> and a lot of disruption to the community while the transition takes
> place.

the devil's in the details and the SMOP. for instance, case
sensitivity of usernames, allowable characters, length and other
conditions. iirc, this wasn't a big deal when the wiki was set up, and
many people signed up with the same username. but since then things
have drifted a little bit (including the slightly inadvertent switch
to case-sensitive usernames for the main site) and it seems like way
more trouble than it's worth to transition now.

cheers,

matt

> That is, I'm sure the technical team knows how to solve it, but as
> someone whose completed a user authentication merge, I can tell you
> that it's painful. It's especially painful on something like a wiki,
> where not only must a user remapping take place, but that user mapping
> must be applied to each and every page.
>
> I think for new services (like help), we've solved the issue, but we
> may have to either accept the pain of the current situation for the
> wiki, or get buy in from the community for the disruption any change
> would cause.
>
> - Serge
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] scaling

2011-01-11 Thread Matt Amos
On Tue, Jan 11, 2011 at 1:28 PM, Kai Krueger  wrote:
> What this shift of pattern means for scaling is imho not entirely clear. On
> the one hand more and large imports would likely put more burden on the
> write paths of the API, as imports don't need to download data first to edit
> it. On the other hand with the growing density of map data, each map call is
> likely becoming much more complex than it used to be, shifting the burden
> back towards read loads.

however, those imports probably *should* be doing some sort of reading
in order to ensure they're not duplicating or stomping on existing
data. the fact that they don't generally at the moment is (probably)
not a reflection on the speed of the API ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] scaling

2011-01-10 Thread Matt Amos
On Mon, Jan 10, 2011 at 11:09 PM, Frederik Ramm  wrote:
> Brett Henderson wrote:
>>
>> That's not ideal. What's the implication of this? What OSM code uses
>> temp tables?
>>
>> Osmosis uses them for extraction of minutely diffs. Are there others?
>
> Matt's cgimap code does.

it's possible to rewrite it so that it doesn't use temp tables, but
that would be at the cost of more complex queries or higher memory
usage. i presume that postgres doesn't currently allow it because it's
technically non-trivial, but beyond that i have no idea what the level
of difficulty is...

i had a quick look on the postgres mailing lists and didn't spot
anything obvious about it - is it worth bringing it up there and
seeing what their take on it is?

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] scaling

2011-01-10 Thread Matt Amos
On Mon, Jan 10, 2011 at 1:31 PM, Andy Allan  wrote:
> On Mon, Jan 10, 2011 at 12:44 PM, Robert Scott  wrote:
>> On Monday 10 January 2011, Kai Krueger wrote:
>>> Depending on how far you really want to scale, I think a lot of the
>>> necessary components are already in place. If none of the "out of the box"
>>> solutions such as the new Postgresql 9.0 replication mechanism  work, we
>>> would possibly get a fair distance by splitting out reads and writes onto
>>> separate db servers.
>>
>> This is how the postgres 9 replication works. The replicating servers become 
>> "hot standbys" which you can use for read requests. So in theory the read 
>> requests could be scaled quite easily once set up. Atomicity of the API 
>> would potentially suffer though.
>
> We need to be careful about what purposes we scale read requests. A
> lot of the read-load on the API is using the data for non-editing
> purposes (such as rendering or other 'cool' things that need /map
> calls) and this should be avoided[1]. The advantage of making sure
> that read-scaling for non-editing purposes is *not* via db-replication
> is that anyone (and everyone) can add their own servers to scale
> things out. For example, everything that currently fetches live data
> via the diffs at planet.openstreetmap.org (or a service that depends
> on them, like xapi, geofabrik extracts etc) is a Good Thing, and
> everything that currently fetches live read-only data via
> osm.org/api/0.6/map is a Bad Thing.
>
> It's a bad thing since there's only so much hardware the OSMF can
> own/host/run, but there's nothing stopping 50 other organisations
> running cool stuff based off of planet.openstreetmap.org

it seems postgres 9 isn't able to do temporary table creation on
hot-standby servers [1]. maybe this is something that'll be fixed in
future versions (but it doesn't seem to be on the TODO [2]) or maybe
it's an opportunity to put a bounty to good use.

cheers,

matt

[1] 
http://www.postgresql.org/docs/9.0/interactive/hot-standby.html#HOT-STANDBY-USERS
[2] http://wiki.postgresql.org/wiki/Todo

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Node stuck in the database?

2010-12-24 Thread Matt Amos
On Thu, Dec 23, 2010 at 10:15 PM, Phil! Gold  wrote:
> I'm not sure whether this is the right venue, but I was directed here from
> IRC.
>
> I can't upload a changeset (from JOSM); the upload seems to hang on node
> 37521665.  I've had a few problems with this set of changes and I finally
> dialed JOSM down to sending one object at a time.  Every other object
> seems to be uploaded perfectly well, but when it hits that node (which I
> moved), it prints
>
>    PUT http://api.openstreetmap.org/api/0.6/node/37521665...
>
> on the console, displays "Uploading node: '37,521,665' (id: 37,521,665)"
> in the upload progress dialog box, and just sits there like that until I
> cancel the upload.  I'm using JOSM 3739, though the past couple testing
> builds have all had the same issue, as does the current tested release,
> 3701.
>
> Occasionally, I've had things stall on nodes 37521667 and 37719724.
>
> This seems like a database issue, but could it be something in my upload
> that's causing problems?

there were a few dead processes holding locks in the database, mainly
changeset closes for some reason. i've killed these and the locks have
been released. could you please try again and let me know if you're
still able to reproduce the stall on any of those nodes?

cheers,

matt

> --
> ...computer contrarian of the first order... / http://aperiodic.net/phil/
> PGP: 026A27F2  print: D200 5BDB FC4B B24A 9248  9F7A 4322 2D22 026A 27F2
> --- --
> 'Twas brillig, and the slithy toves
>        Did gyre and gimble in the wabe.
> All mimsy were the borogroves
>        And the mome raths outgrabe.
>                       -- Lewis Carroll, "Jabberwocky"
>  --- --
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Compression types in PBF Format

2010-11-30 Thread Matt Amos
On Tue, Nov 30, 2010 at 8:41 PM, Stefan de Konink  wrote:
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA512
>
> Hi,
>
>
> Op 30-11-10 20:49, Frederik Ramm schreef:
>> Stefan de Konink wrote:
>>> This is the place for the 'too little, too late'. We are beyond the
>>> point of 'what' the bitstream should look like: you ought to handle
>>> what is defined now.
>>
>> This is not how we work in OSM. We don't have standards.
>
> For some reason we do, this is not a free form fight. And if we can
> change the API every week, I wonder why we are still at XML then.

because XML is a nearly human-readable, easy to explain and inspect
format. the same cannot be said of the PBF format, but then the
declared design goals of it were reduction in parsing time and file
size, not readability - and it achieves those goals superbly. however,
i think using it in the API wouldn't provide enough of a speedup,
limited by Amdahl's law, to offset the loss of those other benefits.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] full history dump 2010-10-22

2010-10-28 Thread Matt Amos
On Mon, Oct 25, 2010 at 8:20 PM, Marco Lechner - FOSSGIS e.V.
 wrote:
> Hi Matt,
>
> thank you. Is it just a more recent one or are there any changes in the
> schema?

it's just a more recent one for convenience. it should be identical to
the intervening replication diffs applied to the previous full history
planet.

cheers,

matt

> Marco
>
> Am 25.10.2010 21:03, schrieb Matt Amos:
>> there's a new full history file available:
>>
>> http://planet.openstreetmap.org/full-experimental/full-planet-101022.osm.bz2
>>
>> cheers,
>>
>> matt
>>
>> ___
>> dev mailing list
>> dev@openstreetmap.org
>> http://lists.openstreetmap.org/listinfo/dev
>>
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] full history dump 2010-10-22

2010-10-25 Thread Matt Amos
there's a new full history file available:

http://planet.openstreetmap.org/full-experimental/full-planet-101022.osm.bz2

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Data source for robot

2010-10-12 Thread Matt Amos
On Tue, Oct 12, 2010 at 6:20 PM, Jan Sandbrink  wrote:
> sorry, but what i see happening on the list a lot of times is, that people
> who just want to contribute somehow are smashed into the ground.

i welcome mr budny's contribution - as a tool for users to identify
and fix problems with the map. since the beginning of this thread the
advice has simply been "don't run a bot". as you rightly point out,
we've had bad experiences with people running bots and we'd rather it
didn't happen again.

i encourage mr budny to do his project and release the results (and
hopefully even the source code, but some universities have issues with
that), but *not* to automatically run them against the live database.
putting them up on a site, or making them visible on a map, would
allow them to be integrated by individual mappers with local knowledge
and without damaging any existing data.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Data source for robot

2010-10-12 Thread Matt Amos
On Tue, Oct 12, 2010 at 5:50 PM, Andrew  wrote:
> Matt Amos  gmail.com> writes:
>
>> of course, the best thing is that these automated edits never happen
>> at all, instead that tools are provided (like the geofabrik inspector,
>> keep-right or the duplicate nodes map) to help the community fix these
>> errors themselves. we need to start cracking down on these disruptive
>> and counter-productive bots.
>
> The duplicate nodes map is not the best example of this. People have sometimes
> jumped in and done heavy edits not based on local knowledge.

i totally agree, which is why the about page of the duplicate nodes
map says: "This means that each point marked on the map isn't
definitely an error, but it could be one" and the wiki page says "It
is important to note that, despite the design of the map, these are
not all necessarily errors, and should not all be 'fixed' by merging
nodes". no-one should be doing edits, especially on a large scale,
without a healthy dose of common sense and, preferably, local
knowledge.

> A bot that joined
> roads to other roads (and only to other roads) at county boundaries would
> actually be more constructive as we could then add a notice that this issue
> with TIGER roads has been fixed completely.

i don't think so - it might be more constructive to have a map which
only displayed duplicates between roads and other roads at county
boundaries so that US mappers could concentrate their efforts - in
fact, it's something i once ran and (time permitting) hope to develop
again. a bot, however, would always be less constructive than real
people looking at the data. see
http://matt.dev.openstreetmap.org/dupe_nodes/about.html#just_use_a_bot

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Data source for robot

2010-10-12 Thread Matt Amos
On Tue, Oct 12, 2010 at 3:48 PM, Peter Budny  wrote:
> Matt Amos  writes:
>
>> On Tue, Oct 12, 2010 at 11:56 AM, Serge Wroclawski  wrote:
>>> 3) The road to hell in OSM is paved with bot intentions.
>>>
>>> OSM has a long, negative history with bots. We have a very small
>>> number of good imports, and dozens (if not more) bad imports. Bad
>>> imports are so commonplace in OSM that within the OSM community, bots
>>> of any sort are discouraged, but especially any imports, and
>>> especially (as you appear to be proposing), merging existing data with
>>> imported data.
>>
>> +1
>>
>> bots and imports are also fantastic ways of washing away community
>> consensus for the ideas of a single person.
>
> http://wiki.openstreetmap.org/wiki/Interstate_Highway_relations
> http://wiki.openstreetmap.org/wiki/United_States_Numbered_Highway_Relations
> http://wiki.openstreetmap.org/wiki/Michigan/Highway_Relations
>
> Maybe I'm unclear what you mean by "consensus"?  Those look pretty
> agree-upon to me.

what i mean by "consensus" is "general agreement by a group of
people". but this doesn't mean everyone agrees and then bots go and
stomp over all the data. i mean the kind of consensus where each
individual mapper is editing in a way that makes sense to them and a
common system emerges. the very concept of a mass edit or bot is, by
definition, unilateral and not consensual.

secondly, just because something says "agreed" on the wiki doesn't
make it so. the wiki is for documentation and discussion of existing
tags. the real test of whether a scheme is "agreed" is whether it is
popularly-used - by many users, not by a small number of proponents
and bot-writers. have a look on tag[watch|info|stat] to see whether
these schemes are in use and encourage people to use them - provide
them tools and let them vote with their feet, rather than imposing a
unilateral "agreement" by bot.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Data source for robot

2010-10-12 Thread Matt Amos
On Tue, Oct 12, 2010 at 11:56 AM, Serge Wroclawski  wrote:
> 3) The road to hell in OSM is paved with bot intentions.
>
> OSM has a long, negative history with bots. We have a very small
> number of good imports, and dozens (if not more) bad imports. Bad
> imports are so commonplace in OSM that within the OSM community, bots
> of any sort are discouraged, but especially any imports, and
> especially (as you appear to be proposing), merging existing data with
> imported data.

+1

bots and imports are also fantastic ways of washing away community
consensus for the ideas of a single person.

there has been, for a very long time, a code of conduct for such
automated or mechanical edits (not just bots and imports, but also
mass edits with the usual editors) [1]. it's interesting that almost
no-one has followed the code of conduct, including logging their
activity on the log [2].

of course, the best thing is that these automated edits never happen
at all, instead that tools are provided (like the geofabrik inspector,
keep-right or the duplicate nodes map) to help the community fix these
errors themselves. we need to start cracking down on these disruptive
and counter-productive bots.

cheers,

matt

[1] http://wiki.openstreetmap.org/wiki/Automated_Edits/Code_of_Conduct
[2] http://wiki.openstreetmap.org/wiki/Automated_Edits/Log

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Fail to get OsmChange for some changesets

2010-10-09 Thread Matt Amos
On Sat, Oct 9, 2010 at 1:04 AM, Manchun Yao  wrote:
>> Yes, it's perfectly possible. Individual changes to the data can occur 
>> without the changeset being closed. This is completely by design, although 
>> many people are confused into thinking that changesets are atomic or delayed 
>> until they are closed or similar. They aren't, and that's deliberate. As for 
>> how likely it is to happen without someone trying to do it deliberately? I 
>> can't answer that.
>
> I can imagine this now.:
>        Suppose we start with:  node a, version 7; node b version 7
>        Two persons operate on the these two nodes in change set A and change 
> B respectively with the following sequence:
>        - A changes node a first: node a version 7 -> 8
>        - B changes node b: node b version 7->8
>        - A changes node b: node b version 8->9
>        - B changes node a: node a version 8->9
>        - A change set closes
>        - B change set closes
> So we have:
>        Change A:  node a, version 7->8, node b version 8->9
>        Change B:  node a, version 8->9, node b version 7->8
> Although change A closes before change B, if we applied change A first 
> without considering change B at all, there will be problems.
>
> If this is the case, will the accumulative replication process (applying osc 
> changes daily/minutely) ran into problems?

not in my experience. i've been running a tool consuming minutely
replication osc files since feb and i haven't seen these problems.
this is because the replication osc files do not wait to contain whole
changesets - since they're not atomically committed it wouldn't make
sense. however, the osc files do contain the changes which are applied
to the history tables on the level of database transactions rather
than changesets.

cheers,

matt

> -Manchun
> -Original Message-
> From: Andy Allan [mailto:gravityst...@gmail.com]
> Sent: Thursday, October 07, 2010 3:14 AM
> To: Manchun Yao
> Cc: dev@openstreetmap.org
> Subject: Re: [OSM-dev] Fail to get OsmChange for some changesets
>
> On Wed, Oct 6, 2010 at 5:59 PM, Manchun Yao  wrote:
>> Hi Andy,
>>
>> If all change sets were created and operated  strictly based on API 0.6, 
>> would the scenario you described (two change sets affecting different 
>> objects in non-sequential orders) ever happen?  Is the optimistic locking 
>> mechanism meant to avoid such things?  The problem might exist in the 
>> database because of historical reasons though.
>
> Yes, it's perfectly possible. Individual changes to the data can occur 
> without the changeset being closed. This is completely by design, although 
> many people are confused into thinking that changesets are atomic or delayed 
> until they are closed or similar. They aren't, and that's deliberate. As for 
> how likely it is to happen without someone trying to do it deliberately? I 
> can't answer that.
>
>> I indeed want to build the history of all the changes so that I can find the 
>> map at some time (for example, Jan. 1, 2008) before.  The full-planet dump 
>> files at (http://planet.openstreetmap.org/full-experimental) contain 
>> historical info. However, the file is a bit too big for my computer.  I was 
>> trying to test whether I can build the history from ground up for a small 
>> area by getting the individual change sets one by one. Looks like there are 
>> some issues because some "changes" were made before API 0.6 and I have 
>> problem to access them using 0.6 API calls.
>
> You could do it by collecting all the data that those changesets refer to, 
> and then "replay" all the data changes in the order the individual changes 
> occurred (not in the order that the changesets  were either opened or closed, 
> which I've explained isn't necessarily the correct order).
>
> The wider point is that the history dumps are, almost entirely, currently 
> unusable for most people. If anyone has any suggestions for solving this 
> problem, or wants to try making a toolchain to process the history dumps into 
> history extracts, that would be great. There was discussion previously on 
> adapting osmosis to handle history dumps, but I don't know if that resulted 
> in any code.
>
>> Do you have any suggestions about how to get these problematic change sets?  
>> Maybe someone with a database populated with the full-planet dump can get 
>> these change sets for me (145015, 4318130, 4318132, 4318136).
>
> Perhaps this is supported by one of the read-only api services? I'm not sure 
> if any of them (XAPI, ROMA, OWL etc) have access to enough history though.
>
> Cheers,
> Andy
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] OAuth - 401 Authorization Required (OAuth::Unauthorized)

2010-09-27 Thread Matt Amos
On Mon, Sep 27, 2010 at 4:24 PM, Christoph Bünte
 wrote:
> Ok, it seems to me, that I'am the only one having this problem. That makes me 
> think, that either I'm the only one who recently tried to connect an oauth 
> consumer with a provider. Or it is just a ruby issue, like some weeks ago, 
> when a new commit broke this functionality. Or, just to be complete, it is 
> just me having this problem at all, cause my code is somewhat faulty.
>
> Could anyone try the example code with your app? Cause this issue keeps us 
> from starting our beta phase and publish wheelmap to a broader audience.
>
> As always: Any help is highly appreciated.

it looks like oauth now needs the verification token from the callback
/ printed on the website when a token is authorised for OAuth 1.0a
support [1].

i've updated the example at
http://wiki.openstreetmap.org/wiki/OAuth/Examples to reflect this and
it now works for me. hopefully it'll work for you too.

cheers,

matt

[1] 
http://git.openstreetmap.org/?p=rails.git;a=commit;h=1c3a9ee62b7d1a0dc97d52b1a498be1339d49ebf

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] [Marble-devel] NOTICE: gazetteer.osm.org being retired

2010-09-04 Thread Matt Amos
Kai just showed me your demo running on dev, which is great - sorry I missed
that. The data looks pretty up-to-date, is it being updated weekly or more
often? Is it world-wide?

Cheers,

Matt

On Sep 4, 2010 6:55 PM, "Matt Amos"  wrote:

On Fri, Sep 3, 2010 at 10:14 AM, Nic Roets  wrote: > On
Thu, Sep 2, 2010 at 5:20 P...
are you saying that kernel.org can be unfriendly because the amount of
effort put in by each contributor is higher? i don't think that's the
right way to look at it - kernel.org doesn't need to showcase the
capabilities of linux because there are many other providers and
distributions doing this. the people who are using linux, for example
on an android phone, don't need to know about kernel.org, and don't
need to care. the people who are using ubuntu don't need to know about
kernel.org, since ubuntu provides almost all of their needs. in my
opinion it can be the same with osm.org - people can be using the data
(for example, on bing, mapquest or cloudmade) without ever needing to
see the osm.org site. those who are interested enough to want to
contribute can do it directly through the provider, or be directed to
osm.org.

> So we need to make the environment as > friendly as possible.
i wouldn't disagree with you there, and providing routing on the main
page might make the experience slightly nicer, but it comes with a
cost to OSM in terms of developing, integrating and running the
service which is non-zero. anyone who's interested in getting a
service on the front page has to make it as friendly as possible, not
just to the users, but also to the admins.

my suggestion for getting anything integrated with the main site is to
set up a patched rails port* and your running service on dev.osm.org,
announce it and keep it up to date for a while. when people have been
using it, and it's clear that it's stable and what computational and
admin requirements it has, it has a much higher chance of being
integrated into the main page.

cheers,

matt

* it has to be a rails port and not a static page. either integrating
it into rails port is easy (in which case you can do it - it's easy)
or it's hard (in which case there's a big barrier to getting it
adopted, so you should do it to make it easier to integrate).
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] [Marble-devel] NOTICE: gazetteer.osm.org being retired

2010-09-04 Thread Matt Amos
On Fri, Sep 3, 2010 at 10:14 AM, Nic Roets  wrote:
> On Thu, Sep 2, 2010 at 5:20 PM, Ævar Arnfjörð Bjarmason 
> wrote:
>>
>> > You'd want me to spend $12 a day to provide geolocalisation for an
>> > OSM editor (if you didn't read the thread, I remind you I'm speaking
>> > of Merkaartor)!!??
>>
>> What sort of crazy hosting costs $360 per month (asking Nic Roets)? I
>
> I said $12 a day for a server that can provide a decent routing service.
> (EC2 High Memory Instance).
>
> The kernel.org analogy is not really valid. They don't call themselves the
> "wiki kernel". When I hack a kernel for an embedded project running on
> specialized hardware, they certainly are interested in my changes.
>
> By contrast, we will accept users who come in and make a few changes,
> spending as little as 10 minutes.

are you saying that kernel.org can be unfriendly because the amount of
effort put in by each contributor is higher? i don't think that's the
right way to look at it - kernel.org doesn't need to showcase the
capabilities of linux because there are many other providers and
distributions doing this. the people who are using linux, for example
on an android phone, don't need to know about kernel.org, and don't
need to care. the people who are using ubuntu don't need to know about
kernel.org, since ubuntu provides almost all of their needs. in my
opinion it can be the same with osm.org - people can be using the data
(for example, on bing, mapquest or cloudmade) without ever needing to
see the osm.org site. those who are interested enough to want to
contribute can do it directly through the provider, or be directed to
osm.org.

> So we need to make the environment as
> friendly as possible.

i wouldn't disagree with you there, and providing routing on the main
page might make the experience slightly nicer, but it comes with a
cost to OSM in terms of developing, integrating and running the
service which is non-zero. anyone who's interested in getting a
service on the front page has to make it as friendly as possible, not
just to the users, but also to the admins.

my suggestion for getting anything integrated with the main site is to
set up a patched rails port* and your running service on dev.osm.org,
announce it and keep it up to date for a while. when people have been
using it, and it's clear that it's stable and what computational and
admin requirements it has, it has a much higher chance of being
integrated into the main page.

cheers,

matt

* it has to be a rails port and not a static page. either integrating
it into rails port is easy (in which case you can do it - it's easy)
or it's hard (in which case there's a big barrier to getting it
adopted, so you should do it to make it easier to integrate).

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] NOTICE: gazetteer.osm.org being retired

2010-08-30 Thread Matt Amos
On Mon, Aug 30, 2010 at 10:05 PM, Nic Roets  wrote:
> On Mon, Aug 30, 2010 at 10:05 PM, Matt Amos  wrote:
>>
>> > Is there a list available of APIs and services that OSM provides which
>> > are
>> > considered "enterprise ready"? In terms of  "enterprise ready" I'd
>> > expect a
>> > few commitments to be made, most importantly:
>>
>> i don't think any of the services OSM provides are "enterprise ready".
>> OSM runs these services as aids to mapping and to show what can be
>> done with the data. all of the software used to run OSM services is
>> free, so any enterprise needing those services can run them itself.
>
> Matt, my calculations show that running such services is an extremely cheap
> way of improving the OSM database. Serving a query can cost as little as
> 0.1 cent. So if there is just a one percent chance that the data
> underlying the query is wrong and if only 1 in 100 users actually decides to
> fix the data, then the cost to fix one mistake is still less than one cent.
> Compare that to the fuel cost of someone driving just one block to capture
> an unnamed street.

absolutely. but you're missing the opportunity cost of using that
hardware to run other services which might work better, or give new
abilities. note that no-one is saying that OSM should do without a
geocoding server - nominatim is that service and (to my knowledge)
there are no plans to retire it. it's simply that namefinder, as a
secondary, older, geocoder takes some hardware to run that could be
put to better purposes.

>> alternatively, a commercial service from one of the companies selling
>> OSM-based services might be willing to give an "enterprise" guarantee.
>
> Let me just mention to Torsten that you have conflict of interest here: Your
> employer is such a company.

not since november - the london development team was let go due to
lack of funding. for the last 9 months my employer is someone entirely
unrelated to OSM.

> Torsten, I think the option you would be more interested in would be to run
> the software on your own servers, or to collaborate with the providers of
> such services that use open source software.

which is pretty much what i was saying, thanks ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] NOTICE: gazetteer.osm.org being retired

2010-08-30 Thread Matt Amos
On Mon, Aug 30, 2010 at 4:33 PM, Torsten Rahn  wrote:
> Would there be a way of leaving nominatim support intact via the given API
> just with more limited capabilities (so that it still works with reduced
> quality and taking less hardware resources)?

nominatim will continue to work and if someone were to write a
forwarding service which talks namefinder protocol, it could provide a
stop-gap.

> Is there a low traffic OSM announcement list that provides announcements like
> these for people who rely on OpenStreetMap services?

there is an announce list, but for development announcements like this
one, this dev list is the best place.

> Is there a list available of APIs and services that OSM provides which are
> considered "enterprise ready"? In terms of  "enterprise ready" I'd expect a
> few commitments to be made, most importantly:

i don't think any of the services OSM provides are "enterprise ready".
OSM runs these services as aids to mapping and to show what can be
done with the data. all of the software used to run OSM services is
free, so any enterprise needing those services can run them itself.
alternatively, a commercial service from one of the companies selling
OSM-based services might be willing to give an "enterprise" guarantee.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] preconditions_ok? in app/models/relation.rb and way.rb

2010-08-23 Thread Matt Amos
On Sun, Aug 22, 2010 at 5:24 PM, Mitja Kleider  wrote:
> Richard Fairhurst wrote:
>> I think you're misreading how the relation integrity checks work. The
> API
>> only checks whether an entity is still visible when the entity is
> _added_
>> to a relation.
>> For performance reasons, it does not check the entities
>> formerly in the relation.
>
> Thanks for clarifying. That is probably the cause.
>
>
>> It assumes that they're ok, which in the case of historic
>> (pre-integrity check) data may not be the case. See preconditions_ok? in
>> app/models/relation.rb in the Rails port for the code.
>
> When were the integrity checks introduced?

many of them were in-place before API 0.6 went live on 2009-04-21, but
0.6 (i think) included far more of them and in a more consistent
manner.

> Are you sure that checking only _added_ entities is enough? I read it like
> that: for performance reasons the checks are done when the entity is
> deleted (way node in the next case).

when a referenced entity is deleted it is checked for membership in
any way or relation and, if it's a member of another entity, the
deletion is disallowed. when a new entity is created, or elements
added to an existing entity, then those members are checked to ensure
they're not deleted. in a fully serialised model this should be enough
to prevent any referential integrity problems. but...

> Looking at the other example
> ("http://www.openstreetmap.org/api/0.6/way/60263972/1";), the deleted nodes
> contained in that way (e.g.
> "http://www.openstreetmap.org/api/0.6/node/338392109/5";) were deleted May
> 2010 (2010-05-26).
>
> In my understanding it still should not have happened, or am I missing
> something?

unfortunately, we don't have a fully serialised model and many
uploads, particularly from potlatch, execute in parallel. this leads
to the case above where the node is being deleted and the way created
at very similar times, but in different transactions. the timeline
probably looks like:

1. (transaction A) request is received for node deletion. checks in
software indicate that it isn't part of any ways or relations. the
node is deleted.
2. (transaction B) request is received for way creation. checks in
software indicate that the nodes are all not-deleted (since
transaction A hasn't committed yet). the way is created.
3. (transaction B) commits without problems, as the FK constraint is satisfied.
4. (transaction A) commits without problems.

since nodes are stored in the database with a "visible" flag to
indicate whether they're deleted or not, the FK constraint can't catch
this error. it's a similar story with relation members, as the current
schema can't be FK checked.

API 0.6 drastically improved things over 0.5 in its referential
integrity, but we didn't manage to catch all the problems. hopefully
we'll get them in 0.7. until then, when you find one of these it's
best just to fix it.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] How to clean up Negative IDs

2010-08-03 Thread Matt Amos
On Tue, Aug 3, 2010 at 7:28 PM, Eric Wolf  wrote:
> I assume there is something in the API that says "negative IDs == BAD". I've
> been trying to test that theory but keep hitting stumbling blocks. Postgres
> doesn't seem to want to let me defer integrity constraints, so my efforts to
> change a few IDs to positive values keeps failing. Maybe I've lost my SQL
> chops (or maybe I just can't do that as the "openstreetmap" database user).
> Am I barking up the right tree? Should I just go ahead and destroy the
> database and repopulate it using bulk_upload.py instead of osmosis?

all IDs in the database should be positive - negative IDs are only
used on the client side to indicate elements which need to be created
by the server (so that they can be referenced by other elements in the
file). if you have negative IDs then you should use an uploader
against an API, rather than directly inserting the data into the
database with osmosis.

a further complication is that postgres allocates new IDs from a
"sequence", which is separate from the table and needs to be kept
up-to-date after any direct-to-database import (not sure if osmosis
does this?), otherwise postgres may attempt to allocate duplicate
primary key IDs...

uploading using bulk_upload.py is probably safer, but much slower,
unfortunately...

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] relations refering to deleted objects

2010-05-23 Thread Matt Amos
On Sun, May 23, 2010 at 6:28 PM, Frederik Ramm  wrote:
> Mitja Kleider wrote:
>> Shouldn't the API reject a changeset creating such inconsistencies?
>
> It should. I haven't looked at your list but be aware that (contrary to
> what some people think) the planet dump does NOT possess referential
> integrity.

this should no longer be the case. recent planets (i can't remember
exactly when it changed) should be dumped with full integrity, so any
examples of inconsistencies that you find in the planet should also be
in the database. if they're not that's a bug in the dump process i'd
like to know about, please. :-)

as for the inconsistencies in the database, some pre-date API 0.6 and,
although efforts were made to clean it up when we migrated we
obviously missed some. there were also some created since then,
presumably due to bugs in the API. i'd be very interested if anyone
can find a reproducible way of creating these errors (on api06.dev,
please). my hunch is that it's something to do with concurrent editing
and insufficient locking, but i've never been able to pin it down.

the ultimate solution is to have the database enforce this referential
integrity, but that's a big enough change it'll have to wait until
0.7. in any case, these problems are (should be?) much, much rarer
than they were under 0.5, and if you encounter one please fix it by
removing deleted nodes from the ways that reference them or deleted
members from relations that reference them.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Planet OSM 'bad result during COPY'

2010-04-15 Thread Matt Amos
On Thu, Apr 15, 2010 at 8:01 PM, Seth Voltz
 wrote:
> My installation is CentOS 5.4 with the latest SVN (20912) osm2pgsql. 
> PostgreSQL 5.4.3 and postgis 1.3.6-1 were installed via Yum from the PGDG84 
> repository.

PostgreSQL 5.4.3? that seems awfully old. are you sure?

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Hourly/Daily Updates

2010-03-29 Thread Matt Amos
On Sat, Mar 27, 2010 at 9:31 PM, Simone Cortesi  wrote:
> 2010/3/27 Matt Amos :
>> i've cleaned up the wiki page describing the use of the diffs
>> http://wiki.openstreetmap.org/wiki/Planet.osm/diffs and please let me
>> know if you have any more questions so it can be further improved.
>
> What would be the best option for people wishing to keep minutely
> updated mapnik installation on their own server?

definitely the minute replicate diffs using the instructions on
http://wiki.openstreetmap.org/wiki/Minutely_Mapnik modified to use the
rrii and rri tasks rather than rcii and rci as described in Ldp's post
at the bottom of that page.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] [Talk-GB] OSM shortlinks problem

2010-03-29 Thread Matt Amos
On Mon, Mar 29, 2010 at 11:34 PM, Gregory  wrote:
> On 29 March 2010 15:15, Matt Amos  wrote:
>>
>> http://trac.openstreetmap.org/changeset/16271
>>
>> yeah, it was originally =, but changed to - to work with twitter.
>> given that the characters A-Z, a-z, 0-9, _, @, - and = have already
>> been used, what's the next best character? ~? +?
>
> Well I suggested that you remove problematic symbols, and just use A-Z, a-z,
> 0-9 if need be.

we're already using those symbols in a way that makes it difficult to
reuse them for trailing characters without breaking existing links.
if anyone can find a character which isn't in the list above, and
doesn't break at least one URL-detector out there, then that's the
best way to fix this.

the alternative is to use some other scheme, for example appending
.1/.2 instead of -/-- which makes some zoom levels a char longer. or
possibly pre-pending the dashes, since they're usually picked up in
the middle of an URL - but this makes char deletion from the end work
slightly differently.

it's also worth noting that, in the current scheme, shortlinks can end
in @ or _ - do these also break the URL detectors?

> Hmm, sorry Matt I see you only sent to Dev so might have missed the
> discussion on Talk-GB.

gmail seems to be clever enough to bring it into the thread. but i
figured this was a development discussion, so better to keep it here.

> It was said (if we have powers over Google, Microsoft, and other big e-mail
> applications) we tell them to change...
> It's also allowed behaviour. '-' is a valid URI character, so any user
> agent that chops them off the end of a URI is broken. I realise that
> doesn't help the people using them, but it would be better to fix the
> problem in the correct place.

sure, and if someone can find a char which works in more clients than
-, it's an easy fix.

> Robert Scott referenced RFC4648 base32 and said we could change it to =
> perhaps (changing it back and breaking twitter again!).

using = still works, so if that's working better then i guess a quick
fix is to edit it manually...

> Steve Doerr wondered if replacing it with %2D would solve it. I'm not sure
> at what point it would be best to do this (in the link or manually).

it might do, but the idea of shortlinks is that they're short and, if
possible, easy to type on an ascii keyboard. but it seems like a
difficult brief to fill. ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] [Talk-GB] OSM shortlinks problem

2010-03-29 Thread Matt Amos
http://trac.openstreetmap.org/changeset/16271

yeah, it was originally =, but changed to - to work with twitter.
given that the characters A-Z, a-z, 0-9, _, @, - and = have already
been used, what's the next best character? ~? +?

cheers,

matt

On Mon, Mar 29, 2010 at 10:52 AM, Gregory  wrote:
> Forwarding to dev as requested.
> I think it has been noticed already, it can also break in applications such
> as twitter. But is anything going to be changed?
> I understand avoiding the ending character from being non-alphanumeric(not
> letters or numbers) would mean a serious rewrite of how the shortlinks are
> made/decoded. How much longer would short links be if all digits were
> alphanumeric only?
> Also the code is public so people can create these themselves or decode
> them, has anyone actually done that?
> On 29 March 2010 02:34, Tom Chance  wrote:
>>
>> One for devs really but I'm not on the list. Perhaps someone could pass
>> this on?
>>
>> I've noticed that the osm.org shortlinks ending with a dash "-" often
>> don't work in emails. The dash gets missed off from the link. In my area, at
>> least, this seems to move the centre slightly and switch from zoom level 17
>> to 18.
>>
>> For example this:
>> http://osm.org/go/euut_VcMR-
>>
>> Gets interpreted as this:
>> http://osm.org/go/euut_VcMR
>>
>> Is there any chance the shortlinks could avoid ending in a punctuation
>> mark?
>>
>> Regards,
>> Tom
>>
>> --
>> http://tom.acrewoods.net   http://twitter.com/tom_chance
>>
>> ___
>> Talk-GB mailing list
>> talk...@openstreetmap.org
>> http://lists.openstreetmap.org/listinfo/talk-gb
>>
>
>
>
> --
> Gregory
> o...@livingwithdragons.com
> http://www.livingwithdragons.com
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Hourly/Daily Updates

2010-03-27 Thread Matt Amos
2010/3/26 Andreas Höschler :
> Hi all,
>
>>>     I'll leave the daily changesets running for now because they are
>>>     running with a much longer delay and shouldn't miss data.
>
> I need to incorporate changes into our map server and am currentlöy
> building on
>
>        http://planet.openstreetmap.org/daily
>
> and plan down make a wget on
>
>        wget http://planet.openstreetmap.org/daily/20100325-20100326.osc.gz
>
> once a day and then import that changes into our database. Is this
> save?

yes. obviously assuming you change the date each day ;-)

> Will  http://planet.openstreetmap.org/daily be maintained in the
> future.

i don't think there are any plans to disable it, but keep an eye on
the lists - it'll be announced here if the plans change.

> Hourly files would be even better for us. Is this maintained,
> if so where. It seems to be disabled right now!?

there are hourly replication diffs at
http://planet.openstreetmap.org/hour-replicate which are recommended
for hourly use.

i've cleaned up the wiki page describing the use of the diffs
http://wiki.openstreetmap.org/wiki/Planet.osm/diffs and please let me
know if you have any more questions so it can be further improved.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Inconsistency in database?

2010-03-04 Thread Matt Amos
Has anyone found any more of these? I had a look through the changesets, but
the only commonality I could find is they're all tagged as PL in live mode.
However 3 is a small sample size - it would be better to have more examples
to track down the bug. Or, even better, if anyone can find a repeatable
method for reproducing it.

Cheers,

Matt

On Mar 1, 2010 8:49 PM, "Frank Bielig"  wrote:

Hallo,

I found two more inconsistent ways:

45298547:
 http://api.openstreetmap.org/api/0.6/way/45298547
 http://api.openstreetmap.org/api/0.6/way/45298547/1

45770380:
 http://api.openstreetmap.org/api/0.6/way/45770380
 http://api.openstreetmap.org/api/0.6/way/45770380/4

44141527: (from previous mail)

http://api.openstreetmap.org/api/0.6/way/44141527

http://api.openstreetmap.org/api/0.6/way/44141527/2
The problem of inconsistency seems to affect more than one way. It might
be good idea to fix these special ways in database directly.

BR,
Frank

> Hallo, > > I just checked the way 44141527. Calling >
http://api.openstreetmap.org/api/0.6/way/4...
--
Frank Bielig   Tel:   +49 33398 687848
OneStepAhead AGFax:   +49 33398 687849
Research & Development Mobil: +49 177 3339868
Leuschnerstr. 45   eMail: frank.bie...@onestepahead.de
D-70176 Stuttgart  Web:   http://www.OneStepAhead.de

Firma: OneStepAhead AG
Firmensitz/Amtsgericht:Stuttgart, HRB 22686
Vorstand/Vorsitzender: Nihat Kücük
Aufsichtsratvorsitzender:  Prof. Dr. Reinhold von Schwerin

___ dev mailing list
dev@openstreetmap.org http://list...
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] osm2pgsql deltas

2010-01-18 Thread Matt Amos
On Mon, Jan 18, 2010 at 12:05 PM, Steve Hill  wrote:
> On Mon, 18 Jan 2010, John Smith wrote:
>
>> You can stuff the new diff files straight into osm2pgsql, it's all I'm
>> doing, I coded my own shell script to keep track of where my system is
>> up to and to pull new ones when they are available.
>
> I think thats what I'm going to have to do.  I can't figure out how to get
> Osmosis to work - it just sits there doing nothing when I fire it up with
> "--rri --simc --wxc blah.osc".  Even with "-v 9" on the commandline all I
> get is it sitting there saying "FINE: Reading current server state." and
> doing nothing more. :(

this is very odd... for me, it completes in about 6s after downloading
one hour's worth of changes and merging them. (i didn't change
maxInterval in configuration.txt, and that's the default). if you're
running under linux it would be very helpful if could you run osmosis
via strace and let us know what syscall it's in when it's hanging.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] osm2pgsql deltas

2010-01-16 Thread Matt Amos
On Sat, Jan 16, 2010 at 11:27 PM, Steve Hill  wrote:
> I've just realised that the hourly and minutely planet deltas have been
> replaced by the new transaction based formats, but I can't seem to find
> any documentation regarding their use with osm2pgsql.
>
> Can anyone provide any info on this?

for the things i'm using them for they seem to work fine with
osm2pgsql without any changes. if you're using osmosis to grab the
changes then you'll need to change the rcii/rci tasks to rrii/rri
tasks [1], though.

cheers,

matt

[1] http://wiki.openstreetmap.org/wiki/Osmosis/Detailed_Usage#Replication_Tasks

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2010-01-14 Thread Matt Amos
more here http://wiki.openstreetmap.org/wiki/Editor_usage_stats

cheers,

matt

On Sun, Jan 10, 2010 at 12:05 PM, Matt Amos  wrote:
> On Wed, Dec 16, 2009 at 4:20 PM, Matt Amos  wrote:
>> On Wed, Dec 16, 2009 at 11:25 AM, Nick Black  wrote:
>>> Wondered if there had been any update on these numbers from the latest 
>>> planet?
>
> after a christmas hiatus, here's the numbers from the 2010/01/06 planet:
>
>         editor          |   num   | num_data | num_users
> -+-+--+---
>  Potlatch                | 2123705 |  1607511 |     60405
>  JOSM                    | 2369729 |  2268500 |     13978
>  Merkaartor              |  269419 |   254868 |      2305
>  Mapzen POI Collector    |    4975 |     4901 |       505
>  Mapzen Beta             |    4537 |     3092 |       477
>  BigTinCan Upload Script |     642 |      537 |       181
>  iLOE                    |    2551 |     2355 |       134
>  osm2go                  |    1806 |     1740 |        99
>  Osmose Raw Editor       |    1879 |     1013 |        96
>  bulk_upload             |  124159 |   114749 |        66
>  osmtools                |   20108 |    18405 |        64
>  Vespucci                |    1273 |      892 |        62
>  Mapzen Alpha            |     666 |      370 |        37
>  andnav                  |     432 |      394 |        35
>  QGIS OSM v              |     270 |      234 |        32
>  OpenSeaMap-Editor-      |     168 |      155 |        22
>  PythonOsmApi            |    1892 |     1522 |        18
>  upload                  |   71282 |    68044 |        14
>  KMLManager              |   34886 |    34626 |         9
>  GpsMid_                 |     246 |      200 |         7
>
> cheers,
>
> matt
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2010-01-10 Thread Matt Amos
On Wed, Dec 16, 2009 at 4:20 PM, Matt Amos  wrote:
> On Wed, Dec 16, 2009 at 11:25 AM, Nick Black  wrote:
>> Wondered if there had been any update on these numbers from the latest 
>> planet?

after a christmas hiatus, here's the numbers from the 2010/01/06 planet:

 editor  |   num   | num_data | num_users
-+-+--+---
 Potlatch| 2123705 |  1607511 | 60405
 JOSM| 2369729 |  2268500 | 13978
 Merkaartor  |  269419 |   254868 |  2305
 Mapzen POI Collector|4975 | 4901 |   505
 Mapzen Beta |4537 | 3092 |   477
 BigTinCan Upload Script | 642 |  537 |   181
 iLOE|2551 | 2355 |   134
 osm2go  |1806 | 1740 |99
 Osmose Raw Editor   |1879 | 1013 |96
 bulk_upload |  124159 |   114749 |66
 osmtools|   20108 |18405 |64
 Vespucci|1273 |  892 |62
 Mapzen Alpha| 666 |  370 |37
 andnav  | 432 |  394 |35
 QGIS OSM v  | 270 |  234 |32
 OpenSeaMap-Editor-  | 168 |  155 |22
 PythonOsmApi|1892 | 1522 |18
 upload  |   71282 |68044 |14
 KMLManager  |   34886 |34626 | 9
 GpsMid_ | 246 |  200 | 7

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Querying changesets

2009-12-26 Thread Matt Amos
On Sat, Dec 26, 2009 at 8:48 PM, Karl Guggisberg
 wrote:
> Hmm, strange, now
>     http://api.openstreetmap.org/api/0.6/changesets?closed=true
> seems to hang (no response after 60s yet).
>      http://api.openstreetmap.org/api/0.6/changesets?open=true
> still works fine, though.
>
> Any hint what is going on? Is there maintenance work underway?

probably not, but there was a bug in the query for closed changesets
(which i've now fixed) which was causing postgres to scan the whole
changesets table, which would take a while. should be fixed after
r19215 is deployed, though.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] X_HTTP_METHOD_OVERRIDE

2009-12-18 Thread Matt Amos
On Fri, Dec 18, 2009 at 9:16 PM, hy-soft  wrote:
> Matt Amos wrote:
>> On Fri, Dec 18, 2009 at 2:27 PM, hy-soft  wrote:
>
>>> I can not get it to work, neither with curl
>>> which hasn't implemented a delete Method, so I tried
>>> X_HTTP_METHOD_OVERRIDE: DELETE
>>>
>>> curl -v  -G -H "X_HTTP_METHOD_OVERRIDE: DELETE" -T DOSM0134.XML
>>> "http://api.openstreetmap.org/api/0.6/node/595647268";
>>
>> that's not the command listed on the wiki page. the command listed
>> there (updated for delete) is:
>> curl -v -v -u username:password -d @node.osm -H
>> "X_HTTP_METHOD_OVERRIDE: DELETE"
>> "http://api06.dev.openstreetmap.org/api/0.6/node/1168959";
>
> Well I omitted user&password in the example above
> The command I sent did actually trigger an update of the node.
>
> Now when I use a post [option -d] as given in your example it works -
> with curl.
>
> But using POST with indy-sockets returns an Error 405: method not allowed.
>
> So thanks for clearing up the matter regarding curl, but actually I
> still can not use the api via indy-sockets.
>
> The header is being sent - but obviously not recognized then.

if the header is being sent then i don't see why that would be any
different from what curl is doing. do you have a trace of the
conversation between your app and the server to help debug what's
going wrong?

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] X_HTTP_METHOD_OVERRIDE

2009-12-18 Thread Matt Amos
On Fri, Dec 18, 2009 at 2:27 PM, hy-soft  wrote:
> Has anybody tried this workaround as descibed in the api page of the
> wiki: http://wiki.openstreetmap.org/index.php/OSM_Protocol_Version_0.6
> ?

it works fine for me. see
http://api06.dev.openstreetmap.org/browse/node/1168959/history which
is a node i created with JOSM and deleted using curl. :-)

> I can not get it to work, neither with curl
> which hasn't implemented a delete Method, so I tried
> X_HTTP_METHOD_OVERRIDE: DELETE
>
> curl -v  -G -H "X_HTTP_METHOD_OVERRIDE: DELETE" -T DOSM0134.XML
> "http://api.openstreetmap.org/api/0.6/node/595647268";

that's not the command listed on the wiki page. the command listed
there (updated for delete) is:
curl -v -v -u username:password -d @node.osm -H
"X_HTTP_METHOD_OVERRIDE: DELETE"
"http://api06.dev.openstreetmap.org/api/0.6/node/1168959";

where node.osm contains the node xml downloaded from
http://api06.dev.openstreetmap.org/api/0.6/node/1168959 modified to
reference the correct changeset, which i also opened with curl.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2009-12-16 Thread Matt Amos
for JOSM:
 lang  |  num   | num_users
---++---
 de| 491078 |  6457
 en| 240905 |  3086
   |  76081 |  2704
 fr|  91720 |   919
 en_GB |  55408 |   852
 ru|  45467 |   527
 it|  34936 |   357
 es|  21888 |   248
 nl|  10850 |   177
 fi|  18956 |   159
 sv|   8482 |   149
 cs|  12716 |   144
 pl|   7006 |   126
 ja|   9313 |66
 da|   3892 |63
 sk|   3120 |42
 nb|   1047 |35
 pt|583 |23
 ro|994 |23
 bg|   1762 |13

for potlatch:
 lang  |  num   | num_users
---++---
   | 800309 | 46679
 en| 109151 |  8023
 de|  63330 |  4901
 fr|  11479 |   981
 ru|  10075 |   732
 es|   5274 |   549
 it|   7374 |   523
 nl|   6520 |   347
 pl|   1733 |   226
 sv|   3447 |   155
 pt-BR |989 |   152
 fi|   1410 |   105
 no|   1763 |84
 cs|505 |82
 ja|   1242 |80
 da|   2261 |77
 hu|382 |67
 ro|   1041 |59
 pt|163 |29
 tr| 88 |26

not sure what the null language results are - presumably at some point
the editors weren't putting a language in their changeset comments or
something?

cheers,

matt

On Fri, Dec 4, 2009 at 11:45 PM, Ævar Arnfjörð Bjarmason
 wrote:
> On Fri, Dec 4, 2009 at 20:09, Matt Amos  wrote:
>> as a massive self-plug, here's a more in-depth look at some of the
>> editor data. unfortunately, there isn't enough data to do this for any
>> editor other than the "big three", but hopefully in six months time...
>
> This looks nice. It's certainly a lot better than my ad-hoc statistics
> I posted in september:
>
>    http://lists.openstreetmap.org/pipermail/talk/2009-September/042902.html
>
> One thing that I missed about your stats though is language statistics
> like the ones I posted (but done better with fancy graphs, obviousl
> :).
>
> Since I posted my stats I've added information about the user language
> to Potlatch (previously only JOSM had them), but unfortunately it
> looks like the Merkaartor people have ignored my request of adding it
> to their created_by string.
>
> It would be really cool to see statistics per-editor and per-language
> presented in such a way that one could gauge whether an editor being
> localized had an effect on its update.
>
> I recently found out for example that the Potlatch translation into
> Italian was really incomplete while JOSM was almost 100% translated
> into Italian (and JOSM has like 4000 strings while Potlatch has around
> 300). I contacted some Italian translators about this and Potlatch now
> has a much better Italian translation.
>
> Perhaps some Italians where shunning Potlatch because of this, it
> would be interesting to see stats to confirm or disprove that.
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2009-12-16 Thread Matt Amos
On Wed, Dec 16, 2009 at 11:25 AM, Nick Black  wrote:
> Wondered if there had been any update on these numbers from the latest planet?

these are the numbers from today's planet:

 editor  |   num   | num_data | num_users
-+-+--+---
 Potlatch| 1029797 |   775495 | 57637
 JOSM| 1139962 |  1090173 | 13282
 Merkaartor  |  129510 |   122438 |  2160
 Mapzen POI Collector|2124 | 2088 |   389
 Mapzen Beta |1261 |  821 |   271
 BigTinCan Upload Script | 293 |  242 |   143
 iLOE|1202 | 1106 |   117
 osm2go  | 903 |  870 |99
 Osmose Raw Editor   | 813 |  399 |85
 bulk_upload |   61908 |57209 |64
 osmtools|8951 | 8614 |59
 Vespucci| 614 |  426 |52
 Mapzen Alpha| 333 |  185 |37
 andnav  | 214 |  195 |33
 QGIS OSM v  | 128 |  110 |28
 PythonOsmApi| 946 |  761 |18
 upload  |   35585 |33968 |14
 OpenSeaMap-Editor-  |  66 |   60 |11
 KMLManager  |   17443 |17313 | 9
 GpsMid_ | 123 |  100 | 7

impressive gains for mapzen POI collector, up 150 users over two
weeks. potlatch is up 1,696, but that's only +2.9% whereas mapzen's
gain is +38.6%. the most impressive fractional gain is mapzen beta, by
+76%.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Structured error messages from API

2009-12-11 Thread Matt Amos
On Fri, Dec 11, 2009 at 6:24 PM, Ian Dees  wrote:
> Actually, I was just now creating a stub page for API 0.7 brainstorming:
>
>  http://wiki.openstreetmap.org/wiki/API_v0.7
>
> Remember, it's a brainstorm: all ideas are good ideas at this point... ;-).

cool! although i'd consider "verified users / locked tags" to be an
anti-feature ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Structured error messages from API

2009-12-11 Thread Matt Amos
On Fri, Dec 11, 2009 at 5:05 PM, Matthias Julius  wrote:
> Richard Fairhurst  writes:
>
>> Tom Hughes wrote:
>>> Basically, even if sending a streaming response from rails was
>>> possible (which I'm not sure it is - it's certainly hard)
>>
>> You can do it using render :text=>proc, as amf_controller does. But I
>> suspect this would be non-trivial to work into the existing XML API.
>
> The API could also return an URL which the client can poll to get the
> status for an upload.
>
> This would also avoid issues with client timeouts.
>
> Currently, when the client hits a timeout and aborts the connection it
> has no way of knowing whether the upload succeeded or not.  For new
> data this creates duplicates when the user tries to upload again.

i have been thinking about something similar, which would work along
the following lines. the changeset is a container and POSTing a diff
to it returns a 202 Accepted result if diff processing takes more than
a few seconds. the result is a Location redirect to the individual
resource for the diff upload, which could be polled for status.

unfortunately, that's where it gets complicated. because the diff
upload occurs in a transaction, none of its outputs are visible until
it commits. therefore any status would need to be posted on a
different connection, in a different transaction. this makes things
annoying a messy. if anyone's got any ideas then i'd be very
interested to talk.

this is something that we can put into API 0.7 - which it might be a
good idea to start thinking about now, since the last API took about a
year to develop ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


[OSM-dev] full history planet (experimental)

2009-12-09 Thread Matt Amos
hi devs,

i've just finished running Lars' full history planet dumper and the
results are here:
http://planet.openstreetmap.org/full-experimental/full-planet-091207.osm.bz2
(13Gb)

although a skim through the file shows nothing obviously wrong, this
should be considered experimental. please report any problems you find
to us and we'll try and figure it out. also, the dump should have
referential integrity (a way might reference a deleted node, but
shouldn't reference a non-existent node) so please report if that
isn't the case.

it seems to take about 2-3 days to dump, although much of that is
dependent on pbzip2 performance, and causes enough of a load on the DB
to delay the daily dumps by an hour or two. so i don't think this is
something that's worth running every week, but maybe every month or
two.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2009-12-04 Thread Matt Amos
On Fri, Dec 4, 2009 at 6:49 PM, Shaun McDonald
 wrote:
>
> On 4 Dec 2009, at 17:50, John Smith wrote:
>
>> 2009/12/5 Nick Black :
>>> Thanks for the update Matt.  As this is changeset based, this means
>>> that Mapzen POI Collector is the fourth most used OSM editor ever,
>>> since changesets were introduced?
>>
>> The order is by numbers of users, not number of uses.
>
> +1

yeah, i think it's fair to say that Mapzen POI collector is the fourth
most popular OSM editor over the last 6 months.

as a massive self-plug, here's a more in-depth look at some of the
editor data. unfortunately, there isn't enough data to do this for any
editor other than the "big three", but hopefully in six months time...

http://www.asklater.com/matt/wordpress/2009/12/editor-retention/

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2009-12-03 Thread Matt Amos
here's the results for wednesday's (2nd) planet dump, again over the
history since each editor started adding changeset tags. Mapzen POI
collector has gained over 200 users in the past week, adding 636
changesets. well done to the Mapzen team!

 editor  |   num   | num_data | num_users
-+-+--+---
 Potlatch|  988916 |   739623 | 55941
 JOSM| 1092702 |  1043776 | 12945
 Merkaartor  |  124756 |   117812 |  2100
 Mapzen POI Collector|1374 | 1341 |   239
 BigTinCan Upload Script | 266 |  219 |   124
 iLOE|1113 | 1019 |   105
 osm2go  | 901 |  868 |99
 Osmose Raw Editor   | 756 |  358 |82
 Mapzen Beta | 218 |  155 |65
 bulk_upload |   60110 |55417 |61
 osmtools|8326 | 8021 |57
 Vespucci| 558 |  374 |49
 Mapzen Alpha| 333 |  185 |37
 andnav  | 207 |  191 |31
 QGIS OSM v  | 117 |  100 |24
 PythonOsmApi| 937 |  753 |18
 upload  |   35560 |33959 |14
 KMLManager  |   17443 |17313 | 9
 GpsMid_ | 121 |   98 | 7
 OpenSeaMap-Editor-  |  41 |   35 | 6

cheers,

matt

On Sun, Nov 29, 2009 at 4:21 PM, Matt Amos  wrote:
> On Sunday, November 29, 2009, Simone Cortesi  wrote:
>> On Sun, Nov 29, 2009 at 15:46, Nick Black  wrote:
>>> Interesting data.  If there are 190,000 OSM contributors and around
>>> 10% of them are active, how come there are 50,000 users using Potlatch
>>> on one day?
>>
>> probably that is the total changesets "produced" with the given
>> editor. not on a single day. 25th is the date of the snapshot.
>
> That's right, the numbers are from last week's changeset dump. So it
> only counts changesets created between then and whenever the editor
> started writing created_by tags on changesets. About six months or so.
>
> Since mapzen wasn't out on the 25th those numbers are low, but I
> expect they'll pick up on the 2nd.
>
> Cheers,
>
> Matt
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Distance Grid

2009-12-01 Thread Matt Amos
On Tue, Dec 1, 2009 at 7:03 PM, Ævar Arnfjörð Bjarmason
 wrote:
> On Tue, Dec 1, 2009 at 17:06, Matt Amos  wrote:
>> On Tue, Dec 1, 2009 at 4:04 PM, Vitor George  wrote:
>>> I want to start a project in Brazil that is similar to the Tiger Fixup 250
>>> Cities [1], and I looking forward to build a script that generates a
>>> distance grid, like this:
>>>
>>> http://matt.sandbox.cloudmade.com/usa-routes.html
>>>
>>> I've tried to contact the authors via wiki, but I was unsuccessful, so I
>>> want to develop it by myself. I'm not a professional programmer, but I have
>>> some experience in java. Is the LibOSM the easiest way tool to implement it?
>>
>> sorry you missed us on the wiki. i have to say, i don't really check
>> the discussion pages like a good wikian should :-(
>>
>> the script is very simple, and uses cloudmade's ruby API [1] to get
>> the routes. you'll need to sign up for a cloudmade API key, if you
>> haven't already. the script i've attached will output CSV on stdout,
>> but i usually put them in a database for easy access.
>>
>> run it like "ruby routes.rb ". i've attached an example
>> configuration file, which is just a yaml map of string to lat/lon
>> array.
>>
>> i look forward to seeing brazil's 250 cities :-)
>
> I made a hacky script to generate .yml from XAPI output (attached):
>
> wget 
> 'http://www.informationfreeway.org/api/0.6/node[place=city|town|village|hamlet][bbox=-25.74085,62.84553,-12.41708,67.50085]'
> -O iceland-places.osm
> perl -CI place2yaml.pl < place.osm  > place.yml
>
> But I just get internal server errors from the CM API:
>
> $ ruby routes.rb place.yml
> [Tue Dec 01 18:59:57 + 2009] HTTP error: Couldn't read data. HTTP
> status: #, retrying...

interesting. if you could send me a trace of what's going on that
would be very helpful.

> Anyway do you have a script to generate that cute HTML matrix? And how
> can I link to a route on CM's website. I have to supply lat/lon/zoom
> it seems and not just starting/ending lat/lon (at least the URLs I've
> seen are all like that).

i've just thrown something together here - might be buggy, etc...
you'll need to play with the factor variable to see what looks right
for your area.

cheers,

matt


to_html.rb
Description: application/ruby
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Distance Grid

2009-12-01 Thread Matt Amos
On Tue, Dec 1, 2009 at 4:04 PM, Vitor George  wrote:
> I want to start a project in Brazil that is similar to the Tiger Fixup 250
> Cities [1], and I looking forward to build a script that generates a
> distance grid, like this:
>
> http://matt.sandbox.cloudmade.com/usa-routes.html
>
> I've tried to contact the authors via wiki, but I was unsuccessful, so I
> want to develop it by myself. I'm not a professional programmer, but I have
> some experience in java. Is the LibOSM the easiest way tool to implement it?

sorry you missed us on the wiki. i have to say, i don't really check
the discussion pages like a good wikian should :-(

the script is very simple, and uses cloudmade's ruby API [1] to get
the routes. you'll need to sign up for a cloudmade API key, if you
haven't already. the script i've attached will output CSV on stdout,
but i usually put them in a database for easy access.

run it like "ruby routes.rb ". i've attached an example
configuration file, which is just a yaml map of string to lat/lon
array.

i look forward to seeing brazil's 250 cities :-)

cheers,

matt

[1] http://developers.cloudmade.com/projects/show/ruby-lib


routes.rb
Description: application/ruby


usa_extract.yml
Description: Binary data
___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2009-11-29 Thread Matt Amos
On Sunday, November 29, 2009, Simone Cortesi  wrote:
> On Sun, Nov 29, 2009 at 15:46, Nick Black  wrote:
>> Interesting data.  If there are 190,000 OSM contributors and around
>> 10% of them are active, how come there are 50,000 users using Potlatch
>> on one day?
>
> probably that is the total changesets "produced" with the given
> editor. not on a single day. 25th is the date of the snapshot.

That's right, the numbers are from last week's changeset dump. So it
only counts changesets created between then and whenever the editor
started writing created_by tags on changesets. About six months or so.

Since mapzen wasn't out on the 25th those numbers are low, but I
expect they'll pick up on the 2nd.

Cheers,

Matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Tagwatch Editor Counts

2009-11-28 Thread Matt Amos
in case anyone is interested, here's the data from the 25th. num is
the number of changesets, num_data is those with bboxes, num_users is
the number of distinct (public) user ids. of course, this was before
mapzen poi collector was released in the app store, so it's not very
helpful for that ;-)

                   creator                   |   num   | num_data | num_users
--+-+--+---
Potlatch                                     |  964387 |   718768 |     54823
JOSM                                         | 1064183 |  1015659 |     12741
Merkaartor                                   |  121672 |   114819 |      2057
BigTinCan Upload Script                      |     245 |      201 |       109
osm2go                                       |     898 |      865 |        98
iLOE                                         |     940 |      850 |        84
Osmose Raw Editor                            |     714 |      330 |        78
bulk_upload                                  |   59740 |    55051 |        58
osmtools                                     |    7914 |     7622 |        56
Vespucci                                     |     542 |      359 |        48
Mapzen Alpha                                 |     310 |      164 |        37
andnav                                       |     207 |      191 |        31
Mapzen POI Collector                         |     738 |      709 |        27
QGIS OSM v                                   |     114 |       98 |        23
PythonOsmApi                                 |     935 |      753 |        18
upload                                       |   35026 |    33431 |        14
KMLManager                                   |   17443 |    17313 |         9

cheers,

matt

On Sat, Nov 28, 2009 at 6:37 PM, Shaun McDonald
 wrote:
>
> On 28 Nov 2009, at 13:00, Nick Black wrote:
>
>> Hi Guys,
>>
>> A few people have been asked if I have any stats on Mapzen POI
>> Collector usage.  I took a look at the tagwatch editor page [1] but
>> neither Mapzen Alpha or Mapzen POI Collector in any of its versions
>> appears in the list.  Both apps tag the change set [2] rather than the
>> node or way that is created.  I'm wondering if this is why there are
>> no stats on the editor's usage in Tagwatch?  If this is the case, how
>> can I help update Tagwatch to look at change sets?
>>
>> [1] http://tagwatch.stoecker.eu/Europe/En/top_used_editors.html
>> [2] http://www.openstreetmap.org/browse/changeset/2996362
>>
>>
>
> It looks like that up until the 26 November there have been less than 783 
> changesets with the exact editor name that mapzen is using.
>
> You can use the changeset info dump to do this analysis. It is available from 
> http://planet.openstreetmap.org/ on a weekly basis.
>
> Shaun
>
>
> ___
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Complete history of OSM data - questions and discussion

2009-11-12 Thread Matt Amos
On Thu, Nov 12, 2009 at 4:28 PM, Lars Francke  wrote:
> - As of now the XML is not indented. I use Woodstox[1] for XML output
> and that doesn't have an option to "pretty print" the output. It is
> not a problem for me but if it is requested I can use StaxMate or
> something else to properly indent the XML

i'm pretty sure no-one minds. as you say, anyone who wants it indented
can easily do it with xmllint and friends.

> - Changesets: num_changes from the database isn't dumped in planet.c.
> It is queried from the database but not used anywhere. The data _can_
> be calculated but it isn't that easy if not using the standard db
> schema and not easily done by reading the XML stream. I could just
> dump it too. I haven't had a look at the API if this field is set
> correctly at all?!

it should be set correctly. you're welcome to dump it out on the
changesets if you think it's useful.

> - I'm using the same technique as planet.c in regards to the output of
> the data (just streaming it to standard output), I just assume that
> this is okay? Are there any other things I'll have to change in
> comparison to the way planet.c works?

yeah. the output will be piped directly into pbzip2, most probably.

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Complete history of OSM data - questions and discussion

2009-11-11 Thread Matt Amos
On Wed, Nov 11, 2009 at 1:29 PM, Lars Francke  wrote:
> I had
> brief discussions with Brett about Osmosis and incorporating certain
> changes into it so I've spent quite some time in its source code.
> Having said that: I probably won't program this as a new task for
> Osmosis but as a standalone program as this probably won't be used
> widely and doesn't justify the extra work required to incorporate this
> into Osmosis.

just remember that new code = new bugs ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Complete history of OSM data - questions and discussion

2009-11-11 Thread Matt Amos
On Wed, Nov 11, 2009 at 6:41 AM, Lars Francke  wrote:
> There are a few questions that probably need answering first and I
> hope we can start a discussion about this:
> - Am I correct in assuming that there are no general objections from
> the OSM server folks against such a dump? (Which would render the rest
> of this E-Mail useless ;-)

the response has always been "if someone writes it, and it's good,
we'll run it" :-)

> - Is anyone else currently working on this?

for some values of "working", yes. it's on my list of things to do for
the license change plan - clearly we'll need a full data dump before
we can re-license.

> - Which format should the data be dumped in

(3) is the easiest to get done and most easily supported, in my opinion.

> - Distribution of the data and storage space requirements

i have a feeling that the data, while big, won't be so big that the
usual method of planet.osm.org + heanet mirror won't work.

> - Interval of dumps

based on back-of-the envelope calculations, a full dump in planet
format would take something like 7-10 days to do in parallel with
normal server activity. so it couldn't be run every week and would
probably be cumbersome to do every month. in my opinion, we should be
looking at every 3-6 months.

> 3) A dump of all OSM elements in OSM format
> (http://www.openstreetmap.org/api/0.6/node/60078445/history)

this is my favourite method as well. the easiest approach would be to
modify planet.c to dump the full history, instead of just the
current_* tables.

note that brett has been working on option (2) by using osmosis to
dump very historical diffs going back to the inception of the
database. you can see the experimental results in
http://planet.openstreetmap.org/history/

for my money, if we do both (2) and (3), then we cater for all
consumers, and in a standard format. the output of the COPY command,
while good for backups, isn't really suited to dumping the information
that we have in the planet (given there will be edits by users who are
still not public, etc...)

if you want to get started hacking on planet.c then i'm happy to help.
otherwise i'm hoping to get around to it by the end of the month, but
there are never any guarantees ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Missing changes in minute-replicate

2009-11-09 Thread Matt Amos
On 11/9/09, Jon Burgess  wrote:
> On Mon, 2009-11-09 at 21:17 +0100, Peter Körner wrote:
>> >
>> > The changes in changeset 3073502 are completely missing from the
>> > minute-replicate diffs. They appear in both the minute and hourly
>> diffs.
>> >
>>
>> I checked node 559977476 [1] that has been created in changeset
>> 3073502
>> [2]. I can't find them in the minute-replicate diffs. They should be
>> in
>> the index between 000/084/536 - 000/084/537 but grepping for this id
>> returns nothing.
>
> I looked on the mapnik database being fed by these diffs and it does not
> show this node either. There is not a single reference to changeset
> 3073502 in any of the 100 - 999 diffs in the 000/084 directory.
>
> References to the changesets on either side (3073501 & 3073503) appear
> in the 539.osc.gz file. I have no idea why this one has gone missing.

possibly it's a coincidence, but changeset 3073502 was committed in
transaction 174978321, which is the txnMax of replication diff 84538.
i don't see anything special about that number, though.

is it possible that the connection pooling in the apache DBCP
BasicDataSource means that the transaction state query is occurring in
a connection which has txn 174978321 committed, but the replication
query happens in one where it hasn't been committed yet?

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] usernames, keys, and values

2009-10-29 Thread Matt Amos
On Thu, Oct 29, 2009 at 6:16 PM, Anthony  wrote:
> On Thu, Oct 29, 2009 at 1:31 PM, Matt Amos  wrote:
>> On Wed, Oct 28, 2009 at 3:38 PM, Andy Allan  wrote:
>>> I would say that if the dump code and
>>> http://www.w3.org/TR/REC-xml/#NT-Char
>>> are in conflict, there's a bug in the dump code. But since I'm not
>>> going to fix it, maybe I'll keep my opinions quiet :-)
>>>
>>> As for the rails code, there is (AFAIK) no explicit character
>>> checking. The server implicitly relies on libxml to ensure the
>>> characters in the XML requests and responses are only those allowed by
>>> the XML spec above.
>>
>> there is explicit checking in the potlatch API, as that doesn't go
>> through libxml:
>>
>> http://trac.openstreetmap.org/browser/sites/rails_port/app/controllers/amf_controller.rb#L909
>
> There doesn't seem to be a spec, so everyone's just making it up as
> they go along.

sort of. the "spec" is just that the API talks XML, and we aim not to
have any restrictions beyond that. so anything that's a valid XML
character (http://www.w3.org/TR/REC-xml/#charsets) should be allowed
in OSM.

> But I'm going to attempt to clarify, with a quote from W3: "In
> attribute values, the character information items TAB (#x9), newline
> (#xA), and carriage-return (#xD) are represented by "	", "
",
> and "
" respectively."
> (http://www.w3.org/TR/2000/WD-xml-c14n-2119.html)
>
> Including a tab, newline, or carriage return unescaped in an xml
> attribute would clearly be incorrect.  But as long as it's escaped,
> it's valid xml.   is valid xml.
> It may or may not represent valid OSM data.  This is why I'm saying my
> question has nothing at all to do with XML.

but there are other escaped values which are invalid XML (e.g: �).

> Apparently under that potlatch code, tabs, carriage returns, and
> newlines are not allowed in keys or values (I don't actually know
> ruby/rails enough to say for sure, but that seems to be what Matt just
> pointed out).

the code, which is a bit opaque:

delete "\000-\037", "^\011\012\015"

says that it'll delete any char in the range 0-37 (octal), but not 11,
12 or 15. in ascii this corresponds to anything "less than" a space,
but not tab, newline or carriage return. it should be (modulo bugs)
the same as the XML valid character productions.

> On the other hand, usernames apparently *can*, at this
> point, contain these characters.  Actually changing one's username to
> include them would require using an input method other than the web
> page, but I don't see any code to forbid this.

yes, we should probably add a rails validation to stop people using
those chars in their username (who needs a tab in their username
anyway?)

> On the other hand, the planet dump code is silently changing control
> characters to "?".  This could cause problems (for instance, two
> usernames might wind up being silently changed to identical values),
> though it would probably require a deliberate attack.

the planet dump format includes the uids, so this case would be
detectable. however, this behaviour in planet dump is a bug and i'll
have a go at fixing it.

> I wonder, what happens if someone enters tabs into keys or values
> through the API (where there apparently are no checks for this), and
> then someone tries to edit it in potlatch?  Looks like a denial of
> service attack to me.

there don't need to be any checks - the xml is parsed using an xml
parser, which would barf if those chars were in the API calls.
potlatch also can't enter those chars via its API. i don't think
there's any attacks possible here, but i'd like to know if there are!

> It would be a good idea to release an official spec on exactly what
> characters are allowed in keys, values, and usernames.  Just
> disallowing control characters (decimal value less than 32) altogether
> would probably be the best.  But if the decision is made to allow
> them, fine, they need to be handled properly.

that's a good idea. i'll stick something up on the wiki - for
reference i think the current "spec" is the XML valid character
productions, although i can't think of any particular reason to keep
\t, \n or \r:

Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x1-#x10]

http://www.w3.org/TR/REC-xml/#NT-Char

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] usernames, keys, and values

2009-10-29 Thread Matt Amos
On Wed, Oct 28, 2009 at 3:38 PM, Andy Allan  wrote:
> On Wed, Oct 28, 2009 at 3:22 PM, Anthony  wrote:
>> On Wed, Oct 28, 2009 at 10:21 AM, Frederik Ramm  wrote:
>>> Hi,
>>>
>>> Anthony wrote:

 Just took a look at the planet dump code at
 http://svn.openstreetmap.org/applications/utils/planet.osm/C/output_osm.c

 In xmlescape(): "} else if ((*in >= 0) && (*in < 32)) {
 escape_tmp[len] = '?'; len++; } else {"
>>>
>>> The planet dump code decides what gets *out* (to the planet file), not what
>>> gets in (to the database) and neither what gets out over the API. If you
>>> want to look at code to answer your questions then you should look at the
>>> rails port (/sites/rails_port) which guards the API.
>>
>> My real question was "how can I parse the planet dump file".  :)
>>
>> I briefly skimmed through the rails_port, but not knowing ruby/rails I
>> didn't get very far.  I suppose if I care I'll make a few tests
>> through the dev server api.  But for now I'm content with adding "if
>> (buf[ptr]<32) { fail(buf, "unexpected character data"); }" to my dump
>> parser.
>>
>> If someone decides that's a bug in the dump code, and not a feature,
>> please let us know here.
>
> I would say that if the dump code and
> http://www.w3.org/TR/REC-xml/#NT-Char
> are in conflict, there's a bug in the dump code. But since I'm not
> going to fix it, maybe I'll keep my opinions quiet :-)
>
> As for the rails code, there is (AFAIK) no explicit character
> checking. The server implicitly relies on libxml to ensure the
> characters in the XML requests and responses are only those allowed by
> the XML spec above.

there is explicit checking in the potlatch API, as that doesn't go
through libxml:

http://trac.openstreetmap.org/browser/sites/rails_port/app/controllers/amf_controller.rb#L909

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] usernames, keys, and values

2009-10-27 Thread Matt Amos
On 10/28/09, Anthony  wrote:
> Regarding usernames, keys, and values in the OSM database, are there
> are restrictions on what characters are allowed?  Tabs, carriage
> returns, line feeds, nulls?  All allowed?

i think the current set of allowed characters are those which are in
the XML character range: http://www.w3.org/TR/REC-xml/#NT-Char

this includes \t, \n and \r, but not nulls. to be honest, i can't see
a good use for these characters, but someone out there is bound to be
using them for something ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] get user id on api

2009-10-12 Thread Matt Amos
2009/10/12 Frederik Ramm :
> Hi,
>
>> Etienne Chové a écrit :
>>> How to get user id when we have the username ? (GET changesets needs
>>> userid and do not accept username).
>
> Then fix the API ;-)

done.

http://trac.openstreetmap.org/changeset/18103
http://wiki.openstreetmap.org/wiki/API_v0.6#Query:_GET_.2Fapi.2F0.6.2Fchangesets

note that it might not be deployed yet by the time you read this. ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] Hourly diffs are missing edits (too)

2009-10-07 Thread Matt Amos
On Wed, Oct 7, 2009 at 11:22 AM, Frederik Ramm  wrote:
> Hi,
>
> Tom Hughes wrote:
>>> Could Postgres be persuaded to abort any transaction that runs longer
>>> than "n" minutes (e.g. 30), and the we run the hourlies at hh:31 or so?
>>> That would probably be a slight inconvenience to those who happen to
>>> start 35-minute transactions but they should just learn to do their bulk
>>> imports properly ;-)
>>
>> Um... Are you being serious?
>
> Yes. In most cases, whatever HTTP client they were using to upload will
> already have terminated the TCP connection and complained about a
> timeout anyway, leading them to upload the same thing again...

unless they're on an extremely slow connection and the diff is
trickling into the server at a few bytes per second.

>> Why? We've already solved the problem with the transactional diffs...
>
> For many uses, I view the hourly diffs as superior as they will contain
> less "noise" (edits that cancel each other out will not be present in
> the hourlies). So I would really like to have reliable hourly diffs -
> even if they come with a considerable delay.

it's possible to do this by aggregating the replication diffs and
dropping intermediate versions. of course, this won't exactly match an
hour boundary, but should usually be pretty close.

> My reasoning is that I feel uneasy when these transactions are
> completely unlimited. For all I know, there might be a freak case where
> a transaction runs for 8 days and I suddenly have an object in this
> week's planet which is 8 days old but wasn't in last week's planet (etc.).

but since the transaction commits atomically, the only evidence of
this is would be the timestamp on the element. the timestamp on the
element may as well be replaced with the timestamp of the replication
diff it came in.

> So I would like to have *some* certainty about the run time of
> transactions. Even if it is 24 hours or so - just some value where I
> know that no transaction can possibly exceed this duration. And if we
> should find that only one promille of all transactions takes longer than
> thrity minutes then I'd be prepared to sacrifice that one transaction if
> that buys me accurate hourly diffs at 31 minutes past the hour.

aggregating replication diffs gives you accurate nearly-hourly diffs
at 1 minute past the hour - wouldn't you prefer those? all it takes is
a little mental re-adjustment away from a time-based stream towards a
transaction-based stream. come on in, the mental water is fine ;-)

cheers,

matt

___
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev


  1   2   3   4   >