On Wed, Jan 28, 2009 at 11:51 AM, Brett Henderson <br...@bretth.com> wrote:

> Frederik Ramm wrote:
> > Will the planet dump and/or diffs be extended so that they contain all
> > changesets too, or what should be the preferred mode of operation for a
> > third-party application that wants to track changesets? The only way I
> > can currently think of would be looking at diffs to find out which
> > changeset IDs were active and then download them individually. Have I
> > overlooked something?
> >
> Replication of changesets is on my undocumented long term TODO list for
> osmosis (I should add it to trac) but I don't know when I'll be able to
> do it.  I had some discussions with Shaun and Matt a while back on how
> this might be done efficiently.  Identifying changesets for replication
> is a bit tricky and would probably involve two passes, first pass would
> identify all changesets created in a time interval, and the second pass
> would identify all changesets modified (ie. have entities referring to
> them) in a time interval.  Once identified they could be read and
> included in a changeset file just like any other entity.
>

Perhaps you only need to worry about exporting the changeset headers.  The
members of a changeset can be derived from the elements themselves.

<node id='456' changeset='123'> provides enough information in the current
diff files to be able to create a table of all changeset members.  The only
thing missing is the changeset header.



>
> The reason they're more difficult than other entities is twofold.
> 1. They don't have a history table so modifications are much harder to
> detect.
> 2. The bounding box information attached to each cannot be computed by
> osmosis in a "streamy" fashion which means they have to be re-replicated
> if they change.
>
> The two stages of changeset identification cause other problems such as
> being able to retrieve changesets after identification in an efficient
> manner (ie. not one by one), and potentially having to store identifiers
> in memory which is problematic at least theoretically if long interval
> changesets are being extracted (not likely to be an issue for the
> current daily changesets).
>
> Bit of a brain dump there but they're some of the reasons I'm holding
> off until I have a decent amount of time to invest in it.
>
> However, if replicating into a database osmosis will "invent"
> changesets.  It will create one changeset per user per changeset
> interval which will approximate the real thing.  It won't help if you
> need the real changeset id though.
>
> Brett
>
>
> _______________________________________________
> dev mailing list
> dev@openstreetmap.org
> http://lists.openstreetmap.org/listinfo/dev
>
_______________________________________________
dev mailing list
dev@openstreetmap.org
http://lists.openstreetmap.org/listinfo/dev

Reply via email to