Re: [OSM-legal-talk] ODbL: Where do we stand regarding collective/derivative databases

2009-07-28 Thread Matt Amos
On Tue, Jul 28, 2009 at 9:56 AM, Dave Stubbs wrote:
> On Tue, Jul 28, 2009 at 2:54 AM, Frederik Ramm  wrote:
>>
>> Hi,
>>
>> Matt Amos wrote:
>> > LWG cannot entirely resolve these questions, as they need open
>> > discussion and community consensus (which we obviously can't provide
>> > on our own). even then, final interpretation is up to the courts.
>>
>> Of course.
>>
>> Thanks for your comments, I especially liked the a(b(X)@c(Y)) part which
>> is a nice structure to think about this.
>>
>> But about my Navteq+OSM example, you said that
>> > my reading would be that the deletions from the OSM data are a
>> > derivative database of both the OSM data and the navteq data and that
>> > the combination of navteq + (OSM - derivative) constitutes a public
>> > use of that derivative database, requiring the release of the navteq
>> > data.
>>
>> Now if I loaded my Navteq database into postgis and created a buffer
>> around every object, generating one giant buffer area multipolygon for
>> the whole world, then I could use that to subtract data from my OSM data
>> base and would then only have to publish the giant multipolygon under
>> ODbL (because that was mixed with OSM data) and not the original Navteq
>> data.
>>
>> So this means I'd have to get permission from Navteq to release the
>> giant buffer multipolygon under ODbL but if that is granted, I could
>> continue with my OSM-enhanced Navteq tiles plan, and OSM would gain
>> precious little from having access to the Navteq buffer multipolygon.
>> Right?
>>
>
> Do you even have to go that far? The Navteq multipolygon isn't actually part
> of the resulting derivative database, it's just part of the algorithm to get
> there. Assuming the result is just a shrunk version of the OSM DB I'd have
> thought the only thing you had to release in this case was the alterations
> made to the OSM DB -- ie: a list of the bits you deleted. We'd be within our
> rights to try and reconstruct the multipolygon from those deletions, but you
> wouldn't have to actually release it?
>
> or put another way: if I do o...@navteq = DD (where @ is some function that
> combines the datasets), there's no circumstance in which I have to release
> Navteq. My obligation is to release DD under ODbL (I can hand out the DD-OSM
> diff). This happens to entitle anybody else to attempt to reconstruct as
> much of Navteq as possible.
>
> The ODbL says I have to release changes, it doesn't say I have to tell you
> why I'm making them.
>
> Is that remotely the right reading?

hmm... i think you may be right. i guess it depends on how it's done.
if the merging is done by clipping out OSM data then maybe at most the
polygon would need to be released, but maybe not even that. if the
merging is done by matching (e.g: roadmatcher) then there's an
argument that the database of matches is also a derivative database
(which gets used to make the final derivative database) and would
require the release of (maybe only some) navteq data...

a very specific example may help: if i wanted to take navteq's list of
petrol stations and merge it with OSM's - always preferring navteq's
then i guess it would be simple to delete those in OSM which are close
(fsvo close) to navteq's and then render the two superimposed by
method #1. i think this fits your argument above - i can't see any
reason why ODbL wouldn't be satisfied by the release of just the
deleted items.

taking it further, if i wanted to join those petrol stations to a
routable network, or put them into a geocoding database... i think you
might get away with the geocoding example if, like rendering, your
geocoding search was independently performed on both the navteq and
OSM lists of points and composited. routing... may be different. i
can't, off the top of my head, think of any sensible way to keep the
two datasets separate and still useful for those purposes...

my head hurts now.

cheers,

matt

___
legal-talk mailing list
legal-talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/legal-talk


Re: [OSM-legal-talk] ODbL: Where do we stand regarding collective/derivative databases

2009-07-28 Thread Dave Stubbs
On Tue, Jul 28, 2009 at 2:54 AM, Frederik Ramm  wrote:

> Hi,
>
> Matt Amos wrote:
> > LWG cannot entirely resolve these questions, as they need open
> > discussion and community consensus (which we obviously can't provide
> > on our own). even then, final interpretation is up to the courts.
>
> Of course.
>
> Thanks for your comments, I especially liked the a(b(X)@c(Y)) part which
> is a nice structure to think about this.
>
> But about my Navteq+OSM example, you said that
> > my reading would be that the deletions from the OSM data are a
> > derivative database of both the OSM data and the navteq data and that
> > the combination of navteq + (OSM - derivative) constitutes a public
> > use of that derivative database, requiring the release of the navteq
> > data.
>
> Now if I loaded my Navteq database into postgis and created a buffer
> around every object, generating one giant buffer area multipolygon for
> the whole world, then I could use that to subtract data from my OSM data
> base and would then only have to publish the giant multipolygon under
> ODbL (because that was mixed with OSM data) and not the original Navteq
> data.
>
> So this means I'd have to get permission from Navteq to release the
> giant buffer multipolygon under ODbL but if that is granted, I could
> continue with my OSM-enhanced Navteq tiles plan, and OSM would gain
> precious little from having access to the Navteq buffer multipolygon.
> Right?
>
>
Do you even have to go that far? The Navteq multipolygon isn't actually part
of the resulting derivative database, it's just part of the algorithm to get
there. Assuming the result is just a shrunk version of the OSM DB I'd have
thought the only thing you had to release in this case was the alterations
made to the OSM DB -- ie: a list of the bits you deleted. We'd be within our
rights to try and reconstruct the multipolygon from those deletions, but you
wouldn't have to actually release it?

or put another way: if I do o...@navteq = DD (where @ is some function that
combines the datasets), there's no circumstance in which I have to release
Navteq. My obligation is to release DD under ODbL (I can hand out the DD-OSM
diff). This happens to entitle anybody else to attempt to reconstruct as
much of Navteq as possible.

The ODbL says I have to release changes, it doesn't say I have to tell you
why I'm making them.

Is that remotely the right reading?

Dave
___
legal-talk mailing list
legal-talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/legal-talk


Re: [OSM-legal-talk] ODbL: Where do we stand regarding collective/derivative databases

2009-07-27 Thread Frederik Ramm
Hi,

Matt Amos wrote:
> LWG cannot entirely resolve these questions, as they need open
> discussion and community consensus (which we obviously can't provide
> on our own). even then, final interpretation is up to the courts.

Of course.

Thanks for your comments, I especially liked the a(b(X)@c(Y)) part which 
is a nice structure to think about this.

But about my Navteq+OSM example, you said that
> my reading would be that the deletions from the OSM data are a
> derivative database of both the OSM data and the navteq data and that
> the combination of navteq + (OSM - derivative) constitutes a public
> use of that derivative database, requiring the release of the navteq
> data.

Now if I loaded my Navteq database into postgis and created a buffer 
around every object, generating one giant buffer area multipolygon for 
the whole world, then I could use that to subtract data from my OSM data 
base and would then only have to publish the giant multipolygon under 
ODbL (because that was mixed with OSM data) and not the original Navteq 
data.

So this means I'd have to get permission from Navteq to release the 
giant buffer multipolygon under ODbL but if that is granted, I could 
continue with my OSM-enhanced Navteq tiles plan, and OSM would gain 
precious little from having access to the Navteq buffer multipolygon. Right?

Bye
Frederik

-- 
Frederik Ramm  ##  eMail frede...@remote.org  ##  N49°00'09" E008°23'33"

___
legal-talk mailing list
legal-talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/legal-talk


Re: [OSM-legal-talk] ODbL: Where do we stand regarding collective/derivative databases

2009-07-27 Thread Matt Amos
On 7/27/09, Frederik Ramm  wrote:
> generally good progress on ODbL; many things have been cleared up
> and we will soon be at a point where the proposal for a license change
> is not some cloudy abstract thing any longer but a very concrete
> proposal that people can evaluate.

i'm glad you think so :-)

> After the LWG has made an effort to resolve the questions about what is
> substantial and what is a derived work, in my eyes there's one big issue
> that remains, and that is "what is a derivative database".

LWG cannot entirely resolve these questions, as they need open
discussion and community consensus (which we obviously can't provide
on our own). even then, final interpretation is up to the courts.

> To recap, my understanding is that if I produce (+publish) works based
> on a derivative database that I have created then I have to make that
> database available, fully, under ODbL. If I, on the other hand, produce
> works based on a collective database that is half ODbL and half
> proprietary, then I only have to make the ODbL part available. Is that
> everyone else's reading as well?

it's my understanding also.

> Let us look at someone who mixes OpenStreetMap and Navteq data. Say I
> produce map tiles (clearly a produced work, no?) where all the streets
> come from Navteq, but all the footways come from OpenStreetMap. There
> are a number of ways to do this, all leading to the exact same result,
> and nobody from the outside can see which of 1,2,3 I am using:
>
> 1. Configure my Mapnik tile generator so that it accesses two different
> postgis databases - one containing Navteq and one containing OSM - to
> produce merged map tiles.

or, equally, configure two different mapnik instances to produce two
different tilesets (one with a transparent background) and composite
the two in a later postprocessing stage.

> 2. Pour OSM and Navteq data into the same postgis instance but have
> different tables (e.g. planet_osm_roads and navteq_roads) which are
> joined by Mapnik's SELECT statement.
>
> 3. Extract all footway geometries from OSM and insert them into my
> postgis database containing Navteq street data, then run Mapnik on the
> resulting database.
>
> The way I read the license, option 1 would be definitely ok, option 3
> would definitely lead to my having to release the Navteq data, and
> option 2 would be somewhere in between (probably ok until unknown to me,
> Matt comes along and makes Mapnik internally create temporary tables on
> the fly for better performance in which case I'd be creating temporary
> derivative databases without even noticing...)
>
> Evil business genius that I am, I would of course claim to be doing 1
> even when doing 3 and nobody would have the right to challenge me,
> right? Which would ultimately mean that:

i'd say that if 1 were technically plausible, then what's the real
difference between that and 2 or 3?

> "If there is any conceivable way that a produced work could have been
> created by using a collective rather than a derivative database, then
> only the ODbL licensed part of the data source has to be released."
>
> This is becoming interesting, we're very much into real-world business
> scenarios now. There are lots of people who'd shy away from using OSM
> outright but if they could use a Navteq basemap and sprinkle that with
> any additional detail that OSM might have that would be just great for
> them.

indeed, so your use-case with navteq roads and OSM footways wouldn't
require the navteq data to be opened.

unless you don't like the way the navteq and OSM data don't quite
match up; the point at which you alter the OSM/navteq data to match
up, then 1, 2 and 3 become inconceivable and the navteq data must be
released (along with the whole/diffs to OSM data).

it gets more difficult when we consider the collision detection in
mapnik - does the fact that an item isn't rendered in one layer
because it would occlude a previously rendered item constitute a
derivative database inside mapnik, or is it simply part of the
creative process of produced works?

> Let us look at someone who has a Navteq and an OSM data base, and runs a
> comparing analysis which results in *removing* all features from the OSM
> database which were also in Navteq. He clearly creates a derivative
> database but one which has no data added, just data deleted. He now
> employs technique #1 from above to merge the Navteq data set and the
> reduced OSM data set into one that contains the "best of both worlds".
> Since he is clearly operating on a collective database, he only has to
> release the derived OSM database under ODbL - the value of which is
> almost zero to the community since it has no data added (the only thing
> you can do with it is find out which of OSM's features are present in
> Navteq as well).
>
> Is everything I write here correct and compatible with what others are
> thinking? Is there some lawyer opinion on cases like this documented
> somewhere in the vast depths

Re: [OSM-legal-talk] ODbL: Where do we stand regarding collective/derivative databases

2009-07-27 Thread Frederik Ramm
Hi,

Frederik Ramm wrote:
> He now 
> employs technique #1 from above to merge the Navteq data set and the 
> reduced OSM data set into one that contains the "best of both worlds". 

I meant that he employs technique #1 to create a produced work from the 
Navteq and the reduced OSM data set - not that he actually merges them, 
that's the whole point of course.

Bye
Frederik

-- 
Frederik Ramm  ##  eMail frede...@remote.org  ##  N49°00'09" E008°23'33"

___
legal-talk mailing list
legal-talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/legal-talk


[OSM-legal-talk] ODbL: Where do we stand regarding collective/derivative databases

2009-07-27 Thread Frederik Ramm
Hi,

generally good progress on ODbL; many things have been cleared up 
and we will soon be at a point where the proposal for a license change 
is not some cloudy abstract thing any longer but a very concrete 
proposal that people can evaluate.

After the LWG has made an effort to resolve the questions about what is 
substantial and what is a derived work, in my eyes there's one big issue 
that remains, and that is "what is a derivative database".

To recap, my understanding is that if I produce (+publish) works based 
on a derivative database that I have created then I have to make that 
database available, fully, under ODbL. If I, on the other hand, produce 
works based on a collective database that is half ODbL and half 
proprietary, then I only have to make the ODbL part available. Is that 
everyone else's reading as well?

Let us look at someone who mixes OpenStreetMap and Navteq data. Say I 
produce map tiles (clearly a produced work, no?) where all the streets 
come from Navteq, but all the footways come from OpenStreetMap. There 
are a number of ways to do this, all leading to the exact same result, 
and nobody from the outside can see which of 1,2,3 I am using:

1. Configure my Mapnik tile generator so that it accesses two different 
postgis databases - one containing Navteq and one containing OSM - to 
produce merged map tiles.

2. Pour OSM and Navteq data into the same postgis instance but have 
different tables (e.g. planet_osm_roads and navteq_roads) which are 
joined by Mapnik's SELECT statement.

3. Extract all footway geometries from OSM and insert them into my 
postgis database containing Navteq street data, then run Mapnik on the 
resulting database.

The way I read the license, option 1 would be definitely ok, option 3 
would definitely lead to my having to release the Navteq data, and 
option 2 would be somewhere in between (probably ok until unknown to me, 
Matt comes along and makes Mapnik internally create temporary tables on 
the fly for better performance in which case I'd be creating temporary 
derivative databases without even noticing...)

Evil business genius that I am, I would of course claim to be doing 1 
even when doing 3 and nobody would have the right to challenge me, 
right? Which would ultimately mean that:

"If there is any conceivable way that a produced work could have been 
created by using a collective rather than a derivative database, then 
only the ODbL licensed part of the data source has to be released."

This is becoming interesting, we're very much into real-world business 
scenarios now. There are lots of people who'd shy away from using OSM 
outright but if they could use a Navteq basemap and sprinkle that with 
any additional detail that OSM might have that would be just great for 
them.

Let us look at someone who has a Navteq and an OSM data base, and runs a 
comparing analysis which results in *removing* all features from the OSM 
database which were also in Navteq. He clearly creates a derivative 
database but one which has no data added, just data deleted. He now 
employs technique #1 from above to merge the Navteq data set and the 
reduced OSM data set into one that contains the "best of both worlds". 
Since he is clearly operating on a collective database, he only has to 
release the derived OSM database under ODbL - the value of which is 
almost zero to the community since it has no data added (the only thing 
you can do with it is find out which of OSM's features are present in 
Navteq as well).

Is everything I write here correct and compatible with what others are 
thinking? Is there some lawyer opinion on cases like this documented 
somewhere in the vast depths of our Wiki and LWG minutes?

(I'm just trying to determine what exactly ODbL mandates - not trying to 
find out what would be desirable in an ideal world.)

Bye
Frederik

-- 
Frederik Ramm  ##  eMail frede...@remote.org  ##  N49°00'09" E008°23'33"

___
legal-talk mailing list
legal-talk@openstreetmap.org
http://lists.openstreetmap.org/listinfo/legal-talk