Being able to do it, and do something useful with it would be good, and might act as a good demonstrator. Again, I think the crucial question is what it is *linking to* that gives it the added value: I doubt that anyone would choose to do simple statistical analysis on the data set in rdf (although I would be glad to be shot down). Therefore if someone knows something we could link it with, I'd be interested. I have done something similar (in non-rdf) linking SEER with genomic data, but it's not big enough to make use of this.

It may also be that clear demonstration of the utility of this might encourage them to relax the licensing restrictions.

I have an idea for a different data set which I will send as a separate email.

Matt

Alan Ruttenberg wrote:
[cc changed to public-semweb-lifesci]

We could distribute a script that does the conversion to RDF so that individuals who wanted to use it could still get it themselves and put it into a local store.

There are two possible benefits of working with the data: 1) Learning something from it 2) Adding it to the pool of rdf that is in the demo

We can still perhaps benefit from 1), even if 2) is not possible - but you tell us whether you think that is of value...

-Alan

On Oct 5, 2007, at 4:26 AM, Matt Williams wrote:


I've had a very quick look at this. It might be salutary to read some parts of the data-user agreement.



1. You will not use nor permit others to use the data in any way other than for statistical reporting and analysis for research purposes. The SEER Program must be notified if it is discovered that there has been any other use of the data.

<snip>

3. You will not attempt to link nor permit others to link the data with individually identified records in another data base.

<snip>

6. You will not release nor permit others to release the data in full or in part to any person except with the written approval of the SEER Program. In particular, all members of the research team who have access to the data must have signed data-use agreements.

<snip>

7. You will use appropriate safeguards to prevent use or disclosure of the information other than as provided for by this data-use agreement. If accessing the data from a centralized location on a time sharing computer system or LAN with SEER*Stat or another statistical package, you will not share your logon name and password with any other individuals. You will also not allow any other individuals to use your computer account after you have logged on with your logon name and password.



I don't know to what extent this therefore causes problems with the idea of sharing the data; while it can still be copied into an rdf format, doing so and then keeping it on a local server seems (mostly) pointless.


--http://acl.icnet.uk/~mw
http://adhominem.blogsome.com/
+44 (0)7834 899570



--
http://acl.icnet.uk/~mw
http://adhominem.blogsome.com/
+44 (0)7834 899570

Reply via email to