Re: Ambiguous names. was: Re: URL +1, LSID -1

Alan Ruttenberg Mon, 16 Jul 2007 11:18:42 -0700

Summary: Answering Phil's questions, and clarifying one thing heasserts about what I said.



On Jul 16, 2007, at 12:22 PM, Phillip Lord wrote:

"Alan" == Alan Ruttenberg <[EMAIL PROTECTED]> writes:


Take these rhethorical questions:

I am interpreting these as questions of fact, that "same" meansinstances of the same class, with the classes you name considerednarrowly construed. That doesn't mean that we can't define broaderclasses in which instances of these two types are considered to bemembers of the same class.


Is Red Opsin in human the same as Red Opsin in Cattle?

No.

Is Red Opsin in me, necessarily the same as Red Opsin in you?

No.

What if they have a polymorphism?

No.

Are two isoforms from an alternate splice the same protein?

No.

If a protein has been partly digested, is it still the same?

No.

Are haemoglobin alpha and beta the same?

No.

The point is that you can't deal with a protein computationally.You can'tresolve it, analyze it computationally. It's always second handinformation
that you want to deal with.

Yes, but we generalize and boldly make statements about what we candirectly see, and find that these are supported by furtherexperiments or not, and possibly revise our statements. I *think* wewant to be able to capture such statements on the semantic web, no?

Yes, exactly. A uniprot record defines a class of proteinsextensionally. Thismeans, antibodies to the proteins described by OPSD_HUMAN (forexample).

Well, if I tell my agent to go order some OPSD_HUMAN from Invitrogen,what will you expect to get back. Or do you deny that I will want touse identifiers such as this for this kind of purpose.

<snip in the interest of brevity>

If we have the ability to express "the class of protein moleculesdefined by the swissprot record OPSD_HUMAN"
then I think we have all we need.

That would be a good start. How will we see if we've succeeded? Ihave some ideas, like picking two people who work in the field,asking them to describe what the set of proteins are that aredescribed by the swissprot record OPSD_HUMAN, and then comparing whatthey say. How would you know when we've succeeded at this?

I think that if we were there, then we could effectively start tobuild formal statements.

If we make our own definitions, all that we have done is duplicatewhat the uniprot team are already doing. And we will, almostinevitably, do it somewhat differently. All we would do is createconfusion. The only way that we ensure that we do the same thing asuniprot is say "yeah, what they said".
Unsatisfying, maybe. Clear definitions are important. Butinteroperability, and the lack of duplication are more so.

Forgive my confusion, but how exactly will we achieveinteroperability and lack of duplication if we don't havedefinitions? How would we know that we don't have duplication, forexample?

<snip>

And, yet, you just told me that you could buy a antibody with justa swissprot ID. So, let me restate the question, what are you goingto do with a protein ID that you are not going to do with aswissprot ID, or "the protein formally known as OPSD_HUMAN".

I did not say that. I've said some people have identified antibodiesby such ids. Unfortunately this information is of limited use whenactually ordering an antibody, where I am interested in much moreinformation, such as how specific it is, how it has been validated,and other properties related to how it behaves in certainexperimental settings. I *want* to be able to have identifiers(URIs)that are up to the job of ordering reagents.


-Alan

Re: Ambiguous names. was: Re: URL +1, LSID -1

Reply via email to