Re: protein entities (was Re: Rules (was Re: Ambiguous names. was: Re: URL +1, LSID -1)

Darren Natale Thu, 19 Jul 2007 14:02:25 -0700

Quite a nice example! These are the sorts of issues that we mustcontend with while creating the PRO framework. In fact, this addressesanother issue of scope; that is, whether or not (in the long or shortterm) to also account for homodimers, trimers, and so on (currently, GOhandles hetermeric complexes). This also provides a good opportunityfor me to mention that our most immediate goal is to provide a frameworkthat can be built upon by others as well as us. That is, we wouldencourage you to unfold your own corner of the protein world! ;)


June Kinoshita wrote:

If I may put forward a key protein in Alzheimer disease as an examplethat we are grappling with, there is full-length APP (which itself has anumber of forms as well as mutations); various peptides derived fromcleavage of APP; and then multimeric forms of the peptides, particularlyAbeta42, which is known to form soluble dimer, trimer, tetramer,hectamer, and dodecamer, each of which may have different functions ortoxicities, as well as "misfolded" protofibrillar and insolublefibrillar forms, and possibly a pore-like form consisting ofI-forget-how-many Abetas. In addition, proteins form complexes that havefunctions that are different from those of the non-complexed protein. Ilook forward to seeing how the Protein Ontology unfolds, so to speak! -June
On Jul 19, 2007, at 11:23 AM, Darren Natale wrote:
We don't yet have formal definitions for many of the classes andrelations (the effort only began in earnest a few months ago). But,basically, there is a distinction made between the full-length (interms of amino acid sequence) protein and the sub-length parts ofproteins (commonly called domains by protein scientists,unfortunately). The term "whole protein" is somewhat of aplaceholder; it is used to signify the evolutionary classes (families)of full-length proteins as opposed to the evolutionary classes ofdomains. Sequence form is again a placeholder term used to denote theinitial translation product from an mRNA, which itself might be basedon a "normal" gene or a mutant thereof, or which might be one ofseveral possible alternatively spliced transcripts from the normal ormutant gene. The cleaved or modified product is a further breakdownof those initial translation products, and allows one to distinguishbetween a phosphorylated version of a protein and thenon-phosphorylated version (as an example). The need for the latterderives from the fact that the two versions might have differentfunctions.
Eric Jain wrote:
Darren Natale wrote:
We recently began a new Protein Ontology (PRO) effort gearedprecisely toward the formal definition of the "smaller entities"referred to by Alan. By "we" I mean the PRO Consortium, comprisingthe PIs Cathy Wu of PIR (which is also a member organization of theUniProt Consortium), Barry Smith of SUNY Buffalo, and Judy Blake ofJackson Labs. PRO is being developed within the framework of theOBO Foundry, and aims to specify protein entities at the levelmentioned by Chris (accounting for splice variation andpost-translational modification and cleavage). Where appropriate,PRO will indeed make reference to both other ontologies and toUniProt Knowledgebase (UniProtKB) records. Furthermore, we are alsoundertaking the "wildly ambitious" job of representing broader,more-inclusive classes of similar proteins based on evolutionaryrelatedness.
A further description of PRO (with examples and link to a paper) canbe found at http://pir.georgetown.edu/pro
This will no doubt be interesting to quite a few people here! For thesake of this discussion, could you elaborate a bit more on how thedifferent concepts in PRO are defined, i.e. what is a "protein","whole protein", "sequence form" and "cleaved and/or modified product"?

Re: protein entities (was Re: Rules (was Re: Ambiguous names. was: Re: URL +1, LSID -1)

Reply via email to