Re: protein entities (was Re: Rules (was Re: Ambiguous names. was: Re: URL +1, LSID -1)

June Kinoshita Thu, 19 Jul 2007 13:48:50 -0700

If I may put forward a key protein in Alzheimer disease as an examplethat we are grappling with, there is full-length APP (which itselfhas a number of forms as well as mutations); various peptides derivedfrom cleavage of APP; and then multimeric forms of the peptides,particularly Abeta42, which is known to form soluble dimer, trimer,tetramer, hectamer, and dodecamer, each of which may have differentfunctions or toxicities, as well as "misfolded" protofibrillar andinsoluble fibrillar forms, and possibly a pore-like form consistingof I-forget-how-many Abetas. In addition, proteins form complexesthat have functions that are different from those of the non-complexed protein. I look forward to seeing how the Protein Ontologyunfolds, so to speak! - June


On Jul 19, 2007, at 11:23 AM, Darren Natale wrote:

We don't yet have formal definitions for many of the classes andrelations (the effort only began in earnest a few months ago).But, basically, there is a distinction made between the full-length(in terms of amino acid sequence) protein and the sub-length partsof proteins (commonly called domains by protein scientists,unfortunately). The term "whole protein" is somewhat of aplaceholder; it is used to signify the evolutionary classes(families) of full-length proteins as opposed to the evolutionaryclasses of domains. Sequence form is again a placeholder term usedto denote the initial translation product from an mRNA, whichitself might be based on a "normal" gene or a mutant thereof, orwhich might be one of several possible alternatively splicedtranscripts from the normal or mutant gene. The cleaved ormodified product is a further breakdown of those initialtranslation products, and allows one to distinguish between aphosphorylated version of a protein and the non-phosphorylatedversion (as an example). The need for the latter derives from thefact that the two versions might have different functions.
Eric Jain wrote:
Darren Natale wrote:
We recently began a new Protein Ontology (PRO) effort gearedprecisely toward the formal definition of the "smaller entities"referred to by Alan. By "we" I mean the PRO Consortium,comprising the PIs Cathy Wu of PIR (which is also a memberorganization of the UniProt Consortium), Barry Smith of SUNYBuffalo, and Judy Blake of Jackson Labs. PRO is being developedwithin the framework of the OBO Foundry, and aims to specifyprotein entities at the level mentioned by Chris (accounting forsplice variation and post-translational modification andcleavage). Where appropriate, PRO will indeed make reference toboth other ontologies and to UniProt Knowledgebase (UniProtKB)records. Furthermore, we are also undertaking the "wildlyambitious" job of representing broader, more-inclusive classes ofsimilar proteins based on evolutionary relatedness.
A further description of PRO (with examples and link to a paper)can be found at http://pir.georgetown.edu/pro
This will no doubt be interesting to quite a few people here! Forthe sake of this discussion, could you elaborate a bit more on howthe different concepts in PRO are defined, i.e. what is a"protein", "whole protein", "sequence form" and "cleaved and/ormodified product"?

Re: protein entities (was Re: Rules (was Re: Ambiguous names. was: Re: URL +1, LSID -1)

Reply via email to