Re: Experiment Ontology

Tim Clark Tue, 11 Dec 2007 10:52:44 -0800


Hi Susie,

I think it might be worthwhile to arrange a discussion with the SWANteam about this ontology. Could we invite you to one of our regularmeetings in January to discuss?


Best

Tim


On Dec 11, 2007, at 12:21 PM, Susie M Stephens wrote:

Hi Bill,

Thanks for all of your great feedback. :-)

The folks at Lilly who developed the ontology did review a number of
existing ontologies, but they didn't meet our needs. I don't havethe fulllist of ontologies that they explored, but they definitely took alook at
OBI. We are very interested in working with the community to further
develop the ontology, and are in the process of scheduling a callwith some
of the OBI folks.

Cheers,

Susie















            Bill Bug
            <[EMAIL PROTECTED]
edu> To
                                      Susie Stephens
            12/06/2007 11:16          <[EMAIL PROTECTED]>
PM ccMatthias Samwald<[EMAIL PROTECTED]>,"public-semweb-lifesci@w3.orghcls"<public-semweb-[EMAIL PROTECTED]>, Kei
                                      Cheung <[EMAIL PROTECTED]>,
                                      "Karen (NIH/NIDA) [E] Skinner"
                                      <[EMAIL PROTECTED]>, Alan
                                      Ruttenberg
                                      <[EMAIL PROTECTED]>
Subject
                                      Re: Experiment Ontology










Hi Susie,
We certainly do need an "Experiment Ontology" - or Ontology ofBiomedical
Investigation (OBI).
I believe Matthias, Michael, and Kei have all made exactly thepoints I
think are most important to consider:
1) Matthias's comments
Are you following "best practices" in creating the ontology. IbelieveMatthias gives many instructive examples on how to adjust what ishere to
bring it much more in sync with the emerging "best practices" that are
coming out of the community development surrounding a variety of OBO
Foundry ontologies. Matthias also makes the point that itsimportant to
seek to re-use (or directly contribute to) the emerging community
ontologies to cover the required domains. In the case of thisparticularExperiment Ontology, the ontologies to consider are Ontology ofBiomedical
Investigation (OBI), the OBO Relations Ontology, the Gene Ontology
(specifically the Molecular Function and Cellular Componentbranches, the
latter of which is designed to capture components down to the level of
macromolecular complexes), the Sequence Ontology, Protein Ontology(nascent- but proceeding rapidly), the Cell Ontology - at a minimum. Asmany onthis list know - and I'm certain the talented folks at Lilly whoinvestedtime in assembling this ontology also learned - many of these arenot fullyready for prime-time, and/or may not FULLY cover the breadth anddepth ofthe domains a specific application requires. However, if onedoesn't seek
to work with these community efforts, you cannot expect to achieve the
ultimately goal, which is to make your data maximally "semantically
sticky", so as to ensure the least amount of custom logic and humaneffort
will be required to get the most value from your data.  Otherwise, you
stand the chance of creating what may be a useful ontology thatmeets your
specific requirements (as has been true of "investigation"-oriented
ontologies that have come before such as the MAGE Ontology,ExperiBase,
EXPO, myGRID KAVE, etc.), but don't help the community at-large to
appropriately re-use your data.  In each case, these ontologies or KR
frameworks have been extremely useful in the local applicationcontext forwhich they were constructed, but they cannot be effectively employedas thebasis for semantically-driven integration across data sets that maynot be
able to accept the constraints (or lack thereof) of this
application-oriented ontology.
Would you know off-hand, Susie, whether the folks who worked on this
ontology at Lilly have both reviewed the relevant community effortscitedabove and/or have sought to interact with those groups to get someinput onhow best to meet the overall requirements that underlie thisparticularExperiment Ontology with the minimal required effort and in a mannerthatcould help to ensure Lilly's sunk investment could be of benefit tous all.
2) Michael's comments
It's very helpful to know what the target is when it comes to
exporting/exchanging the actual data. As Michael points out, agreat dealof work has gone into the production of FuGE (and MaGE before it) tocomeup with the appropriate division of labor between the semantically-opaque,syntactical requirements as represented in a data model such as MaGEorFuGE and the explicit semantics as captured in the ontology. Forthoseusing FuGE, as Michael states, in the realm of syntax, the intentionfor
FuGE is to provide a shared structure for universal elements such as
biomaterials, experiment populations/pools/groups, protocol details,
reagents details, etc..  Built on that shared, generic foundation, any
specific discipline - e.g., microarray expression, GC-MS, FISH, MRI,etc. -can sub-class FuGE components and add what additional detailrequired intheir discipline. In parallel with this effort on data structure,the OBIontology cooperative seeks to provide that same foundation for theshared
semantic domains, and a clear set of recommended practices for how to
re-use entities from other OBO Foundry ontologies such as ChEBI,SequenceOntology, Protein Ontology, OBO Cell, Organism Taxonomy (OWLversions of
NCBI Tax), etc. to specify the critical biomedical entities and their
complex relations. As I say above, these are works in progress.For thoseof us who must have something working now, the recommended practiceis toactively participate in these projects with an eye toward followingtheirpractice - and replacing any "proxy" you create in the interim withthecommunity ontology, when it is ready for use. This is what we havedone in
the BIRN ontology BIRNLex.  We actually have an OWL module called
"BIRNLex-OBI-Proxy.owl" which we fully intend to replace with OBIentities,when they are ready for use. We also have "BIRNLex-Investigation.owl" thatbuilds on this "proxy" to cover entities BIRN researchers mustcapture. Weexpect to eventually see the contents of "BIRNLex-Investigation" inOBI insome form. We intend to "contribute" those elements from this OWLfile
directly to OBI, when OBI is ready for them, and we have the time work
through this migration process.

3) Kei's comments
Examples - examples - examples. This is critical. Working throughthe
example Kei cites from the NIH Neuroscience Microarray Consortium is a
wonderful way to determine whether:
- there are existing community ontologies that can meet the KR and
processing requirements
- where the gaps are in those community ontologies
- whether the ontology you are creating effectively fills those gaps(if itdoes, that makes it very clear how the community effort can makeeffective
use of your ontology)
In regards to Gene Lists, Kei is certainly correct. If these arecapturedthrough algorithmic means, it's critical to capture the details onthat
algorithm - typically both the version of the algorithm as well as the
version of the data repository you ran it against.
Also - where gene entities are concerned - there is ongoing workbetween
the GO groups, the Sequence Ontology, and the Protein Ontology that is
particularly targeted toward capturing the specific relationsbetween typesof genomic sequence elements and types of biologically activeprotein-based
molecules (e.g., macromolecular complexes composed of a collection of
proteins in a variety of post-translationally modified states -e.g., GPCreceptors, ion channels, transporters, pathway enzymes, etc. - i.e.,Rx
drug targets).  These are the details we'll all require in order to do
round-trip pharmacogenetics - i.e.,effects of genetic constructs on
target susceptibility to drugs - AND - the ways in which drugsultimatelyalter macromolecular complexes by leading to changes in geneexpression.
Just my $0.02 filtering on these helpful comments from Matthias,Michael,
and Kei.

Cheers,
Bill

On Dec 3, 2007, at 1:00 PM, Kei Cheung wrote:


     This is great!

     I have a microarray experiment description (that has to do with
     Alzheimer Disease) extracted from NINDS microarray consortium:

     
http://arrayconsortium.tgen.org/np2/viewProject.do?action=viewProject&projectId=433773
I just wonder how this example would fit this experimentontology (as
     well as others such as OBI) As shown in this example, we record
     information such as organ type, organ region, cell type (layer II
     pyramidal neuron), etc. NINDS microarry consortium uses different
array platforms (e.g., agilent, Affymetrix, and cDNA) fordifferentorganisms so one may need to divide chips into groupscorrespondingto different platform types. Each group can then be furtherdivided
     into subgroups corresponding to different organisms.
We also would like to capture gene lists (not the raw genelists but
     the ones (much shorter) that indicate what genes are over/under
     expressed under certain experimental conditions). Such gene lists
     would usually be extracted from the literature. Also the analysis
     package (including version) that was used to generate a gene list
     should be identified. One possible use of these gene lists is to
compare them to identify genes are differentially expressedunder the
     same/similar experimental condition across different microarray
     experiments. This would help identify true signals from noises.

     Hope it helps.

     Cheers,

     -Kei



     Matthias Samwald wrote:

           Hi Susie,

           Susie wrote:
                 It would be great if you could take a look at it and
                 provide comments. The
                 ontology is available at:
                 
http://esw.w3.org/topic/HCLSIG_BioRDF_Subgroup/Tasks/Experiment_Ontology
* Some of the entities/properties are missing ardfs:label or
           have an empty label (a string with lenght 0).
* Some of the entities could be taken from existingontologieslike OBI, RO or some of the OBO Foundry ontologies. Thiswould
           save work and makes integration with other data sources and
ontologies much easier. By the way, there seem to beseveralgroups working on ontologies for mircoarray experiments,or areat least planning to do that. It would be great if thesegroups
           could work together.
* The class 'Chip type' should be removed and be replacedbysubclasses of 'chip', e.g., 'chip (human)', 'chip(mouse)' etc.* Some of the object properties appear like they areintended
           to be datatype properties (e.g., 'has proteome id').
* Many of the datatype properties could be replaced withobjectproperties, possibly referring to third party ontologies-- ofcourse this would require a richer ontology and more workspenton creating mappings. 'has molecular function' couldrefer toentities from the gene ontology, 'has associated organ'could
           refer to an ontology about anatomy and so on.
           * Object properties and their ranges are quite redundant.
           Property 'has reagent' has range 'Reagent', property 'has
treatment' has range'Treatment' and so on. Maybe theontology
           could be designed in such a way that there are only some
           generic properties such as 'has part'. This would make the
ontology much easier to maintain, query and understand inthe
           long term.
           * It is unclear how 'Gene list' is intended to be used.
           * 'Hardware' and 'Software' should not be subclasses of
           'Protocol'.


           Many of the datatype properties in this ontology look very
           interesting and might provide requirements for other
           ontologies. It would be great if some of them could be
described/commented in more detail so that we know moreabout
           the requirements that motivated the creation of these
           properties.

           I hope that was somewhat helpful.

           cheers,
           Matthias Samwald
William Bug, M.S., M.Phil.email:
[EMAIL PROTECTED]
Ontological Engineer (Programmer Analyst III) work: (610) 457-0443
Biomedical Informatics Research Network (BIRN)
and
National Center for Microscopy & Imaging Research (NCMIR)
Dept. of Neuroscience, School of Medicine
University of California, San Diego
9500 Gilman Drive
La Jolla, CA 92093

Please note my email has recently changed

Re: Experiment Ontology

Reply via email to