Re: ontology specs for self-publishing experiment

Alan Rector Sun, 09 Jul 2006 04:12:27 -0700

All


Just catching up.

Could I strongly support the following. If there is one repeatedlyconfirmed lesson from the medical communities experience with largeterminologies/ontologies/ it is to separate the "terms" from the"entities". There are always linguistic artefacts, and languagechanges more fluidly in both time and space than the underlyingentities. (In medical informatics this is sometimes quaintlyphrased as using "nonsemantic identifiers").


Regards

Alan

On 5 Jul 2006, at 22:43, William Bug wrote:

By the way, the "mapping" I refer to above linking instance datawhere ever it may reside (primary data repositories, pooled/analyzed/interpreted data, the scientific literature) to entitiesin the ontologies requires reference to the lexicon - the TERMSused to describe the ontological fundamentals by the scientistsreporting them. This is true whether an algorithm or a human istrying to understand and interpret a collection of instance data inthe context of the relevant knowledge framework, even if thatframework resides in the head of the human researcher.
I like to think of this distinction as being very coarselyanalogous to the distinction between the physical data model in anRDBMS and the many tools used to make that more abstracted,normalized collection of related entities directly useful forspecific applications - e.g., SQL SELECT statements, VIEWs, and/orMaterialized VIEWS. Maintaining these as distinct elements goes along way toward ensuring the abstraction is re-usable for a largeset of applications, while simultaneously being able to supporteach application's detailed requirements through custom de-normalization.
This is why I like to keep the lexicon distinct from the ontology.They are intimately linked. No ontology is free of lexicalartifacts (I'm not certain it can or should be), anymore than alexical graph can be assembled without representing semanticrelations. Analysis of the lexicon can inform how to adapt thesemantic graph in the ontology - make it more commensurate with thecurrent state of knowledge as expressed by domain experts, andreview of term use in the context of the ontology can be a greathelp in creating effective, structured, controlled terminologicalresources. However, the two types of knowledge resource areconstructed via different process, support different Use Cases, andrely on different fundamental relations at their core, howeverintimately they may be linked.


-----------------------
Alan Rector
Professor of Medical Informatics
School of Computer Science
University of Manchester
Manchester M13 9PL, UK
TEL +44 (0) 161 275 6149/6188
FAX +44 (0) 161 275 6204
www.cs.man.ac.uk/mig
www.clinical-esciences.org
www.co-ode.org

Re: ontology specs for self-publishing experiment

Reply via email to