Re: Advancing translational research with the Semantic Web

Alan Ruttenberg Fri, 18 May 2007 09:04:56 -0700


On May 18, 2007, at 3:06 AM, Eric Jain wrote:

Alan Ruttenberg wrote:
If you want to say that the protein is found in some tissue,that's what should be said. However, in your email you wrote thatthe protein is expressed in the tissue.
Sorry about that, should run a consistency checker on my outgoingmail :-)

This is not a matter of consistency, it is a matter of saying what ismeant :)

If it is know to be found in the tissue I would make the subclassbe the subclass of the protein each instance of which is locatedin some instance of the tissue. No processes involved at all.
You would use different representations depending on how well it isknown?

All entities are involved in processes all the time. That we don'tknow the specifics doesn't mean they are not there, nor does it meanthat the representation is different when we state the specifics.

By a reasonable definition of process (following, e.g., the BFOpapers), if a process happens in a location, then each participant islocated in some part of that location. So if it turns out that thetruth is that the protein expression process happened in the tissue,and we had the relations appropriately encoded in our computationalsystem, then the location of the protein - the fact that was stated,would be able to be inferred. So we would have extra information, butthe information we have would stay true.

In fact, very few such axioms are currently encoded in the BFO andOBO ontologies, a problem which many people want to and will addressand which some, including myself, are working on. For example, Irecently encoded a bunch of axioms representing constraints onpart_of (e.g. a 3-d spatial region can't be part of a 2-d spatialregion) in OWL and expect them to be added to a version of therelation ontology some time in the near future. Thomas Bittner isworking on a FOL encoding of the BFO at http://www.ifomis.uni-saarland.de/bfo/fol which is substantially more detailed than any ofthe current OWL representations.

There are other computational systems that are candidates for doingsuch inferences. I'm particularly interested in OWL because it hasthe widest adoption and hence work in it has a higher chance, IMO, ofbeing used by people.

I don't think we can make due with core RDF features
Neither do I; just not enthusiastic about reimplementing corefeatures...

I think there is a lot of mileage we can get out of OWL, whichextends RDF. Use of OWL has the dual benefit of saving us work, andhelping the OWL people advance the state of their tools because theyhave realistic use cases. Use of OWL is not without problems - thereasoning techniques don't scale to anything near the size of thedatabase we've created. OTOH there is ongoing research to addressthis and now they have a target (and they are very interested intackling it). One area I am watching is the DL-Lite work, whichoffers some level of reasoning in a way that can be implemented inrelational databases. I'm also aware (but haven't yet tried) theupcoming Oracle RDF store that implements a subset of OWL, the OWLIMsystem, and interest at Openlink in adding further inferencetechniques. I'm sure there are others that I'm not aware of.

In the mean time, the approach we tooks for the demo was to add someability to query over inferred knowledge by precomputing specificpieces of information which we knew would be useful for some of thequeries we wanted to do, for example the part_of relations in the GO.


-Alan

Re: Advancing translational research with the Semantic Web

Reply via email to