Re: NeuronDB RDF and OWL

Alan Ruttenberg Wed, 14 Mar 2007 17:39:14 -0800


On Mar 14, 2007, at 4:44 PM, Kashyap, Vipul wrote:

Alan,
You have proposed some modeling suggestions and of course alignmentwith the OBO
relations ontology.
Other than expressing the semantics of these classes precisely, itwill be great
if you and someone in this group could identify the potential impact
of these modeling choices on:
- Enabling different types of integration that were not feasiblebefore

I think at the moment I am more concerned about data integration thannovel inferences, although I do expect a number of inferencedemonstrations. I view the comments I'm providing as a way to dealwith some integration problems before they arise, but I think it willbe better shown once we start looking at specific queries.

The semantics, however, are somewhat more important, particularlysuch things as clearly defining classes, distinguishing part of, isa, and derives from, etc. Whenever they are mixed up we will getsome wrong answers when we questions using these relations.

Put another way, the goal might be stated as wanting to get both*all* available answers to our questions, and *only* correct answersto our questions, and both the above contribute to achieving that goal.

Regarding this sort of integration not being feasible before, I'dstay away from that argument. I do hope to show that, as a matter offact, this sort of integration is rarely done, that it is possible todo better with an acceptable level of effort, and that both thesemantic web tools and ethos help make it easier and more fruitful.

A small example of this was illustrated yesterday in the discussionabout dart grid. We were looking at mapping a column that recordedgender as a text field with either the character "M" or "F". Nowtypically, this is a distinction we wish to make in our ontologies,and we would generally have a class (ideally the same class acrossontologies) to capture this distinction. In a standard object-relational model, one could make M and F instead "object" by having asecond table, and a foreign key to that table to record the gender.But no one does that because it seems "overkill" - the queries aremore painful, the computational overhead is more, etc. But RDF or OWLthis kind of thing is (or should be) common practice, we incur nopenalty, and having it in this form makes it more straightforward tointegrate across independently constructed ontologies - sameas,subclass, equivalent class all provide standard ways of making theconnection. Compare this to the effort to merge two relationalschemas, where gender columns are used in various tables, nameddifferently, and where one database uses "M" and "F" and the otheruses "Male" and "Female".

- Enabling different types of inferences which would enable furtherintegration
not possible before.

I don't think I have said, or want to say, that integration beforewas not possible. However, I note that in fact it is has not beendone in a usable way for many of the resources we realistically wouldwant to use to ask questions about our scientific use case. There area number of reasons for this, some of which our use of semantic webtechnologies speak to. For example, that there is a shared standardand working tools based on it means that efforts to integrate can bebuilt on by others, which offers more bang for your buck, so tospeak, an important consideration when deciding to devote the notinsubstantial effort necessary to put resources in a form that makesit possible to effectively integrate them. Technically, the fact thatthere is less pain involved with schema extension and evolution whenusing OWL/RDF then when using traditional RDMS table oriented schemareduces the effort to integrate a large number of sources.

Alternatively, for the purpose of the demo, one could just do ashallow alignment so that different data sets can be integrated.

We will do what's necessary. But at this point, since people havevolunteered to own the translation of certain data sources, and sinceone of our goals is to explore and learn, I've been trying to get usfurther than we would be with this approach. There have been previousdemonstrations of this sort of shallow alignment, and from the pointof view of showing something novel, it would be nice to go beyondthat. Given what's been done so far, and the responses I've seen tothe analysis and suggestions people have been offering, I'm feelingoptimistic.


Best,
Alan

Re: NeuronDB RDF and OWL

Reply via email to