Re: XML vs. RDF

Chimezie Ogbuji Sat, 08 Jul 2006 05:26:08 -0700


On Sat, 8 Jul 2006, William Bug wrote:

Dear Philip,
Many thanks for this concise and accessible qualification to Chimezie'sexplanation. I was a little crest-fallen when I saw his original answer toTrish, and thought I really had misunderstood an issue that is becoming ofvery significant importance to several projects with which I'm involved.
There have been several debates recently in the neuroinformatics community asto whether an XML-only (XML, XSD, XSLT, XLink) will suffice when creatingcreating sub-domain knowledge resources - especially if you are justcollecting terminologies, as opposed to creating a full-blown, well-foundedontology. Whether it really isn't necessary to go to Semantic Web tech -i.e., the constellation of RDF-associated specs (RDF++ - sorry to add to theacronym soup - this is just a shorthand for this email) and the growingnumber of utilities for manipulating RDF/OWL and all the other RDF-relatedformalisms.

Here is the crux of the issue. I think there is a misunderstanding of myoriginal response. In suggesting that XSLT makes such a transformationrelatively painless (from an established XML format to one or more RDFrepresentations), I wasn't suggesting this as an argument *for* XML-only representation but as aconsideration that shouldn't be disregarded. I think one of the biggestmisconceptions people who debate whether to go for XML-only solutionsversus RDF++ (as you put it) is that the two technologies are mutuallyexclusive - which the ability to write such XSLT transforms shows isnot the case at all. Afterall, XML *is* in the semantic web stack and for good reason as well.

I think too much time is often invested in comparison and contrast of tworepresentation languages that each address a different set of problemsrather than in focusing on asking the more important question of what therequirements for representation are:


1) Is the data you wish to represent subject to lots of interpretation?
2) Is uniform syntax more important than semantics?
3) Is the domain being modelled subject to expansion in a semantic way?
4) What is the nature of systems with which interoperability is important

etc..

I think a handful of your points below fall more along the line of directcomparison and contrast that I don't think is as useful for answering thequestions the neuroinformatic community may be grappling than focusing onwhat are the specific problems being solved and what are the short andlong term requirements / goals for representation.

Cross-technological debate withwell established trenches often do very little to answer the originalquestions but only further misconceptions - which is why the subject ofthis post (XML vs RDF) concerns me.

Both representation languages bring with them a set of well establishedtools that become readily available once you express your content withthem and you have more to gain in leveraging dual-representation betweenboth (where it's feasible - I agree with the qualification of the use ofXSLT that emphasizes that it's contingent on having a well defined mappingin the first place) via XSLT.

Consider for instance XForms (which we areusing quite heavily for instance data entry). XForms is an XML dialectthat addresses specific and well known pitfalls with legacy brower-baseduser interface dialects and does so in a *very* powerful and promisingway. If a dynamic, expressive means of data entry is an importantrequirement for you data (as it is in our case) then you already havea good argument for having representation in XML for which there is noequivalent alternative in an RDF++ only approach. The main difficulty isthat with forms-based user interfaces uniform syntax and declarativestructure is of more concern than semantics. I've chatted about thisbefore, see this thread:


http://www.dehora.net/journal/2005/08/automated_mapping_between_rdf_and_forms_part_i.html

Ofcourse, you don't get your lunch for free and the price for leveraginguniform syntactical representation in order to simplify your use offorms for data entry is the effort up front in devising a mapping thatprovides the level of semantic grounding (if you will) sufficient for yourneeds and express such a mapping in an XSLT transform.

driver behind the creation of RDF++. You'll have a lot more code to writeand maintain, if you don't take advantage of Semantic Web tech.

This depends more on what it is you are trying to achieve withrepresentation than by the technologies by themselves, so Idon't agree with this very broad assesment.

6) We can leave it to others to create XSLT converters tomove the XML-only resources into the RDF++ spacePhilip & Chris M. have both given clear answers to thisill-advised use of XSLT.

I don't see how use of XSLT in this way can be considered 'ill-advised'and I don't think that was the point. The issue is that a neccessaryprerequisite for using XSLT in this way is a well definedmapping (if such a mapping exists) to begin with. Once you have a wellestablished mapping, XSLT *does* render the remaining mechanics anon-issue and it's for this particular reason that I think diregardingsuch a possibility is more ill-advised, especially if there is alreadya large and valuable body of existing XML content - this is precisely oneof the main motivations for technologies such as GRDDL.

The other issue Eric N. has described clearly isthe N**2 problem - the combinatorial proliferation of XSLTs as more XSDs areadded to the mix.

Once again, a misunderstanding of what I was suggesting. The ability touse XSLT in such a fashion isn't an endorsement to XML-onlyrepresentation solutions but as an effective way to leverage dualrepresentation where there is value to do so.

9) Proponents of RDF++ argue that XML has limited semanticexpressivity, but that's just not true.I think this argument is completely inverted. The problem isXML has nearly unlimited expressivity, but any semantic meaning you want toimbue your XML with must be made explicit in the parsers you write.

An XML parser interprets at the syntactic level (not at the semanticlevel). Semantic mapping from XML dialects typically occurs directly viaXSLT (written perhaps by those familar with the XML schema) to RDF or byother more novel means. See:


http://copia.ogbuji.net/blog/2006-04-03/_Semantic_

Ofcourse, such mappings will not be sufficient if your original needs forrepresentation go above and beyond what XML provides (with regards tosemantic expressiveness), but it's worth noting that there *is* a spectrumof oppurtunity between both technologies.

I) if you try to perform semantically-based KE/KR/KD with XML-only,you will have a lot more code to write & maintain YOURSELF - and much of itwill reproduce what you'd get automatically using RDF++.

XML was never meant to address Knoweledge Representation and attempts touse it in such a fashion is the fault of the author not the technologybeing misused.

II)You just can't provide the flexibility, guaranteed resolvability ofresources, and efficient expression required when representing semanticrelations in the rigid, strictly hierarchical document-oriented world ofXML-only, so you'll likely fall short on a lot of your requirements.

Only with those requirements that have more to do with KR and ubiquitoussemantics than uniform, interoperable syntax. Once again, the moreconstructive questions are about the nature of the requirements not thetwo technologies by themselves - there *is* always a context with theiruse.

Ask yourself why message protocols such as REST / POX and Web Servicesare expressed in XML and not in RDF. Ask yourself why the same is true of user interfacedialects (such as XHTML and it's derivatives - XForms), syndication formats, etc.. and perhaps thevalue of context and the nature of the problem being solved becomes moreevident.

Polarizing comparison and contrast of both ends of the representationstrata does more harm than good to both technologies and the moreconstructive questions should *first* be about what the requirements forrepresentation are.

I'd really appreciate hearing the views both pro & con on these issues fromothers on this list.


Thanks again, Philip, for your lucid and concise explanation.

Cheers,
Bill

On Jul 7, 2006, at 6:35 AM, Phillip Lord wrote:

"TW" == Trish Whetzel <[EMAIL PROTECTED]> writes:


  TW> Hi all,

  TW> As a terribly simple question, is it possible to take the actual
  TW> FuGE-ML that is generated on a per instance reporting of an
  TW> experiment/study/investigation and then convert than to RDF for
  TW> use with semantic web technologies?


Converting between one syntax and another is fairly simple, and there
are some reasonably tools for it. XSLT would work for converting XML
into RDF. I wouldn't like to use it for converting the other way
(actually I wouldn't like to use it at all, but this is personal
prejudice!).

This is assuming, however, that the semantics of the two
representations are compatible. To give an example, syntactically it
is possible to convert between the GO DAG and an OWL representation of
GO. However, the GO part-of relationship doesn't distinguish
universal and existential, while OWL forces you to make this
distinction; you can't sit on the fence.

So, the simple answer to a simple question is: it depends. I wouldn't
assume that FuGE-ML will be convertible into a given
ontology or representation in RDF, unless a reasonable amount of care
is taken in the design of FuGE-ML or the ontology to ensure that it
can happen.

Course, you could always hack it with some rules and a bit of human
intervention. That works as well.

Cheers

Phil


Bill Bug
Senior Analyst/Ontological Engineer

Laboratory for Bioimaging  & Anatomical Informatics
www.neuroterrain.org
Department of Neurobiology & Anatomy
Drexel University College of Medicine
2900 Queen Lane
Philadelphia, PA    19129
215 991 8430 (ph)
610 457 0443 (mobile)
215 843 9367 (fax)


Please Note: I now have a new email - [EMAIL PROTECTED]


Chimezie Ogbuji
Lead Systems Analyst
Thoracic and Cardiovascular Surgery
Cleveland Clinic Foundation
9500 Euclid Avenue/ W26
Cleveland, Ohio 44195
Office: (216)444-8593
[EMAIL PROTECTED]

This email and any accompanying attachments are confidential.This informationis intended solely for the use of the individualto whom it is addressed. Anyreview, disclosure, copying,distribution, or use of this email communicationby others is strictlyprohibited. If you are not the intended recipient pleasenotify usimmediately by returning this message to the sender and deleteallcopies. Thank you for your cooperation.

Re: XML vs. RDF

Reply via email to