On Sat, 8 Jul 2006, William Bug wrote:
Dear Philip,
Many thanks for this concise and accessible qualification to Chimezie's
explanation. I was a little crest-fallen when I saw his original answer to
Trish, and thought I really had misunderstood an issue that is becoming of
very significant importance to several projects with which I'm involved.
There have been several debates recently in the neuroinformatics community as
to whether an XML-only (XML, XSD, XSLT, XLink) will suffice when creating
creating sub-domain knowledge resources - especially if you are just
collecting terminologies, as opposed to creating a full-blown, well-founded
ontology. Whether it really isn't necessary to go to Semantic Web tech -
i.e., the constellation of RDF-associated specs (RDF++ - sorry to add to the
acronym soup - this is just a shorthand for this email) and the growing
number of utilities for manipulating RDF/OWL and all the other RDF-related
formalisms.
Here is the crux of the issue. I think there is a misunderstanding of my
original response. In suggesting that XSLT makes such a transformation
relatively painless (from an established XML format to one or more RDF
representations), I wasn't suggesting this as an argument *for* XML-only representation but as a
consideration that shouldn't be disregarded. I think one of the biggest
misconceptions people who debate whether to go for XML-only solutions
versus RDF++ (as you put it) is that the two technologies are mutually
exclusive - which the ability to write such XSLT transforms shows is
not the case at all. Afterall, XML *is* in the semantic web stack and for good reason as well.
I think too much time is often invested in comparison and contrast of two
representation languages that each address a different set of problems
rather than in focusing on asking the more important question of what the
requirements for representation are:
1) Is the data you wish to represent subject to lots of interpretation?
2) Is uniform syntax more important than semantics?
3) Is the domain being modelled subject to expansion in a semantic way?
4) What is the nature of systems with which interoperability is important
etc..
I think a handful of your points below fall more along the line of direct
comparison and contrast that I don't think is as useful for answering the
questions the neuroinformatic community may be grappling than focusing on
what are the specific problems being solved and what are the short and
long term requirements / goals for representation.
Cross-technological debate with
well established trenches often do very little to answer the original
questions but only further misconceptions - which is why the subject of
this post (XML vs RDF) concerns me.
Both representation languages bring with them a set of well established
tools that become readily available once you express your content with
them and you have more to gain in leveraging dual-representation between
both (where it's feasible - I agree with the qualification of the use of
XSLT that emphasizes that it's contingent on having a well defined mapping
in the first place) via XSLT.
Consider for instance XForms (which we are
using quite heavily for instance data entry). XForms is an XML dialect
that addresses specific and well known pitfalls with legacy brower-based
user interface dialects and does so in a *very* powerful and promising
way. If a dynamic, expressive means of data entry is an important
requirement for you data (as it is in our case) then you already have
a good argument for having representation in XML for which there is no
equivalent alternative in an RDF++ only approach. The main difficulty is
that with forms-based user interfaces uniform syntax and declarative
structure is of more concern than semantics. I've chatted about this
before, see this thread:
http://www.dehora.net/journal/2005/08/automated_mapping_between_rdf_and_forms_part_i.html
Ofcourse, you don't get your lunch for free and the price for leveraging
uniform syntactical representation in order to simplify your use of
forms for data entry is the effort up front in devising a mapping that
provides the level of semantic grounding (if you will) sufficient for your
needs and express such a mapping in an XSLT transform.
driver behind the creation of RDF++. You'll have a lot more code to write
and maintain, if you don't take advantage of Semantic Web tech.
This depends more on what it is you are trying to achieve with
representation than by the technologies by themselves, so I
don't agree with this very broad assesment.
6) We can leave it to others to create XSLT converters to
move the XML-only resources into the RDF++ space
Philip & Chris M. have both given clear answers to this
ill-advised use of XSLT.
I don't see how use of XSLT in this way can be considered 'ill-advised'
and I don't think that was the point. The issue is that a neccessary
prerequisite for using XSLT in this way is a well defined
mapping (if such a mapping exists) to begin with. Once you have a well
established mapping, XSLT *does* render the remaining mechanics a
non-issue and it's for this particular reason that I think diregarding
such a possibility is more ill-advised, especially if there is already
a large and valuable body of existing XML content - this is precisely one
of the main motivations for technologies such as GRDDL.
The other issue Eric N. has described clearly is
the N**2 problem - the combinatorial proliferation of XSLTs as more XSDs are
added to the mix.
Once again, a misunderstanding of what I was suggesting. The ability to
use XSLT in such a fashion isn't an endorsement to XML-only
representation solutions but as an effective way to leverage dual
representation where there is value to do so.
9) Proponents of RDF++ argue that XML has limited semantic
expressivity, but that's just not true.
I think this argument is completely inverted. The problem is
XML has nearly unlimited expressivity, but any semantic meaning you want to
imbue your XML with must be made explicit in the parsers you write.
An XML parser interprets at the syntactic level (not at the semantic
level). Semantic mapping from XML dialects typically occurs directly via
XSLT (written perhaps by those familar with the XML schema) to RDF or by
other more novel means. See:
http://copia.ogbuji.net/blog/2006-04-03/_Semantic_
Ofcourse, such mappings will not be sufficient if your original needs for
representation go above and beyond what XML provides (with regards to
semantic expressiveness), but it's worth noting that there *is* a spectrum
of oppurtunity between both technologies.
I) if you try to perform semantically-based KE/KR/KD with XML-only,
you will have a lot more code to write & maintain YOURSELF - and much of it
will reproduce what you'd get automatically using RDF++.
XML was never meant to address Knoweledge Representation and attempts to
use it in such a fashion is the fault of the author not the technology
being misused.
II)
You just can't provide the flexibility, guaranteed resolvability of
resources, and efficient expression required when representing semantic
relations in the rigid, strictly hierarchical document-oriented world of
XML-only, so you'll likely fall short on a lot of your requirements.
Only with those requirements that have more to do with KR and ubiquitous
semantics than uniform, interoperable syntax. Once again, the more
constructive questions are about the nature of the requirements not the
two technologies by themselves - there *is* always a context with their
use.
Ask yourself why message protocols such as REST / POX and Web Services
are expressed in XML and not in RDF. Ask yourself why the same is true of user interface
dialects (such as XHTML and it's derivatives - XForms), syndication formats, etc.. and perhaps the
value of context and the nature of the problem being solved becomes more
evident.
Polarizing comparison and contrast of both ends of the representation
strata does more harm than good to both technologies and the more
constructive questions should *first* be about what the requirements for
representation are.
I'd really appreciate hearing the views both pro & con on these issues from
others on this list.
Thanks again, Philip, for your lucid and concise explanation.
Cheers,
Bill
On Jul 7, 2006, at 6:35 AM, Phillip Lord wrote:
"TW" == Trish Whetzel <[EMAIL PROTECTED]> writes:
TW> Hi all,
TW> As a terribly simple question, is it possible to take the actual
TW> FuGE-ML that is generated on a per instance reporting of an
TW> experiment/study/investigation and then convert than to RDF for
TW> use with semantic web technologies?
Converting between one syntax and another is fairly simple, and there
are some reasonably tools for it. XSLT would work for converting XML
into RDF. I wouldn't like to use it for converting the other way
(actually I wouldn't like to use it at all, but this is personal
prejudice!).
This is assuming, however, that the semantics of the two
representations are compatible. To give an example, syntactically it
is possible to convert between the GO DAG and an OWL representation of
GO. However, the GO part-of relationship doesn't distinguish
universal and existential, while OWL forces you to make this
distinction; you can't sit on the fence.
So, the simple answer to a simple question is: it depends. I wouldn't
assume that FuGE-ML will be convertible into a given
ontology or representation in RDF, unless a reasonable amount of care
is taken in the design of FuGE-ML or the ontology to ensure that it
can happen.
Course, you could always hack it with some rules and a bit of human
intervention. That works as well.
Cheers
Phil
Bill Bug
Senior Analyst/Ontological Engineer
Laboratory for Bioimaging & Anatomical Informatics
www.neuroterrain.org
Department of Neurobiology & Anatomy
Drexel University College of Medicine
2900 Queen Lane
Philadelphia, PA 19129
215 991 8430 (ph)
610 457 0443 (mobile)
215 843 9367 (fax)
Please Note: I now have a new email - [EMAIL PROTECTED]
Chimezie Ogbuji
Lead Systems Analyst
Thoracic and Cardiovascular Surgery
Cleveland Clinic Foundation
9500 Euclid Avenue/ W26
Cleveland, Ohio 44195
Office: (216)444-8593
[EMAIL PROTECTED]
This email and any accompanying attachments are confidential.This information
is intended solely for the use of the individualto whom it is addressed. Any
review, disclosure, copying,distribution, or use of this email communication
by others is strictlyprohibited. If you are not the intended recipient please
notify usimmediately by returning this message to the sender and deleteall
copies. Thank you for your cooperation.